Tags : Browse Projects

Cuneiform Linux

  Analyzed 26 days ago

Cuneiform is an OCR system originally developed and open-sourced by Cognitive Technologies. This project aims to create a fully portable version of Cuneiform.

321K lines of code

0 current contributors

almost 14 years since last commit

1 user on Open Hub

Inactive
0.0
 
OCRFeeder

  Analyzed 26 days ago

OCRFeeder is a document layout analysis and optical character recognition system. Given a set of images, it automatically outlines their contents, distinguishes between graphics and text, and performs OCR on the latter. It can generate multiple output formats, the main one being ODT. It features a complete GTK graphical user interface that allows users to correct any unrecognized characters, define or correct bounding boxes, set paragraph styles, clean the input images, import PDFs, save and load the project, export everything to multiple formats, etc. OCRFeeder was developed as the project for Joaquim Rocha's Master's thesis in Computer Science.

19K lines of code

0 current contributors

about 7 years since last commit

1 user on Open Hub

Inactive
5.0
 
Cuneiform-Qt

  Analyzed 5 months ago

Graphical interface for Cuneiform OCR

1.53K lines of code

1 current contributor

over 5 years since last commit

1 user on Open Hub

Activity Not Available
2.0
   
hocr-tools

  Analyzed 27 days ago

hOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information coexist in the same file and survive editing and manipulation. hOCR markup is independent of the presentation. There is a public specification for the hOCR format.
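As a rough illustration of how OCR data can sit inside ordinary HTML, the sketch below reads word bounding boxes out of a small hOCR-style fragment using only the Python standard library. The sample markup is invented for this example, and the helper names (parse_bbox, HocrWordParser, extract_words) are hypothetical and not part of hocr-tools. It assumes each word is a span with class ocrx_word whose title attribute carries a "bbox x0 y0 x1 y1" property, and it does not handle spans nested inside a word.

```python
from html.parser import HTMLParser

# Invented sample: a page, one line, and two words with bounding boxes.
SAMPLE = """
<div class='ocr_page' title='bbox 0 0 640 480'>
  <span class='ocr_line' title='bbox 10 20 300 50'>
    <span class='ocrx_word' title='bbox 10 20 90 50; x_wconf 96'>Hello</span>
    <span class='ocrx_word' title='bbox 100 20 200 50; x_wconf 91'>world</span>
  </span>
</div>
"""


def parse_bbox(title):
    """Return (x0, y0, x1, y1) from a title attribute, or None if absent."""
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 5 and parts[0] == "bbox":
            return tuple(int(p) for p in parts[1:])
    return None


class HocrWordParser(HTMLParser):
    """Collect (text, bbox) pairs from spans with class 'ocrx_word'."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._in_word = False
        self._bbox = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "span" and "ocrx_word" in (attrs.get("class") or "").split():
            self._in_word = True
            self._bbox = parse_bbox(attrs.get("title") or "")
            self._text = []

    def handle_data(self, data):
        # Only text that appears inside a word span is kept.
        if self._in_word:
            self._text.append(data)

    def handle_endtag(self, tag):
        # Simplification: the first closing span ends the current word.
        if tag == "span" and self._in_word:
            self.words.append(("".join(self._text).strip(), self._bbox))
            self._in_word = False


def extract_words(hocr):
    """Parse an hOCR string and return a list of (text, bbox) pairs."""
    parser = HocrWordParser()
    parser.feed(hocr)
    parser.close()
    return parser.words


if __name__ == "__main__":
    for text, bbox in extract_words(SAMPLE):
        print(text, bbox)
```

Because the recognized text remains plain HTML text, a browser would still render "Hello world" from this fragment, while the layout data stays in the title attributes.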

1.51K lines of code

4 current contributors

over 2 years since last commit

0 users on Open Hub

Inactive
5.0
 
tesserwrap

  Analyzed 26 days ago

Python bindings to the Tesseract API

980 lines of code

0 current contributors

over 10 years since last commit

0 users on Open Hub

Inactive
0.0
 