Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

tesseract-ocr

Compare

  Analyzed 11 months ago

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text, ALTO ... [More] , hOCR or PDF. Tesseract can read most common image formats. Since 2020 the Internet Archive uses Tesseract to get text for its scanned documents. [Less]

4.05M lines of code

49 current contributors

12 months since last commit

17 users on Open Hub

Activity Not Available
3.5
   
I Use This

OCRopus

Compare

  No analysis available

OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities. The OCRopus engine is based on two research projects: a high-performance handwriting ... [More] recognizer developed in the mid-90s and deployed by the US Census bureau, and novel high-performance layout analysis methods. OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications. [Less]

0 lines of code

0 current contributors

0 since last commit

3 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: apache_2

Cuneiform Linux

Compare

  No analysis available

Cuneiform is an OCR system originally developed and open sourced by Cognitive technologies. This project aims to create a fully portable version of Cuneiform.

0 lines of code

0 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: gpl

Optical Character Recognition (GOCR)

Compare

  Analyzed 11 months ago

This is a command line based optical character recognition program.

56.4K lines of code

0 current contributors

almost 18 years since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This

PaddleOCR

Compare

  Analyzed 11 months ago

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

88.1K lines of code

0 current contributors

12 months since last commit

0 users on Open Hub

Activity Not Available
0.0
 
I Use This

EasyOCR

Compare

  Analyzed 11 months ago

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

14K lines of code

0 current contributors

over 1 year since last commit

0 users on Open Hub

Activity Not Available
0.0
 
I Use This