Open Access Paper
4 February 2013 History of the Tesseract OCR engine: what worked and what didn't
Proceedings Volume 8658, Document Recognition and Retrieval XX; 865802 (2013) https://doi.org/10.1117/12.2010051
Event: IS&T/SPIE Electronic Imaging, 2013, Burlingame, California, United States
Abstract
This paper describes the development history of the Tesseract OCR engine, and compares the methods to general changes in the field over a similar time period. Emphasis is placed on the lessons learned with the goal of providing a primer for those interested in OCR research.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ray W. Smith "History of the Tesseract OCR engine: what worked and what didn't", Proc. SPIE 8658, Document Recognition and Retrieval XX, 865802 (4 February 2013); https://doi.org/10.1117/12.2010051
CITATIONS
Cited by 22 scholarly publications.
KEYWORDS
Optical character recognition
Machine learning
Systems modeling
Statistical modeling
Associative arrays
Binary data
Data modeling
