History of the Tesseract OCR engine: what worked and what didn't

Ray W. Smith

doi:10.1117/12.2010051

4 February 2013

Proceedings Volume 8658, Document Recognition and Retrieval XX; 865802 (2013) https://doi.org/10.1117/12.2010051
Event: IS&T/SPIE Electronic Imaging, 2013, Burlingame, California, United States

Abstract

This paper describes the development history of the Tesseract OCR engine, and compares the methods to general changes in the field over a similar time period. Emphasis is placed on the lessons learned with the goal of providing a primer for those interested in OCR research.

Citation

Ray W. Smith "History of the Tesseract OCR engine: what worked and what didn't", Proc. SPIE 8658, Document Recognition and Retrieval XX, 865802 (4 February 2013); https://doi.org/10.1117/12.2010051

PROCEEDINGS
12 PAGES
