Paper
23 January 2012 Automatic indexing of scanned documents: a layout-based approach
Daniel Esser, Daniel Schuster, Klemens Muthmann, Michael Berger, Alexander Schill
Author Affiliations +
Proceedings Volume 8297, Document Recognition and Retrieval XIX; 82970H (2012) https://doi.org/10.1117/12.908542
Event: IS&T/SPIE Electronic Imaging, 2012, Burlingame, California, United States
Abstract
Archiving official written documents such as invoices, reminders and account statements in business and private area gets more and more important. Creating appropriate index entries for document archives like sender's name, creation date or document number is a tedious manual work. We present a novel approach to handle automatic indexing of documents based on generic positional extraction of index terms. For this purpose we apply the knowledge of document templates stored in a common full text search index to find index positions that were successfully extracted in the past.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Daniel Esser, Daniel Schuster, Klemens Muthmann, Michael Berger, and Alexander Schill "Automatic indexing of scanned documents: a layout-based approach", Proc. SPIE 8297, Document Recognition and Retrieval XIX, 82970H (23 January 2012); https://doi.org/10.1117/12.908542
Lens.org Logo
CITATIONS
Cited by 23 scholarly publications and 5 patents.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Optical character recognition

Detection and tracking algorithms

Distance measurement

Genetic algorithms

Image processing

Image segmentation

Back to Top