Paper
19 January 2009 Simultaneous detection of vertical and horizontal text lines based on perceptual organization
Claudie Faure, Nicole Vincent
Author Affiliations +
Proceedings Volume 7247, Document Recognition and Retrieval XVI; 72470M (2009) https://doi.org/10.1117/12.805504
Event: IS&T/SPIE Electronic Imaging, 2009, San Jose, California, United States
Abstract
A page of a document is a set of small components which are grouped by a human reader into higher level components, such as lines and text blocs. Document image analysis is aimed at detecting these components in document images. We propose the encoding of local information by considering the properties that determine perceptual grouping. Each connected component is labelled according to the location of its nearest neighbour connected component. These labelled components constitute the input of a rule-based incremental process. Vertical and horizontal text lines are detected without prior assumption on their direction. Touching characters belonging to different lines are detected early and discarded from the grouping process to avoid line merging. The tolerance for grouping components increases in the course of the process until the final decision. After each step of the grouping process, conflict resolution rules are activated. This work was motivated by the automatic detection of Figure&Caption pairs in the documents of the historical collection of the BIUM digital library (Bibliotheque InterUniversitaire Medicale). The images that were used in this study belong to this collection.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Claudie Faure and Nicole Vincent "Simultaneous detection of vertical and horizontal text lines based on perceptual organization", Proc. SPIE 7247, Document Recognition and Retrieval XVI, 72470M (19 January 2009); https://doi.org/10.1117/12.805504
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Digital libraries

Tolerancing

Document image analysis

Image segmentation

Lead

Computer programming

Back to Top