18 January 2010 Detection of figure and caption pairs based on disorder measurements
Claudie Faure, Nicole Vincent
Proceedings Volume 7534, Document Recognition and Retrieval XVII; 75340S (2010) https://doi.org/10.1117/12.838592
Event: IS&T/SPIE Electronic Imaging, 2010, San Jose, California, United States
Abstract
Figures inserted in documents convey a kind of information for which the visual modality is more appropriate than text. A complete understanding of a figure often requires reading its caption, or relating the figure to the main text through a numbered figure identifier that appears in both the caption and the main text. A figure and its caption are closely related; together they constitute a single multimodal component (an FC-pair) that Document Image Analysis cannot extract with text and graphics segmentation alone. We propose a method that goes beyond graphics and text segmentation in order to extract FC-pairs without performing a full labelling of the page components. Horizontal and vertical text lines are detected in the pages. The graphics are associated with selected text lines to initialise the FC-pair detector. Spatial and visual disorders are introduced to define a layout model in terms of properties. This model makes it possible to cope with most of the numerous spatial arrangements of graphics and text lines. The FC-pair detector performs operations to eliminate the layout disorder and assigns a quality value to each FC-pair. The processed documents were collected from medic@, the digital historical collection of the BIUM (Bibliothèque InterUniversitaire Médicale). A first set of 98 pages constitutes the design set; a further 298 pages were then collected to evaluate the system. The reported performance is that of the full process, from the binarisation of the digital images to the detection of FC-pairs.
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Claudie Faure and Nicole Vincent "Detection of figure and caption pairs based on disorder measurements", Proc. SPIE 7534, Document Recognition and Retrieval XVII, 75340S (18 January 2010); https://doi.org/10.1117/12.838592
CITATIONS
Cited by 2 patents.
KEYWORDS
Visualization
Image segmentation
Image analysis
Sensors
Databases
Document image analysis
Image processing