Paper
8 February 2015 Ground truth model, tool, and dataset for layout analysis of historical documents
Kai Chen, Mathias Seuret, Hao Wei, Marcus Liwicki, Jean Hennebert, Rolf Ingold
Author Affiliations +
Proceedings Volume 9402, Document Recognition and Retrieval XXII; 940204 (2015) https://doi.org/10.1117/12.2075858
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of historical documents with complex layouts. The dataset is based on a generic model for ground-truth presentation of the complex layout structure of historical documents. For the purpose of extracting uniformly the document contents, our model defines five types of regions of interest: page, text block, text line, decoration, and comment. Unconstrained polygons are used to outline the regions. A performance metric is proposed in order to evaluate various page segmentation methods based on this model. We have analysed four state-of-the-art ground-truthing tools: TRUVIZ, GEDI, WebGT, and Aletheia. From this analysis, we conceptualized and developed Divadia, a new tool that overcomes some of the drawbacks of these tools, targeting the simplicity and the efficiency of the layout ground truthing process on historical document images. With Divadia, we have created a new public dataset. This dataset contains 120 pages from three historical document image collections of different styles and is made freely available to the scientific community for historical document layout analysis research.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Kai Chen, Mathias Seuret, Hao Wei, Marcus Liwicki, Jean Hennebert, and Rolf Ingold "Ground truth model, tool, and dataset for layout analysis of historical documents", Proc. SPIE 9402, Document Recognition and Retrieval XXII, 940204 (8 February 2015); https://doi.org/10.1117/12.2075858
Lens.org Logo
CITATIONS
Cited by 20 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Analytical research

Image segmentation

Data modeling

Document image analysis

Performance modeling

Java

Image quality

Back to Top