Paper
28 January 2008 Segmentation-based retrieval of document images from diverse collections
Author Affiliations +
Proceedings Volume 6815, Document Recognition and Retrieval XV; 68150L (2008) https://doi.org/10.1117/12.767295
Event: Electronic Imaging, 2008, San Jose, California, United States
Abstract
We describe a methodology for retrieving document images from large extremely diverse collections. First we perform content extraction, that is the location and measurement of regions containing handwriting, machine-printed text, photographs, blank space, etc, in documents represented as bilevel, greylevel, or color images. Recent experiments have shown that even modest per-pixel content classification accuracies can support usefully high recall and precision rates (of, e.g., 80-90%) for retrieval queries within document collections seeking pages that contain a fraction of a certain type of content. When the distribution of content and error rates are uniform across the entire collection, it is possible to derive IR measures from classification measures and vice versa. Our largest experiments to date, consisting of 80 training images totaling over 416 million pixels, are presented to illustrate these conclusions. This data set is more representative than previous experiments, containing a more balanced distribution of content types. Contained in this data set are also images of text obtained from handheld digital cameras and the success of existing methods (with no modification) in classifying these images with are discussed. Initial experiments in discriminating line art from the four classes mentioned above are also described. We also discuss methodological issues that affect both ground-truthing and evaluation measures.
© (2008) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Michael A. Moll and Henry S. Baird "Segmentation-based retrieval of document images from diverse collections", Proc. SPIE 6815, Document Recognition and Retrieval XV, 68150L (28 January 2008); https://doi.org/10.1117/12.767295
Lens.org Logo
CITATIONS
Cited by 16 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Photography

Image retrieval

Image segmentation

Image classification

Digital cameras

Feature extraction

Algorithm development

Back to Top