Paper
3 April 1997 Document matching on CCITT Group 4 compressed images
Jonathan J. Hull
Author Affiliations +
Proceedings Volume 3027, Document Recognition IV; (1997) https://doi.org/10.1117/12.270061
Event: Electronic Imaging '97, 1997, San Jose, CA, United States
Abstract
A method is proposed for detecting whether tow CCITT group 4 images were scanned from the same document. Features are extracted from rectangular patches of text and compared with a modified Hausdorff distance measure. Two images are said to be 'equivalent' if the Hausdorff measure finds that a specified number of features are located within a given distance of one another in both images. This paper explains the technique and presents experimental results that demonstrate its effectiveness.It is shown that features extracted from a one-inch square patch of image data provide better than 95 percent correct retrieval accuracy with no false positives on a database of 800 documents.
© (1997) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jonathan J. Hull "Document matching on CCITT Group 4 compressed images", Proc. SPIE 3027, Document Recognition IV, (3 April 1997); https://doi.org/10.1117/12.270061
Lens.org Logo
CITATIONS
Cited by 22 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Databases

Feature extraction

Distance measurement

Image compression

Chlorine

Error analysis

Binary data

Back to Top