Paper
30 March 1995 Stochastic attribute grammar model of document production and its use in document image decoding
Philip A. Chou, Gary E. Kopec
Author Affiliations +
Proceedings Volume 2422, Document Recognition II; (1995) https://doi.org/10.1117/12.205842
Event: IS&T/SPIE's Symposium on Electronic Imaging: Science and Technology, 1995, San Jose, CA, United States
Abstract
Document image decoding (DID) refers to the process of document recognition within a communication theory framework. In this framework, a logical document structure is a message communicated by encoding the structure as an ideal image, transmitting the ideal image through a noisy channel, and decoding the degraded image into a logical structure as close to the original message as possible, on average. Thus document image decoding is document image recognition where the recognizer performs optimal reconstruction by explicitly modeling the source of logical structures, the encoding procedure, and the channel noise. In previous work, we modeled the source and encoder using probabilistic finite-state automata and transducers. In this paper, we generalize the source and encoder models using context-free attribute grammars. We employ these models in a document image decoder that uses a dynamic programming algorithm to minimize the probability of error between original and reconstructed structures. The dynamic programming algorithm is a generalization of the Cocke-Younger-Kasami parsing algorithm.
© (1995) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Philip A. Chou and Gary E. Kopec "Stochastic attribute grammar model of document production and its use in document image decoding", Proc. SPIE 2422, Document Recognition II, (30 March 1995); https://doi.org/10.1117/12.205842
Lens.org Logo
CITATIONS
Cited by 16 scholarly publications and 3 patents.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer programming

Detection and tracking algorithms

Stochastic processes

Communication theory

Image transmission

Raster graphics

Image processing

RELATED CONTENT

Script determination in document images
Proceedings of SPIE (March 30 1995)
Turbo recognition: a statistical approach to layout analysis
Proceedings of SPIE (December 21 2000)
A Microprogrammable Processor For Image Operations
Proceedings of SPIE (December 19 1985)
Comparison of connected-component algorithms
Proceedings of SPIE (August 27 1999)
Communication theory framework for document recognition
Proceedings of SPIE (March 23 1994)

Back to Top