Script determination in document images

Larry Spitz

doi:10.1117/12.205831

30 March 1995

Proceedings Volume 2422, Document Recognition II; (1995) https://doi.org/10.1117/12.205831
Event: IS&T/SPIE's Symposium on Electronic Imaging: Science and Technology, 1995, San Jose, CA, United States

Abstract

We have developed techniques for distinguishing which language is represented in an image of text. This work is restricted to an important subset of the world's languages, using techniques that should be applicable across more comprehensive samples. The method first classifies the script into two broad classes: European and Asian. This classification is based on the spatial relationships of fiducial points related to the upward concavities in character structures. Script identification within the Asian class (Japanese, Chinese, Korean) is performed by analysis of the optical density distribution of the text images.
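
The two image features named in the abstract can be illustrated with a minimal sketch. The abstract does not define them precisely, so everything below is an assumption made for illustration: an upward concavity is taken to be a point where the gap between two ink runs on one scan line is closed by a single ink run on the line below, and optical density is taken to be the fraction of ink pixels in a region. The function names (runs, upward_concavities, optical_density) are hypothetical, and no decision thresholds are shown because the abstract gives none.

```python
from typing import List, Tuple

Bitmap = List[List[int]]  # assumed encoding: 1 = ink (black), 0 = background


def runs(row: List[int]) -> List[Tuple[int, int]]:
    """Return (start, end) column spans, end exclusive, of consecutive ink pixels."""
    spans: List[Tuple[int, int]] = []
    start = None
    for x, value in enumerate(row):
        if value and start is None:
            start = x
        elif not value and start is not None:
            spans.append((start, x))
            start = None
    if start is not None:
        spans.append((start, len(row)))
    return spans


def upward_concavities(img: Bitmap) -> List[Tuple[int, int]]:
    """Return (row, col) fiducial points under the assumed definition:
    a gap between two adjacent ink runs is bridged by one run on the next row."""
    points: List[Tuple[int, int]] = []
    for y in range(len(img) - 1):
        upper, lower = runs(img[y]), runs(img[y + 1])
        # Each pair of neighbouring runs on the upper row encloses one gap.
        for (_, e1), (s2, _) in zip(upper, upper[1:]):
            for ls, le in lower:
                # The lower run overlaps both upper runs, so it spans the gap.
                if ls < e1 and le > s2:
                    points.append((y + 1, (e1 + s2) // 2))
                    break
    return points


def optical_density(img: Bitmap) -> float:
    """Fraction of ink pixels in the region (assumed meaning of optical density)."""
    total = sum(len(row) for row in img)
    ink = sum(sum(row) for row in img)
    return ink / total if total else 0.0


# A tiny "U" shape: open at the top, closed at the bottom.
u_shape = [
    [1, 0, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
]
print(upward_concavities(u_shape))
print(optical_density(u_shape))
```

In a sketch like this, the vertical positions of the returned points would feed the European versus Asian decision, and the distribution of optical_density values over character-sized regions would feed the choice among Japanese, Chinese, and Korean. How the paper actually makes either decision is not stated in the abstract.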

Citation

Larry Spitz "Script determination in document images", Proc. SPIE 2422, Document Recognition II, (30 March 1995); https://doi.org/10.1117/12.205831
