Two-stage approach to keyword spotting in handwritten documents

Mehdi Haji; Mohammad R. Ameri; Tien D. Bui; Ching Y. Suen; Dominique Ponson

doi:10.1117/12.2042265

24 March 2014 Two-stage approach to keyword spotting in handwritten documents

Mehdi Haji, Mohammad R. Ameri, Tien D. Bui, Ching Y. Suen, Dominique Ponson

Proceedings Volume 9021, Document Recognition and Retrieval XXI; 90210P (2014) https://doi.org/10.1117/12.2042265
Event: IS&T/SPIE Electronic Imaging, 2014, San Francisco, California, United States

Abstract

Separation of keywords from non-keywords is the main problem in keyword spotting systems which has traditionally been approached by simplistic methods, such as thresholding of recognition scores. In this paper, we analyze this problem from a machine learning perspective, and we study several standard machine learning algorithms specifically in the context of non-keyword rejection. We propose a two-stage approach to keyword spotting and provide a theoretical analysis of the performance of the system which gives insights on how to design the classifier in order to maximize the overall performance in terms of F-measure.

Citation Download Citation

Mehdi Haji, Mohammad R. Ameri, Tien D. Bui, Ching Y. Suen, and Dominique Ponson "Two-stage approach to keyword spotting in handwritten documents", Proc. SPIE 9021, Document Recognition and Retrieval XXI, 90210P (24 March 2014); https://doi.org/10.1117/12.2042265

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available