SemiBoost-based Arabic character recognition method

Bing Su; Liangrui Peng; Xiaoqing Ding

doi:10.1117/12.876622

24 January 2011 SemiBoost-based Arabic character recognition method

Bing Su, Liangrui Peng, Xiaoqing Ding

Proceedings Volume 7874, Document Recognition and Retrieval XVIII; 787409 (2011) https://doi.org/10.1117/12.876622
Event: IS&T/SPIE Electronic Imaging, 2011, San Francisco Airport, California, United States

Abstract

A SemiBoost-based character recognition method is introduced in order to incorporate the information of unlabeled practical samples in training stage. One of the key problems in semi-supervised learning is the criteria of unlabeled sample selection. In this paper, a criteria based on pair-wise sample similarity is adopted to guide the SemiBoost learning process. At each time of iteration, unlabeled examples are selected and assigned labels. The selected samples are used along with the original labeled samples to train a new classifier. The trained classifiers are integrated to make the final classfier. An empirical study on several Arabic similar character pairs with different similarities shows that the proposed method improves the performance as unlabeled samples reveal the distribution of practical samples.

Citation Download Citation

Bing Su, Liangrui Peng, and Xiaoqing Ding "SemiBoost-based Arabic character recognition method", Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 787409 (24 January 2011); https://doi.org/10.1117/12.876622

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available