Paper
24 January 2011 SemiBoost-based Arabic character recognition method
Author Affiliations +
Proceedings Volume 7874, Document Recognition and Retrieval XVIII; 787409 (2011) https://doi.org/10.1117/12.876622
Event: IS&T/SPIE Electronic Imaging, 2011, San Francisco Airport, California, United States
Abstract
A SemiBoost-based character recognition method is introduced in order to incorporate the information of unlabeled practical samples in training stage. One of the key problems in semi-supervised learning is the criteria of unlabeled sample selection. In this paper, a criteria based on pair-wise sample similarity is adopted to guide the SemiBoost learning process. At each time of iteration, unlabeled examples are selected and assigned labels. The selected samples are used along with the original labeled samples to train a new classifier. The trained classifiers are integrated to make the final classfier. An empirical study on several Arabic similar character pairs with different similarities shows that the proposed method improves the performance as unlabeled samples reveal the distribution of practical samples.
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Bing Su, Liangrui Peng, and Xiaoqing Ding "SemiBoost-based Arabic character recognition method", Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 787409 (24 January 2011); https://doi.org/10.1117/12.876622
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Image segmentation

Feature extraction

Image processing

Detection and tracking algorithms

Digital imaging

FDA class I medical device development

RELATED CONTENT


Back to Top