Paper
30 September 2011 Lip reading using neural networks
Dhananjay Kalbande, Sanjivani Patil
Author Affiliations +
Proceedings Volume 8285, International Conference on Graphic and Image Processing (ICGIP 2011); 828519 (2011) https://doi.org/10.1117/12.913406
Event: 2011 International Conference on Graphic and Image Processing, 2011, Cairo, Egypt
Abstract
Computerized lip reading, or speech reading, is concerned with the difficult task of converting a video signal of a speaking person to written text. It has several applications like teaching deaf and dumb to speak and communicate effectively with the other people, its crime fighting potential and invariance to acoustic environment. We convert the video of the subject speaking vowels into images and then images are further selected manually for processing. However, several factors like fast speech, bad pronunciation, and poor illumination, movement of face, moustaches and beards make lip reading difficult. Contour tracking methods and Template matching are used for the extraction of lips from the face. K Nearest Neighbor algorithm is then used to classify the 'speaking' images and the 'silent' images. The sequence of images is then transformed into segments of utterances. Feature vector is calculated on each frame for all the segments and is stored in the database with properly labeled class. Character recognition is performed using modified KNN algorithm which assigns more weight to nearer neighbors. This paper reports the recognition of vowels using KNN algorithms
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Dhananjay Kalbande and Sanjivani Patil "Lip reading using neural networks", Proc. SPIE 8285, International Conference on Graphic and Image Processing (ICGIP 2011), 828519 (30 September 2011); https://doi.org/10.1117/12.913406
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Laser induced plasma spectroscopy

Detection and tracking algorithms

Image segmentation

Video

Databases

Neural networks

Optical character recognition

RELATED CONTENT

Non-Manhattan layout extraction algorithm
Proceedings of SPIE (March 21 2013)
A review on handwriting words recognition using OCR
Proceedings of SPIE (December 20 2021)
Neural networks for handwriting recognition
Proceedings of SPIE (September 16 1992)
Automatic document reading system for technical drawings
Proceedings of SPIE (December 18 2001)

Back to Top