Paper
14 September 2010 Integrating hidden Markov model and PRAAT: a toolbox for robust automatic speech transcription
Author Affiliations +
Proceedings Volume 7745, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2010; 774513 (2010) https://doi.org/10.1117/12.872211
Event: Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2010, 2010, Wilga, Poland
Abstract
An automatic time-aligned phone transcription toolbox of English speech corpora has been developed. Especially the toolbox would be very useful to generate robust automatic transcription and able to produce phone level transcription using speaker independent models as well as speaker dependent models without manual intervention. The system is based on standard Hidden Markov Models (HMM) approach and it was successfully experimented over a large audiovisual speech corpus namely GRID corpus. One of the most powerful features of the toolbox is the increased flexibility in speech processing where the speech community would be able to import the automatic transcription generated by HMM Toolkit (HTK) into a popular transcription software, PRAAT, and vice-versa. The toolbox has been evaluated through statistical analysis on GRID data which shows that automatic transcription deviates by an average of 20 ms with respect to manual transcription.
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
A. Kabir, J. Barker, and M. Giurgiu "Integrating hidden Markov model and PRAAT: a toolbox for robust automatic speech transcription", Proc. SPIE 7745, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2010, 774513 (14 September 2010); https://doi.org/10.1117/12.872211
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Statistical analysis

Error analysis

Signal processing

Statistical modeling

Automatic alignment

Prototyping

RELATED CONTENT

Texel-based image classification with orthogonal bases
Proceedings of SPIE (April 29 2016)
Discriminatory power of handwriting
Proceedings of SPIE (December 18 2001)
Modeling the sample distribution for clustering OCR
Proceedings of SPIE (December 21 2000)
Time and space optimization of document content classifiers
Proceedings of SPIE (January 18 2010)

Back to Top