Paper
1 October 2011 Quality of Arabic utterances transformed using different residual prediction techniques
Rania Elmanfaloty, N. Korany, El-Sayed A. Youssef
Author Affiliations +
Proceedings Volume 8285, International Conference on Graphic and Image Processing (ICGIP 2011); 82854C (2011) https://doi.org/10.1117/12.913264
Event: 2011 International Conference on Graphic and Image Processing, 2011, Cairo, Egypt
Abstract
Voice conversion (VC) is a process which modifies the speech signal produced by one source speaker so that it sounds like another target speaker. In this paper the transformation is determined by using equal Arabic utterances from source and target speakers; these utterances are time-aligned using dynamic time warping algorithm. A conversion function based on Gaussian mixture model (GMM) is used for transforming the spectral envelope described by line spectral frequencies (LSF) and the residuals are converted using three residual prediction techniques. We also compare between these techniques in the conversion of some Arabic utterances. The quality of the transformed utterances is measured using subjective and objective evaluations.
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Rania Elmanfaloty, N. Korany, and El-Sayed A. Youssef "Quality of Arabic utterances transformed using different residual prediction techniques", Proc. SPIE 8285, International Conference on Graphic and Image Processing (ICGIP 2011), 82854C (1 October 2011); https://doi.org/10.1117/12.913264
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Signal to noise ratio

Molybdenum

Signal processing

Virtual colonoscopy

Data conversion

Data modeling

Detection and tracking algorithms

Back to Top