Hidden Markov Model (HMM)-based synthesis in combination with speaker adaptation has proven to be an approach that is well-suited for child speech synthesis. This paper describes the development and evaluation of different HMM-based child speech synthesis systems. The aim is to determine the most suitable combination of initial model and speaker adaptation techniques to synthesize child speech. The results of the study indicate that gender-independent initial models perform better than gender-dependent initial models and Constrained Structural Maximum a Posteriori Linear Regression (CSMAPLR) followed by maximum a posteriori (MAP) is the speaker adaptation technique combination that yields the most natural and intelligible synthesized child speech.
Reference:
Govender, A., De Wet, F. and Tapamo, J. 2015. HMM adaptation for child speech synthesis. 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), 6 - 10 September 2015, Dresden, Germany, pp 1640-1644.
Govender, A., De Wet, F., & Tapamo, J. (2015). HMM Adaptation for child speech synthesis. Technische Universität Berlin. http://hdl.handle.net/10204/9529
Govender, Avashna, Febe De Wet, and Jules-Raymond Tapamo. "HMM Adaptation for child speech synthesis." (2015): http://hdl.handle.net/10204/9529
Govender A, De Wet F, Tapamo J, HMM Adaptation for child speech synthesis; Technische Universität Berlin; 2015. http://hdl.handle.net/10204/9529 .
Copyright: 2015 Technische Universität Berlin. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, kindly consult the publisher's website.