ResearchSpace

HMM adaptation for child speech synthesis using ASR data

Show simple item record

dc.contributor.author Govender, Avashna
dc.contributor.author Nouhou, B
dc.contributor.author De Wet, Febe
dc.date.accessioned 2017-08-22T13:12:17Z
dc.date.available 2017-08-22T13:12:17Z
dc.date.issued 2015-11
dc.identifier.citation Govender, A., Nouhou, B and De Wet, F. 2015. HMM Adaptation for child speech synthesis using ASR data. Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 25-26 November 2015, Port Elizabeth, South Africa, pp 178-183. en_US
dc.identifier.uri http://ieeexplore.ieee.org/document/7359519/
dc.identifier.uri http://hdl.handle.net/10204/9487
dc.description Copyright: 2015 IEEE. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text, kindly consult the publisher's website. en_US
dc.description.abstract Acquiring large amounts of child speech data is a particularly difficult task. One could therefore consider the possibility to add existing corpora of child speech data to the severely limited resources that are available for developing child voices. This paper reports on a feasibility study that was conducted to determine whether it is possible to synthesize good quality child voices using child speech data that was recorded for automatic speech recognition (ASR) purposes. A text-to-speech system was implemented using hidden Markov model based synthesis since it has proven to be a technique that is less susceptible to imperfect data. The paper describes how data was selected from the ASR corpus to build various child voices. The voices were evaluated to determine whether the data selection methods yield acceptable results within the context of model adaptation for child speech synthesis. The results show that, if data is selected according to particular criteria, ASR data could be used to develop child voices that are comparable to voices that were built using speech data specifically recorded for speech synthesis purposes. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.relation.ispartofseries Worklist;16021
dc.subject Automatic speech recognition en_US
dc.subject ASR en_US
dc.subject Child speech data en_US
dc.subject Hidden Markov model en_US
dc.subject HMM en_US
dc.title HMM adaptation for child speech synthesis using ASR data en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Govender, A., Nouhou, B., & De Wet, F. (2015). HMM adaptation for child speech synthesis using ASR data. IEEE. http://hdl.handle.net/10204/9487 en_ZA
dc.identifier.chicagocitation Govender, Avashna, B Nouhou, and Febe De Wet. "HMM adaptation for child speech synthesis using ASR data." (2015): http://hdl.handle.net/10204/9487 en_ZA
dc.identifier.vancouvercitation Govender A, Nouhou B, De Wet F, HMM adaptation for child speech synthesis using ASR data; IEEE; 2015. http://hdl.handle.net/10204/9487 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Govender, Avashna AU - Nouhou, B AU - De Wet, Febe AB - Acquiring large amounts of child speech data is a particularly difficult task. One could therefore consider the possibility to add existing corpora of child speech data to the severely limited resources that are available for developing child voices. This paper reports on a feasibility study that was conducted to determine whether it is possible to synthesize good quality child voices using child speech data that was recorded for automatic speech recognition (ASR) purposes. A text-to-speech system was implemented using hidden Markov model based synthesis since it has proven to be a technique that is less susceptible to imperfect data. The paper describes how data was selected from the ASR corpus to build various child voices. The voices were evaluated to determine whether the data selection methods yield acceptable results within the context of model adaptation for child speech synthesis. The results show that, if data is selected according to particular criteria, ASR data could be used to develop child voices that are comparable to voices that were built using speech data specifically recorded for speech synthesis purposes. DA - 2015-11 DB - ResearchSpace DP - CSIR KW - Automatic speech recognition KW - ASR KW - Child speech data KW - Hidden Markov model KW - HMM LK - https://researchspace.csir.co.za PY - 2015 T1 - HMM adaptation for child speech synthesis using ASR data TI - HMM adaptation for child speech synthesis using ASR data UR - http://hdl.handle.net/10204/9487 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record