ResearchSpace

Part-of-speech effects on text-to-speech synthesis

Show simple item record

dc.contributor.author Schlunz, Georg I
dc.contributor.author Barnard, E
dc.contributor.author Van Huyssteen, GB
dc.date.accessioned 2010-12-14T13:27:24Z
dc.date.available 2010-12-14T13:27:24Z
dc.date.issued 2010-11
dc.identifier.citation Schlunz, GI, Barnard, E and Van Huyssteen, GB. 2010. Part-of-speech effects on text-to-speech synthesis. 21st Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, 22-23 November 2010, pp 257-262 en
dc.identifier.isbn 978-0-7992-2470-2
dc.identifier.uri http://hdl.handle.net/10204/4674
dc.description 21st Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, 22-23 November 2010 en
dc.description.abstract One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden markov model (HMM) based TTS voice when additional resources are not available to aid in the modelling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the additiion of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags. en
dc.language.iso en en
dc.publisher PRASA 2010 en
dc.relation.ispartofseries Conference Paper en
dc.subject Text-to-speech systems en
dc.subject Natural language processing en
dc.subject Part-of-speech en
dc.subject Hidden Markov model en
dc.subject PRASA 2010 en
dc.title Part-of-speech effects on text-to-speech synthesis en
dc.type Conference Presentation en
dc.identifier.apacitation Schlunz, G. I., Barnard, E., & Van Huyssteen, G. (2010). Part-of-speech effects on text-to-speech synthesis. PRASA 2010. http://hdl.handle.net/10204/4674 en_ZA
dc.identifier.chicagocitation Schlunz, Georg I, E Barnard, and GB Van Huyssteen. "Part-of-speech effects on text-to-speech synthesis." (2010): http://hdl.handle.net/10204/4674 en_ZA
dc.identifier.vancouvercitation Schlunz GI, Barnard E, Van Huyssteen G, Part-of-speech effects on text-to-speech synthesis; PRASA 2010; 2010. http://hdl.handle.net/10204/4674 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Schlunz, Georg I AU - Barnard, E AU - Van Huyssteen, GB AB - One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden markov model (HMM) based TTS voice when additional resources are not available to aid in the modelling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the additiion of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags. DA - 2010-11 DB - ResearchSpace DP - CSIR KW - Text-to-speech systems KW - Natural language processing KW - Part-of-speech KW - Hidden Markov model KW - PRASA 2010 LK - https://researchspace.csir.co.za PY - 2010 SM - 978-0-7992-2470-2 T1 - Part-of-speech effects on text-to-speech synthesis TI - Part-of-speech effects on text-to-speech synthesis UR - http://hdl.handle.net/10204/4674 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record