dc.contributor.author |
Schlunz, Georg I
|
|
dc.contributor.author |
Barnard, E
|
|
dc.contributor.author |
Van Huyssteen, GB
|
|
dc.date.accessioned |
2010-12-14T13:27:24Z |
|
dc.date.available |
2010-12-14T13:27:24Z |
|
dc.date.issued |
2010-11 |
|
dc.identifier.citation |
Schlunz, GI, Barnard, E and Van Huyssteen, GB. 2010. Part-of-speech effects on text-to-speech synthesis. 21st Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, 22-23 November 2010, pp 257-262 |
en |
dc.identifier.isbn |
978-0-7992-2470-2 |
|
dc.identifier.uri |
http://hdl.handle.net/10204/4674
|
|
dc.description |
21st Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, 22-23 November 2010 |
en |
dc.description.abstract |
One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end, various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden Markov model (HMM) based TTS voice when additional resources are not available to aid in the modelling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the addition of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags. |
en |
dc.language.iso |
en |
en |
dc.publisher |
PRASA 2010 |
en |
dc.relation.ispartofseries |
Conference Paper |
en |
dc.subject |
Text-to-speech systems |
en |
dc.subject |
Natural language processing |
en |
dc.subject |
Part-of-speech |
en |
dc.subject |
Hidden Markov model |
en |
dc.subject |
PRASA 2010 |
en |
dc.title |
Part-of-speech effects on text-to-speech synthesis |
en |
dc.type |
Conference Presentation |
en |
dc.identifier.apacitation |
Schlunz, G. I., Barnard, E., & Van Huyssteen, G. B. (2010). Part-of-speech effects on text-to-speech synthesis. PRASA 2010. http://hdl.handle.net/10204/4674 |
en_ZA |
dc.identifier.chicagocitation |
Schlunz, Georg I, E Barnard, and GB Van Huyssteen. "Part-of-speech effects on text-to-speech synthesis." (2010): http://hdl.handle.net/10204/4674 |
en_ZA |
dc.identifier.vancouvercitation |
Schlunz GI, Barnard E, Van Huyssteen GB. Part-of-speech effects on text-to-speech synthesis; PRASA 2010; 2010. http://hdl.handle.net/10204/4674 |
en_ZA |
dc.identifier.ris |
TY - Conference Presentation
AU - Schlunz, Georg I
AU - Barnard, E
AU - Van Huyssteen, GB
AB - One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end, various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden Markov model (HMM) based TTS voice when additional resources are not available to aid in the modelling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the addition of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags.
DA - 2010-11
DB - ResearchSpace
DP - CSIR
KW - Text-to-speech systems
KW - Natural language processing
KW - Part-of-speech
KW - Hidden Markov model
KW - PRASA 2010
LK - https://researchspace.csir.co.za
PY - 2010
SM - 978-0-7992-2470-2
T1 - Part-of-speech effects on text-to-speech synthesis
TI - Part-of-speech effects on text-to-speech synthesis
UR - http://hdl.handle.net/10204/4674
ER -
|
en_ZA |