Part-of-speech effects on text-to-speech synthesis

Schlunz, Georg I; Barnard, E; Van Huyssteen, GB

dc.contributor.author	Schlunz, Georg I
dc.contributor.author	Barnard, E
dc.contributor.author	Van Huyssteen, GB
dc.date.accessioned	2010-12-14T13:27:24Z
dc.date.available	2010-12-14T13:27:24Z
dc.date.issued	2010-11
dc.identifier.citation	Schlunz, GI, Barnard, E and Van Huyssteen, GB. 2010. Part-of-speech effects on text-to-speech synthesis. 21st Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, 22-23 November 2010, pp 257-262	en
dc.identifier.isbn	978-0-7992-2470-2
dc.identifier.uri	http://hdl.handle.net/10204/4674
dc.description	21st Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, 22-23 November 2010	en
dc.description.abstract	One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden markov model (HMM) based TTS voice when additional resources are not available to aid in the modelling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the additiion of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags.	en
dc.language.iso	en	en
dc.publisher	PRASA 2010	en
dc.relation.ispartofseries	Conference Paper	en
dc.subject	Text-to-speech systems	en
dc.subject	Natural language processing	en
dc.subject	Part-of-speech	en
dc.subject	Hidden Markov model	en
dc.subject	PRASA 2010	en
dc.title	Part-of-speech effects on text-to-speech synthesis	en
dc.type	Conference Presentation	en
dc.identifier.apacitation	Schlunz, G. I., Barnard, E., & Van Huyssteen, G. (2010). Part-of-speech effects on text-to-speech synthesis. PRASA 2010. http://hdl.handle.net/10204/4674	en_ZA
dc.identifier.chicagocitation	Schlunz, Georg I, E Barnard, and GB Van Huyssteen. "Part-of-speech effects on text-to-speech synthesis." (2010): http://hdl.handle.net/10204/4674	en_ZA
dc.identifier.vancouvercitation	Schlunz GI, Barnard E, Van Huyssteen G, Part-of-speech effects on text-to-speech synthesis; PRASA 2010; 2010. http://hdl.handle.net/10204/4674 .	en_ZA
dc.identifier.ris	TY - Conference Presentation AU - Schlunz, Georg I AU - Barnard, E AU - Van Huyssteen, GB AB - One of the goals of text-to-speech (TTS) systems is to produce natural-sounding synthesised speech. Towards this end various natural language processing (NLP) tasks are performed to model the prosodic aspects of the TTS voice. One of the fundamental NLP tasks being used is the part-of-speech (POS) tagging of the words in the text. This paper investigates the effects of POS information on the naturalness of a hidden markov model (HMM) based TTS voice when additional resources are not available to aid in the modelling of prosody. It is found that, when a minimal feature set is used for the HMM context labels, the additiion of POS tags does improve the naturalness of the voice. However, the same effect can be accomplished by including segmental counting and positional information instead of the POS tags. DA - 2010-11 DB - ResearchSpace DP - CSIR KW - Text-to-speech systems KW - Natural language processing KW - Part-of-speech KW - Hidden Markov model KW - PRASA 2010 LK - https://researchspace.csir.co.za PY - 2010 SM - 978-0-7992-2470-2 T1 - Part-of-speech effects on text-to-speech synthesis TI - Part-of-speech effects on text-to-speech synthesis UR - http://hdl.handle.net/10204/4674 ER -	en_ZA

Files in this item

Name: Schlunz_2010.pdf

Size: 3.687Mb

Format: PDF

View/Open

This item appears in the following Collection(s)

Conference Publications

Show simple item record

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.