Automatic error detection in alignments for speech synthesis

Barnard, E; Davel, M

Automatic error detection in alignments for speech synthesis

http://hdl.handle.net/10204/1044

Abstract:

The phonetic segmentation of recorded speech is a crucial factor in the quality of concatenative systems for speech synthesis. The authors describe a likelihood-based error detection process that can be used to flag possible errors in such a segmentation, with a view towards manual correction. It is shown that this process can be used to assist in the creation of high-accuracy segmentations. In particular, for an isiZulu corpus used in the creation of a unit-selection synthesizer, almost half of the errors that existed in a manual segmentation were detected by this process, while flagging less than a quarter of all segments. Different phoneme classes are handled with differing amounts of success, with vowels being the most troublesome

Reference:

Barnard, E and Davel, M. 2006. Automatic error detection in alignments for speech synthesis. 17th Annual Symposium of the Pattern Recognition Association of South Africa, Parys, South Africa, 29 Nov - 1 Dec 2006, pp 4

Barnard, E., & Davel, M. (2006). Automatic error detection in alignments for speech synthesis. http://hdl.handle.net/10204/1044

Barnard, E, and M Davel. "Automatic error detection in alignments for speech synthesis." (2006): http://hdl.handle.net/10204/1044

Barnard E, Davel M, Automatic error detection in alignments for speech synthesis; 2006. http://hdl.handle.net/10204/1044 .

Download RIS

Barnard, E
Davel, M

Nov 2006

Speech synthesizer
Phonetic segmentation
Error detection
Unit-selection synthesizer

Show full item record

Files in this item

Barnard_2006.pdf

This item appears in the following Collection(s)

Conference Publications

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

Automatic error detection in alignments for speech synthesis

Automatic error detection in alignments for speech synthesis

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect