Effort and accuracy during language resource generation: a pronunciation prediction case study

Davel, M; Barnard, E

http://hdl.handle.net/10204/844

Abstract:

When developing a language resource, there is generally a trade-off between the amount of effort invested in the resource creation process and the quality of the resulting resource. We argue that, in the developing world with its many resource-scarce languages, a ‘usable’ resource in multiple languages may be more valuable than a highly accurate resource for one language only. From this perspective we investigate the resource validation process (determining whether a resource is sufficiently accurate), using the creation of a pronunciation dictionary as a case study. We show that the amount of effort required to validate a 20,000-word pronunciation dictionary can be reduced substantially by employing appropriate computational tools, when compared to both a fully manual validation process and a competing automatic process.

Reference:

Davel, M and Barnard, E. 2006. Effort and accuracy during language resource generation: a pronunciation prediction case study. 17th Annual Symposium of the Pattern Recognition Association of South Africa, Parys, South Africa, 29 Nov - 1 Dec 2006, Pages: 4

This paper was later published in the SAIEE Africa Research Journal, Vol 98(4), pp 124-128

Date:

Nov 2006

Keywords:

Language resources
Language technologies
Speech technologies
Dictionary validation
