The deployment of speech technology systems in the developing world is often hampered by the lack of appropriate linguistic resources. A suitable pronunciation dictionary is one such resource that can be difficult to obtain for lesser-resourced languages. Researchers design a process for the development of pronunciation dictionaries in resource-scarce environments, and apply this to the development of pronunciation dictionaries for ten of the official languages of South Africa. The paper defines the semiautomated development and verification process in detail and discusses practicalities, outcomes and lessons learnt. It further analyse the accuracy of the developed dictionaries and demonstrate how the distribution of rules generated from the dictionaries provides insight into the inherent predictability of the languages studied.
Reference:
Davel, M and Martirosian, O. 2009. Pronunciation dictionary development in resource-scarce environments. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009). Brighton, UK, 6-10 September, 2009. pp 2851-2854
Davel, M., & Martirosian, O. (2009). Pronunciation dictionary development in resource-scarce environments. International Speech Communication Association. http://hdl.handle.net/10204/3620
Davel, M, and O Martirosian. "Pronunciation dictionary development in resource-scarce environments." (2009): http://hdl.handle.net/10204/3620
Davel M, Martirosian O, Pronunciation dictionary development in resource-scarce environments; International Speech Communication Association; 2009. http://hdl.handle.net/10204/3620 .
Copyright: 2009 ISCA. This is the author's version of the work. It is posted here by permission of ISCA for your personal use. Not for redistribution. The definitive version was published in the Conference Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009). Brighton, UK, 6-10 September 2009, pp 2851-2854
Author:Davel, MH; Barnard, EDate:Sep 2005Bootstrapping techniques are an efficient way to develop electronic pronunciation dictionaries, but require fast system response to be practical for medium-to-large lexicons. In addition, user errors are inevitable during this process, and ...Read more
Author:Martirosian, O; Davel, MDate:Nov 2007The authors explore pattern recognition techniques for verifying the correctness of a pronunciation lexicon, focusing on techniques that require limited human interaction. They evaluate the British English Example Pronunciation (BEEP) dictionary ...Read more
Author:Cooper, Antony K; Majeke, BDate:Apr 2008This project was to develop guidelines for spatial data content standards in Africa. It focused on standards for data dictionaries, feature catalogues and classification, and also looked at standards for feature instances. The project ...Read morecb