Afrikaans is one of the eleven official languages of South Africa. It is classified as an under-resourced language. No annotated broadband speech corpora currently exist for Afrikaans. This article reports on the development of speech resources for Afrikaans, specifically a broadband speech corpus and an extended pronunciation dictionary. Baseline results for an ASR system that was built using these resources are also presented. In addition, the article suggests different strategies to exploit the close relationship between Afrikaans and Dutch for the purposes of technology development.
Reference:
De Wet, F, De Waal, A and Van Huyssteen, GB. 2011. Developing a broadband automatic speech recognition system for Afrikaans. 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, 27-31 August 2011. http://hdl.handle.net/10204/5478