Human language technology (HLT) has been identified as a priority area by the South African government. However, despite efforts by government and the research and development (R&D) community, South Africa has not yet been able to maximise the opportunities of HLT and create a thriving HLT industry. One key challenge is that there is insufficient codified knowledge about the current South African HLT components, their attributes and existing relationships. Hence a technology audit was conducted for the South African HLT landscape, to create a systematic and detailed inventory of the status of the HLT components across the eleven official languages. Based on the Basic Language Resource Kit (BLaRK) framework (Krauwer, 1998), we used various data collection methods (such as focus groups, questionnaires and personal consultations with HLT experts) to gather detailed information. The South African HLT landscape is analysed using a number of complementary approaches and, based on the interpretation of the results, recommendations are made on how to accelerate HLT development in South Africa, as well as on how to conduct similar audits in other countries and contexts.
Reference:
Grover, AS, Van Huyssteen, GB, and Pretorius, MW. 2011. South African human language technology audit. Language Resources and Evaluation, Vol. 45, pp. 271-288
Grover, A., Van Huyssteen, G., & Pretorius, M. (2011). South African human language technology audit. http://hdl.handle.net/10204/5154
Copyright: 2011 Springer. This is a preprint version of the work. The definitive version is published in Language Resources and Evaluation, Vol. 45, pp. 271-288.
Author: Peché, M; Davel, MH; Barnard, E. Date: Dec 2009. This article introduces the first Spoken Language Identification system developed to distinguish among all eleven of South Africa’s official languages. The PPR-LM (Parallel Phoneme Recognition followed by Language Modeling) architecture is ...
Author: Sharma Grover, A; Calteaux, Karen V; Van Huyssteen, G; Pretorius, M. Date: Oct 2010. South Africa is one of the few countries in the world that boasts a large number of official languages. Due to the efforts of the government and the local research and development (R&D) community all the official languages are enabled with ...
Author: Grover, AS; Van Huyssteen, GB; Pretorius, MW. Date: May 2010. Human language technologies (HLT) can play a vital role in bridging the digital divide and thus the HLT field has been recognised as a priority area by the South African government. The authors present the work on conducting a technology audit ...