A general-purpose isiZulu text-to-speech (TTS) system was developed, based on the “Multisyn” unit-selection approach supported by the Festival TTS toolkit. The development involved a number of challenges related to the interface between speech technology and linguistics – for example, choosing an appropriate set of phonetic units, producing reliable pronunciations, and developing appropriate cost functions for selecting and joining diphone units. The research should show how solutions were found for each of these challenges, and describe a number of other innovations (such as automated fault detection in manual alignments) that were introduced. Initial evaluations suggest that the synthesizer is usable by a wide spectrum of isiZulu speakers
Reference:
Louw, A, Davel, M and Barnard, E. 2007. General-purpose isiZulu speech synthesiser. The 14th International Conference of the African Language Association of Southern Africa (ALASA), Johannesburg, South Africa, July 2005, pp 15
Louw, A., Davel, M., & Barnard, E. (2005). General-purpose isiZulu speech synthesiser. 4th International Conference of the African Language Association of Southern Africa (ALASA). http://hdl.handle.net/10204/1837
Louw, A, M Davel, and E Barnard. "General-purpose isiZulu speech synthesiser." (2005): http://hdl.handle.net/10204/1837
Louw A, Davel M, Barnard E, General-purpose isiZulu speech synthesiser; 4th International Conference of the African Language Association of Southern Africa (ALASA); 2005. http://hdl.handle.net/10204/1837 .