Text normalisation in text-to-speech synthesis comprises the segmentation and classification of the incoming text and the subsequent expansion of non-standard words into their standard, spoken word forms. We present a rule-based implementation for the 11 official South African languages that uses native number expansion. We discuss the architecture and performance of the implementation using examples of cardinal and ordinal numbers, money, dates and times. Although the implementation scores well, it is currently limited to handling non-standard words in isolation. Future work will need to address sentence context in order to normalise the African languages correctly.
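The three stages named above (segmentation, classification, expansion) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the regular expressions and the supported formats are all hypothetical, and English number words stand in for the native number expansions of the South African languages, which are not reproduced here.

```python
import re

# English number words, used only as a stand-in for a native expansion.
ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def cardinal(n: int) -> str:
    """Expand an integer in the range 0..999999 into words."""
    if not 0 <= n < 1_000_000:
        raise ValueError("sketch only covers 0..999999")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        return ONES[hundreds] + " hundred" + (" and " + cardinal(rest) if rest else "")
    thousands, rest = divmod(n, 1000)
    return cardinal(thousands) + " thousand" + (" " + cardinal(rest) if rest else "")


# Hypothetical classification rules, tried in order; the first full match wins.
PATTERNS = [
    ("money", re.compile(r"R(\d+)\.(\d{2})")),
    ("time", re.compile(r"(\d{1,2}):(\d{2})")),
    ("cardinal", re.compile(r"\d+")),
]


def classify(token: str):
    """Return (label, match) for a non-standard word, or ("word", None)."""
    for label, pattern in PATTERNS:
        match = pattern.fullmatch(token)
        if match:
            return label, match
    return "word", None


def expand(token: str) -> str:
    """Expand one token in isolation, without looking at its neighbours."""
    label, match = classify(token)
    if label == "money":
        rand, cents = int(match.group(1)), int(match.group(2))
        words = cardinal(rand) + " rand"
        if cents:
            words += " and " + cardinal(cents) + " cents"
        return words
    if label == "time":
        hour, minute = int(match.group(1)), int(match.group(2))
        if minute == 0:
            return cardinal(hour) + " o'clock"
        if minute < 10:
            return cardinal(hour) + " oh " + cardinal(minute)
        return cardinal(hour) + " " + cardinal(minute)
    if label == "cardinal":
        return cardinal(int(match.group(0)))
    return token  # standard word: pass through unchanged


def normalise(text: str) -> str:
    """Segment on whitespace, then classify and expand each token."""
    return " ".join(expand(token) for token in text.split())


print(normalise("Pay R12.50 at 9:30 for 3 items"))
```

Like the implementation described in the abstract, this sketch expands each non-standard word in isolation. It has no access to the surrounding sentence, which is where a context-aware approach would need to differ.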
Reference:
Schlünz, G.I., Dlamini, N., Tshoane, A. & Ramunyisi, N.S. 2017. Text normalisation in text-to-speech synthesis for South African languages: native number expansion. In: 2017 PRASA-RobMech International Conference, Bloemfontein, Free State, South Africa, 29 November - 1 December 2017. IEEE. http://hdl.handle.net/10204/11055
The attached PDF contains the accepted version of the published item. For access to the published version, kindly consult https://ieeexplore.ieee.org/document/8261153