Automatic speaker recognition, the task of automatically identifying and/or verifying a speaker's identity from a recorded speech sample, has been studied for many years, and recent improvements have made the technology an inexpensive and reliable means of identifying and verifying people. Although automatic speaker recognition research spans more than 50 years, little of it addresses low-resourced indigenous South African languages. In this paper, a multi-layer perceptron (MLP) classifier model is trained and deployed behind a graphical user interface (GUI) for real-time identification and verification of native Sepedi speakers. Sepedi is a low-resourced language spoken by the majority of residents in the Limpopo province of South Africa. The data used to train the speaker recognition system is obtained from the NCHLT (National Centre for Human Language Technology) project. A total of 34 short-term acoustic features of speech are extracted using the pyAudioAnalysis library, and scikit-learn is used to train the MLP classifier model, which achieves an accuracy of 95%. The GUI is developed with Qt Creator and PyQt4, and it obtains a true acceptance rate (TAR) of 66.67% and a true rejection rate (TRR) of 13.33%.
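The abstract reports TAR and TRR without stating how they were computed. The following is a minimal sketch, assuming the conventional definitions: TAR is the fraction of genuine attempts that are accepted, and TRR is the fraction of impostor attempts that are rejected. The function name `verification_rates`, the score scale, and the 0.5 threshold are hypothetical and are not taken from the paper.

```python
def verification_rates(trials, threshold=0.5):
    """Compute (TAR, TRR) from verification trials.

    trials: iterable of (score, is_genuine) pairs, where score is the
    classifier's confidence that the claimed identity is correct
    (assumed here to lie in [0, 1]) and is_genuine marks whether the
    claim was actually true.
    """
    genuine_total = genuine_accepted = 0
    impostor_total = impostor_rejected = 0

    for score, is_genuine in trials:
        accepted = score >= threshold  # accept the identity claim
        if is_genuine:
            genuine_total += 1
            genuine_accepted += int(accepted)
        else:
            impostor_total += 1
            impostor_rejected += int(not accepted)

    if genuine_total == 0 or impostor_total == 0:
        raise ValueError("need at least one genuine and one impostor trial")

    tar = genuine_accepted / genuine_total      # true acceptance rate
    trr = impostor_rejected / impostor_total    # true rejection rate
    return tar, trr


# Illustrative, made-up trials: three genuine claims and two impostor claims.
trials = [(0.9, True), (0.7, True), (0.4, True), (0.8, False), (0.2, False)]
tar, trr = verification_rates(trials)
print(f"TAR = {tar:.2%}, TRR = {trr:.2%}")
```

Under these definitions the two rates are computed over different trial populations, so they need not sum to 100%. Raising the threshold generally trades a lower TAR for a higher TRR.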
Reference:
Mokgonyane, TB, Tshephisho J Sefara, MJ Manamela, and TI Modipa. "A cross-platform interface for automatic speaker identification and verification." 2021 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD), Durban, South Africa, 5-6 August 2021 (2021): http://hdl.handle.net/10204/12123
2021 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD), Durban, South Africa, 5-6 August 2021