Gender identification in Sepedi speech corpus

Sefara, Tshephisho J; Mokgonyane, TB

DOI: 10.1109/icABCD51485.2021.9519308
http://hdl.handle.net/10204/12120

Abstract:

Gender identification is the task of identifying the gender of a speaker from an audio signal. Most gender identification systems are developed using datasets from well-resourced languages, and little attention has been given to under-resourced African languages. This paper presents the development of a gender identification system using a Sepedi speech dataset with a duration of 55.7 hours, made up of 30776 males and 28337 females. We build the system using three machine learning models: multilayer perceptron (MLP), convolutional neural network (CNN), and long short-term memory (LSTM). Mid-term features are extracted from time-domain, frequency-domain and cepstral-domain features, and normalised using the Z-score normalisation technique. XGBoost is used as a feature selection method to select important features. On data with seen speakers, MLP achieved an F-score and accuracy of 94%, while LSTM and CNN each achieved an F-score and accuracy of 97%. We further evaluated the models on data with unseen speakers, where all the models achieved good performance in F-score and accuracy.
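As an illustration of two steps named in the abstract, the sketch below shows Z-score normalisation of a feature matrix and the computation of accuracy and F-score from predictions. It is a minimal example and not the authors' code: the function names, the toy data, and the choice of a binary F1 score with one gender as the positive class are assumptions made here for illustration. The abstract does not state how the F-score was averaged.

```python
import numpy as np


def fit_zscore(train):
    """Return the per-feature mean and standard deviation of a training matrix."""
    mean = train.mean(axis=0)
    std = train.std(axis=0)
    std[std == 0] = 1.0  # leave constant features unscaled instead of dividing by zero
    return mean, std


def apply_zscore(features, mean, std):
    """Normalise features using statistics estimated on the training data."""
    return (features - mean) / std


def accuracy_and_f1(y_true, y_pred, positive=1):
    """Accuracy and binary F1 score (assumed metric variant, see lead-in)."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = int(np.sum((y_pred == positive) & (y_true == positive)))
    fp = int(np.sum((y_pred == positive) & (y_true != positive)))
    fn = int(np.sum((y_pred != positive) & (y_true == positive)))
    accuracy = float(np.mean(y_true == y_pred))
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


# Toy feature matrix: rows are utterances, columns are hypothetical features.
train = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])
mean, std = fit_zscore(train)
train_norm = apply_zscore(train, mean, std)

# Toy labels and predictions (1 and 0 stand for the two gender classes).
acc, f1 = accuracy_and_f1([1, 0, 1, 1], [1, 0, 0, 1])
```

Estimating the mean and standard deviation on the training portion only, then reusing them for held-out data, keeps the evaluation on unseen speakers free of statistics taken from the test set.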

Reference:

Sefara, T.J. & Mokgonyane, T. 2021. Gender identification in Sepedi speech corpus. http://hdl.handle.net/10204/12120

Sefara, T. J., & Mokgonyane, T. (2021). Gender identification in Sepedi speech corpus. http://hdl.handle.net/10204/12120

Sefara, Tshephisho J, and TB Mokgonyane. "Gender identification in Sepedi speech corpus." 2021 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD), Durban, South Africa, 5-6 August 2021 (2021): http://hdl.handle.net/10204/12120

Sefara TJ, Mokgonyane T. Gender identification in Sepedi speech corpus; 2021. http://hdl.handle.net/10204/12120

Aug 2021

Gender identification
Convolutional neural network
Sepedi
XGBoost
Feature selection
Long short-term memory
Multilayer Perceptron

Files in this item

Sefara2_2021.pdf

Source

2021 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD), Durban, South Africa, 5-6 August 2021

This item appears in the following Collection(s)

Conference Publications
