The effects of acoustic features of speech for automatic speaker recognition

Mokgonyane, TB; Sefara, Tshephisho J; Manamela, MJ; Modipa, TI; Masekwameng, MS

The effects of acoustic features of speech for automatic speaker recognition

https://ieeexplore.ieee.org/abstract/document/9183889
DOI: 10.1109/icABCD49160.2020.9183889
http://hdl.handle.net/10204/11587

Abstract:

Automatic speaker recognition is the task of automatically determining or verifying the identity of a speaker from a recording of his or her speech sample and has been studied for many decades. One of the most important steps of speaker recognition that significantly influences the speaker recognition performance is known as feature extraction. Acoustic features of speech have been researched by many researchers around the world, however, there is limited research conducted on African indigenous languages, South African official languages in particular. This paper presents the effects of acoustic features of speech towards the performance of speaker recognition systems focusing on South African low-resourced languages. This study investigates the acoustic features of speech using the National Centre for Human Language Technology (NCHLT) Sepedi speech data. Acoustic features of speech such as Time-domain, Frequency-domain and Cepstral-domain features are evaluated on four machine learning algorithms: K-Nearest Neighbours (K-NN), two kernel-based Support Vector Machines (SVM), and Multilayer Perceptrons (MLP). The results show that the performance is poor for time-domain features and good for spectral-domain features and even better for cepstral-domain features. However, the combination of these three features resulted in a higher accuracy and and F₁ score of 98%.

Reference:

Mokgonyane, T.B. (et.al). 2020 The effects of acoustic features of speech for automatic speaker recognition. 2020 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems, Durban, South Africa, 6-7 August 2020, 5pp.

Mokgonyane, T., Sefara, T. J., Manamela, M., Modipa, T., & Masekwameng, M. (2020). The effects of acoustic features of speech for automatic speaker recognition. http://hdl.handle.net/10204/11587

Mokgonyane, TB, Tshephisho J Sefara, MJ Manamela, TI Modipa, and MS Masekwameng "The effects of acoustic features of speech for automatic speaker recognition." (2020) http://hdl.handle.net/10204/11587

Mokgonyane T, Sefara TJ, Manamela M, Modipa T, Masekwameng M. The effects of acoustic features of speech for automatic speaker recognition. 2020; http://hdl.handle.net/10204/11587.

Download RIS

Copyright: 2020 IEEE. This is the preprint version of the work. For access to the published version, please access the publisher's website.

Mokgonyane, TB
Sefara, Tshephisho J
Manamela, MJ
Modipa, TI
Masekwameng, MS

Aug 2020

Acoustic features of speech
Cepstral-domain
Frequency-domain
Speaker recognition
Time-domain

Show full item record

Files in this item

RS_23774_The Effects of Acoustic Features of Speech for Automatic Speaker Recognition_August_2020.pdf

This item appears in the following Collection(s)

Journal Articles

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

The effects of acoustic features of speech for automatic speaker recognition

The effects of acoustic features of speech for automatic speaker recognition

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect