BLSTM harvesting of auxiliary NCHLT speech data

Badenhorst, Jacob AC; Martinus, Laura; De Wet, Febe

http://hdl.handle.net/10204/10860

Abstract:

Since the release of the National Centre for Human Language Technology (NCHLT) Speech corpus, very few additional resources for automatic speech recognition (ASR) system development have been created for South Africa’s eleven official languages. The released NCHLT corpus contained a curated but limited subset of the collected data. In this study, the auxiliary data that was not included in the released corpus was processed with the aim of improving the acoustic modelling of the NCHLT data. Recent advances in ASR modelling that incorporate deep learning approaches require even more data than previous techniques. Sophisticated neural models seem to accommodate the variability between related acoustic units better and are capable of exploiting speech resources containing more training examples. Our results show that time delay neural networks (TDNN) combined with bi-directional long short-term memory (BLSTM) models are effective, significantly reducing error rates across all languages with just 56 hours of training data. In addition, a cross-corpus evaluation of an Afrikaans system trained on the original NCHLT data plus harvested auxiliary data shows further improvements over this baseline.

Reference:

Badenhorst, J.A.C., Martinus, L. and De Wet, F. 2019. BLSTM harvesting of auxiliary NCHLT speech data. SAUPEC/RobMech/PRASA 2019 Conference, Bloemfontein, South Africa, 28-30 January 2019

Paper presented at the SAUPEC/RobMech/PRASA 2019 Conference, Bloemfontein, South Africa, 28-30 January 2019

Badenhorst, Jacob AC
Martinus, Laura
De Wet, Febe

Jan 2019

Automatic speech recognition
Bidirectional Long Short Term Memory
BLSTM
Kaldi
Languages
NCHLT corpora
Speech data
Under resourced

Files in this item

Badenhorst_22219_2019.pdf

This item appears in the following Collection(s)

Conference Publications
