Speech rate normalization used to improve speaker verification

Van Heerden, CJ; Barnard, E

http://hdl.handle.net/10204/1045

Abstract:

A novel approach to speech rate normalization is presented. Models are constructed to capture how variation in the speech rate of a specific speaker influences the duration of phonemes. The models are evaluated in two ways. First, the mean square error in phoneme duration obtained with our normalization is compared to the same error when no such normalization is applied. Second, the durations of context-dependent phonemes are used in speaker verification. Both evaluations show that this approach to normalization is effective in counteracting the effect of variable speaking rates.

Reference:

Van Heerden, CJ and Barnard, E. 2006. Speech rate normalization used to improve speaker verification. 17th Annual Symposium of the Pattern Recognition Association of South Africa, Parys, South Africa, 29 Nov - 1 Dec 2006, pp 6

This paper is published in the SAIEE Africa Research Journal, Vol 98(4), pp 129-135

Author:

Van Heerden, CJ
Barnard, E

Date:

Nov 2006

Subject:

Speaker verification
Phoneme
Parametric models
Triphone duration models

Files in this item

vanHeerden_2006.pdf

This item appears in the following Collection(s)

Conference Publications
