The durations of phonemes varies for different speakers. To this end, the correlations between phonemes across different speakers are studied and a novel approach to predict unknown phoneme durations from the values of known phoneme durations for a particular speaker are presented, based on the maximum likelihood criterion. Several interesting patterns are observed. Phonemes from the same broad phonetic class tend to covey most strongly (and therefore intra-class predictions of unknown phoneme durations are most accurate), but significant cross-class correlations are also present. Consequently, knowledge of only a few highly-correlated phonemes’ durations is necessary to make a good duration prediction
Reference:
Van Heerden, CJ and Barnard, E. 2007. Speaker-specific variability of phoneme durations. 18th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Pietermaritzburg, Kwazulu-Natal, South Africa, 28-30 November 2007, pp 6
Van Heerden, C., & Barnard, E. (2007). Speaker-specific variability of phoneme durations. 18th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA). http://hdl.handle.net/10204/1974
Van Heerden, CJ, and E Barnard. "Speaker-specific variability of phoneme durations." (2007): http://hdl.handle.net/10204/1974
Van Heerden C, Barnard E, Speaker-specific variability of phoneme durations; 18th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA); 2007. http://hdl.handle.net/10204/1974 .