Research into spoken language, and the building of systems that process it, invariably relies on appropriately annotated speech data for analysis and for incorporation into such systems. The authors investigate methods and parameters for a baseline phonetic segmentation system on a few South African languages, with two aims: determining how accurately basic methods can be applied, and characterising typical deficiencies in order to define further refinement strategies. An HMM-based system with a single mixture per triphone is found to work well, though the accurate segmentation of plosives remains a challenge. Suggestions for addressing this challenge are presented.
Reference:
Van Niekerk, DR and Barnard, E. 2007. Important factors in HMM-based phonetic segmentation. 18th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA), Pietermaritzburg, KwaZulu-Natal, South Africa, 28-30 November 2007, pp 6. http://hdl.handle.net/10204/1978