Phoneme-to-phoneme (P2P) learning provides a mechanism for predicting the pronunciation of a word based on its pronunciation in a different accent, dialect or language. The authors evaluate the effectiveness of manually-developed as well as automatically derived P2P rules for British to South African English pronunciation conversion. Using the freely-available Oxford Advanced Learners Dictionary of Contemporary English (OALD) as source, the two approaches to P2P conversion are compared to a manually-developed South African English pronunciation dictionary. The authors show that, when the British English pronunciation is known, a small manually-derived rule set is able to approximate the South African pronunciation surprisingly well. Furthermore they demonstrate that the best performance is achieved by data-driven P2P learning, which proves to be a better mechanism for pronunciation prediction than both manually-derived P2P rules as well as data-driven grapheme-to-phoneme (G2P) conversion.
Reference:
Loots, L, Davel, M et al. 2009. Comparing manually-developed and data-driven rules for P2P learning. 20th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA). Stellenbosch, South Africa, 30 November - 01 December 2009, pp 35-40
Loots, L., Davel, M., Barnard, E., & Niesler, T. (2009). Comparing manually-developed and data-driven rules for P2P learning. PRASA 2009. http://hdl.handle.net/10204/3851
Loots, L, M Davel, E Barnard, and T Niesler. "Comparing manually-developed and data-driven rules for P2P learning." (2009): http://hdl.handle.net/10204/3851
Loots L, Davel M, Barnard E, Niesler T, Comparing manually-developed and data-driven rules for P2P learning; PRASA 2009; 2009. http://hdl.handle.net/10204/3851 .