| Closure duration
analysis of incomplete stop consonants due to stop-stop interaction Prasanta Kumar Ghosh and Shrikanth Narayanan J. Acoust. Soc. Am. Express Letters, Volume 126, Issue 1, July, 2009, pp. EL1-EL7 |
Abstract: An incomplete stop consonant is characterized either by an indistinguishable closure or a missing burst. If an incomplete stop happens due to a stop following another stop (stop-stop interaction [SSI]), its acoustics typically resemble that of a complete stop - one closure followed by a single burst. As a consequence, stop detectors would fail to distinguish an SSI from a complete stop. Analysis of the TIMIT corpus shows 35.04% incomplete stops (14.97% SSI). It is shown that using automatically estimated (and hand-labeled) closure duration, complete stops can be distinguished from incomplete stops due to SSI with 69.66% (79.14%) accuracy. |
(pdf) |
References: [1] J. P. Olive, A. Greenwood, and J. Coleman, Acoustics of American English Speech - A Dynamic Approach (Springer-Verlag, 227-312, 1993). [2] C. Browman and L. Goldstein, Tiers in Articulatory Phonology, with some implications for casual speech (J. Kingston and M.E. Beckman, Editors, Papers in Laboratory Phonology, Cambridge University Press, Cambridge, 341-386, 1990). [3] T. H. Crystal and A. S. House, “The duration of american-english stop consonants: An overview,” J. Phonetics 16, 285–294 (1988). [4] T. Deelman and C. M. Connine, “Missing information in spoken word recognition: Nonreleased stop consonants,” J. Experimental Psychology: Human Perception and Performance,656–663 (2001). [5] A. M. A. Ali, J. V. der Spiegel, and P. Mueller, “Acoustic-phonetic features for the automatic classification of stop consonants,” IEEE Trans. Speech and Audio Processing 9, 833–841 (2001). [6] M. F. Dorman, M. Studdert-Kennedy, and L. J. Raphael, “Stop-consonant recognition: Release bursts and formant transitions as functionally equivalent, context-dependent cues,” Percept. Psychophys.,109–122 (1977). [7] P. Niyogi and M. M. Sondhi, “Detecting stop consonants in continuous speech,” J. Acoust. Soc. Am. 111, 1063–1076 (2002). [8] F. Malbos, M. Baudry, and S. Montresor, “Detection of stop consonants with the wavelet transform,” Proc. IEEE-SP Int. Symp. on Time-Frequency and Time-Scale Analysis,612–615 (1994). [9] A. Jansen and P. Niyogi, “Modeling the temporal dynamics of distinctive feature landmark detector for speech recognition,” J. Acoust. Soc. Am. 124, 1739–1758 (2008). [10] J. S. Garofolo, “Timit acoustic-phonetic continuous speech corpus,” LDC, Philadelphia, (1993). [11] Y. Zheng, Acoustic Modeling and Feature Selection for Speech Recognition (PhD Thesis, pp. 40, University of Illinois at Urbana-Champaign, 2005). [12] Y. Homma, “Durational relationship between japanese stops and vowels,” Journal of Phonetics 9, 273–281 (1981). [13] S. J. Manuel, S. S. Hufnagel, M. Huffman, K. N. Stevens, R. Carlson, and S. Hunnicutt,“Studies of vowel and consonant reduction,” Proc. ICSLP,943–946 (1992). |