Closure duration analysis of incomplete stop consonants due to stop-stop interaction
Prasanta Kumar Ghosh and Shrikanth Narayanan

J. Acoust. Soc. Am. Express Letters, Volume 126, Issue 1, July, 2009, pp. EL1-EL7

Abstract: An incomplete stop consonant is characterized either by an indistinguishable closure or a missing burst. If an incomplete stop happens due to a stop following another stop (stop-stop interaction [SSI]), its acoustics typically resemble that of a complete stop - one closure followed by a single burst. As a consequence, stop detectors would fail to distinguish an SSI from a complete stop. Analysis of the TIMIT corpus shows 35.04% incomplete stops (14.97% SSI). It is shown that using automatically estimated (and hand-labeled) closure duration, complete stops can be distinguished from incomplete stops due to SSI with 69.66% (79.14%) accuracy.


(pdf)


References:

[1]  J. P. Olive, A. Greenwood, and J. Coleman, Acoustics of American English Speech - A Dynamic Approach (Springer-Verlag, 227-312, 1993).
[2] C. Browman and L. Goldstein, Tiers in Articulatory Phonology, with some implications for casual speech (J. Kingston and M.E. Beckman, Editors, Papers in Laboratory Phonology, Cambridge University Press, Cambridge, 341-386, 1990).
[3] T. H. Crystal and A. S. House, “The duration of american-english stop consonants: An overview,” J. Phonetics 16, 285–294 (1988).
[4] T. Deelman and C. M. Connine, “Missing information in spoken word recognition: Nonreleased stop consonants,” J. Experimental Psychology: Human Perception and Performance,656–663 (2001).
[5] A. M. A. Ali, J. V. der Spiegel, and P. Mueller, “Acoustic-phonetic features for the automatic classification of stop consonants,” IEEE Trans. Speech and Audio Processing 9, 833–841 (2001).
[6] M. F. Dorman, M. Studdert-Kennedy, and L. J. Raphael, “Stop-consonant recognition: Release bursts and formant transitions as functionally equivalent, context-dependent cues,” Percept. Psychophys.,109–122 (1977).
[7] P. Niyogi and M. M. Sondhi, “Detecting stop consonants in continuous speech,” J. Acoust. Soc. Am. 111, 1063–1076 (2002).
[8] F. Malbos, M. Baudry, and S. Montresor, “Detection of stop consonants with the wavelet transform,” Proc. IEEE-SP Int. Symp. on Time-Frequency and Time-Scale Analysis,612–615 (1994).
[9] A. Jansen and P. Niyogi, “Modeling the temporal dynamics of distinctive feature landmark detector for speech recognition,” J. Acoust. Soc. Am. 124, 1739–1758 (2008).
[10]  J. S. Garofolo, “Timit acoustic-phonetic continuous speech corpus,” LDC, Philadelphia, (1993).
[11] Y. Zheng, Acoustic Modeling and Feature Selection for Speech Recognition (PhD Thesis, pp. 40, University of Illinois at Urbana-Champaign, 2005).
[12] Y. Homma, “Durational relationship between japanese stops and vowels,” Journal of Phonetics 9, 273–281 (1981).
[13] S. J. Manuel, S. S. Hufnagel, M. Huffman, K. N. Stevens, R. Carlson, and S. Hunnicutt,“Studies of vowel and consonant reduction,” Proc. ICSLP,943–946 (1992).










The University of Southern California does not screen or control the content on this website and thus does not guarantee the accuracy, integrity, or quality of such content. All content on this website is provided by and is the sole responsibility of the person from which such content originated, and such content does not necessarily reflect the opinions of the University administration or the Board of Trustees