Abstract
Kewley-Port (1983) recently demonstrated that place of articulation of initial voiced stops could be identified from time-varying features observed in visual displays of linear prediction smoothed spectra. The present study extends this method of analysis in several directions. First, both voiced and voiceless syllable-initial stops produced at three speaking rates—normal, fast, and slow—were examined. Second, a new rule for vocal tract size normalization was tested. Third, the earlier time-varying features were augmented in order to specify the burst and voicing as well as place of articulation. The four time-varying features were (1) an abrupt increase in energy at high frequencies, (2) the onset of a prominent low-frequency peak, (3) the relative tilt of voiceless energy at onset, and (4) the presence of extended midfrequency peaks. Finally, the visual displays were modified to incorporate filtering and other characteristics of processing of speech by the auditory system. Auditory running spectra were generated for stop consonant-vowel syllables read by two males and two females. Employing the four time-varying features, judges first located the burst and onset of voicing, and then identified place of articulation from the visual displays. Over all conditions, place of articulation was identified at an 86% level of accuracy. While these results constitute only a first step towards an automated analysis procedure, they nonetheless indicate that our new time-varying features are appropriate for identifying place of articulation across both voiced and voiceless stops produced by different speakers at different speaking rates.
Article PDF
Similar content being viewed by others
References
Blumstein, S. E., &Stevens, K. N. (1979). Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants.Journal of the Acoustical Society of America,66, 1001–1017.
Carlson, R., &Granstrom, B. (1982).The representation of speech in the peripheral auditory system. New York: Elsevier Biomedical Press.
Delgutte, B. (1980). Representations of speech-like sounds in the discharge patterns of auditory-nerve fibers.Journal of the Acoustical Society of America,68, 843–857.
Fant, G. (1960).Acoustic theory of speech production. The Hague: Mouton.
Fant, G. (1973). Stops in CV-syllables. In G. Fant (Ed.),Speech sounds and features (pp. 110–139). Cambridge, MA: M.I.T. Press.
Flanagan, J. L., &Christensen, S. W. (1980). Computer studies on parametric coding of speech spectra.Journal of the Acouaticul Society of America,68, 420–430.
Jakobson, R., Fant, G., &Halle, M. (1952).Preliminaries to speech analysis: The distinctive features and their correlates. Cambridge, MA: M.I.T. Press.
Kewley-Port, D. (1979).Spectrum: A program for analyzing the spectral properties of speech (Research on Speech Perception: Progress Report No. 5, pp. 475–492). Bloomington: Indiana University, Department of Psychology.
Kewley-Port, D. (1980).Representations of spectral change as cues to place of articulation in stop consonants (Research on Speech Perception: Tech. Rep. No. 3). Bloomington: Indiana University, Department of Psychology.
Kewley-Port, D. (1983). Time-varying features as correlates of place of articulation in stop consonants.Journal of the Acoustical Society of America,73, 322–335.
Kewley-Port, D., Pisoni, D. B., &Studdert-Kennedy, M. (1983). Perception of static and dynamic acoustic cues to place of articulation in initial stop consonants.Journal of the Acoustical Society of America,73, 1779–1793.
Klatt, D. H. (1976). A digital filter bank for spectral matching. In C. Teacher (Ed.),Conference Record of the 1976 IEEE International Conference on Acoustics, Speech, and Signal Processing (IEEE Catalog No. 76CH1067-8 ASSP, pp. 537–540). Philadelphia: IEEE.
Klatt, D. H. (1979). Speech perception: A model of acoustic-phonetic analysis and lexical access.Journal of Phonetics,7, 279–312.
Lahiri, A., &Blumstein, S. E. (1981). A reconsideration of acoustic invariance for place of articulation in stop consonants: Evidence from cross-language studies.Journal of the Acoustical Society of America,70, S39.
Mack, M., &Blumstein, S. E. (1983). Further evidence of acoustic invariance in speech production: The stop glide contrast.Journal of the Acoustical Society of America,73, 1739–1750.
Markel, S. D., &Gray, A. H. (1976).Linear prediction of speech. New York: Springer-Verlag.
Ohde, R. N., &Stevens, K. N. (1983). Effect of burst amplitude on the perception of stop consonant place of articulation.Journal of the Acoustical Society of America,74, 706–714.
Patterson, R. D. (1976). Auditory filter shapes derived with noise stimuli.Journal of the Acoustical Society of America,59, 640–654.
Patterson, R. D., &Nimmo-Smith, I. (1980). Off frequency listening and auditory-filter symmetry.Journal of the Acoustical Society of America,67, 229–245.
Sawusch, J. R., &Pisoni, D. B. (1974). On the identification of place and voicing features in synthetic stop consonants.Journal of Phonetics,2, 181–194.
Scharf, B. (1970). Critical hands. In J. V. Tobias (Ed.),Foundations of modern auditory theory (pp. 157–202). New York: Academic Press.
Stevens, K. N. (1975). The potential role of property detectors in the perception of consonants. In G. Fant & M. A. A. Tatham (Eds.),Auditory analysis and perception of speech (pp. 303–330). New York: Academic Press.
Stevens, K. N. (1980). Acoustic correlates of some phonetic categories.Journal of the Acoustical Society of America,68, 836–842.
Stevens, K. N., &Blumstein, S. E. (1978). Invariant cues for place of articulation in stop consonants.Journal of the Acoustical Society of America,64, 1358–1368.
Stevens, K. N., &Blumstein, S. E. (1981). The search for invariant acoustic correlates of phonetic features. In P. D. Eimas & J. Miller (Eds.),Perspectives on the study of speech (pp. 1–38). Hillsdale, N J: Erlbaum.
Summerfield, A. Q. (1975).Aerodynamics versus mechanics in the control of voicing onset in consonant-vowel syllables (Speech Perception No. 4). Belfast: Queen’s University of Belfast, Department of Psychology.
Tekieli, M. E., &Cullinan, W. L. (1979). The perception of temporally segmented vowels and consonant-vowel syllables.Journal of Speech and Hearing Research,22, 103–121.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Kewley-Port, D., Luce, P.A. Time-varying features of initial stop consonants in auditory running spectra: A first report. Perception & Psychophysics 35, 353–360 (1984). https://doi.org/10.3758/BF03206339
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03206339