Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli

Gordon, Peter C.; Keyes, Lisa; Yung, Yiu-Fai

doi:10.3758/BF03194435

Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli

Published: May 2001

Volume 63, pages 746–758, (2001)
Cite this article

Download PDF

Perception & Psychophysics Aims and scope Submit manuscript

Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli

Download PDF

Peter C. Gordon¹,
Lisa Keyes¹ &
Yiu-Fai Yung¹^nAff2

387 Accesses
12 Citations
Explore all metrics

Abstract

The perception of the distinction between /r/ and /l/ by native speakers of American English and of Japanese was studied using natural and synthetic speech. The American subjects were all nearly perfect at recognizing the natural speech sounds, whereas there was substantial variation among the Japanese subjects in their accuracy of recognizing /r/ and /l/ except in syllable-final position. A logit model, which additively combined the acoustic information conveyed byF1-transition duration and byF3-onset frequency, provided a good fit to the perception of synthetic /r/ and /l/ by the American subjects. There was substantial variation among the Japanese subjects in whether theF1 andF3 cues had a significant effect on their classifications of the synthetic speech. This variation was related to variation in accuracy of recognizing natural /r/ and /l/, such that greater use of both theF1 cue and theF3 cue in classifying the synthetic speech sounds was positively related to accuracy in recognizing the natural sounds. However, multiple regression showed that use of theF1 cue did not account for significant variance in natural speech performance beyond that accounted for by theF3 cue, indicating that theF3 cue is more important than theF1 cue for Japanese speakers learning English. The relation between performance on natural and synthetic speech also provides external validation of the logit model by showing that it predicts performance outside of the domain of data to which it was fit.

Article PDF

A chimpanzee recognizes varied acoustical versions of sine-wave and noise-vocoded speech

Article 08 February 2021

Degraded and computer-generated speech processing in a bonobo

Article Open access 20 May 2022

Non-native speech recognition sentences: A new materials set for non-native speech perception research

Article Open access 22 April 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Best, C. T. (1994). The emergence of native-language phonological influences in infants: A perceptual assimilation model. In J. C. Goodman & H. C. Nusbaum, (Eds.),The development of speech perception: The transition from speech sounds to spoken words (pp. 167–224). Cambridge, MA: MIT Press.
Google Scholar
Best, C. T., &Strange, W. (1992). Effects of phonological and phonetic factors on cross-language perception of approximants.Journal of Phonetics,20, 305–330.
Google Scholar
Bradlow, A. R., Akahane-Yamada, R., Pisoni, D.B., &Tohkura, Y. (1999). Training Japanese listeners to identify English /r/ and /l/: Long-term retention of learning in perception and production.Perception & Psychophysics,61, 977–985.
Google Scholar
Bradlow, A. R., Pisoni, D. B., Akahane-Yamada, R., &Tohkura, Y. (1997). Training Japanese listeners to identify English /r/ and /l/: IV. Some effects of perceptual learning on speech production.Journal of the Acoustical Society of America,101, 2299–2310.
Article PubMed Google Scholar
Crowther, C. S., Batchelder, W. H., &Hu, X. (1995). A measurementtheoretic analysis of the fuzzy logic model of perception.Psychological Review,102, 396–408.
Article PubMed Google Scholar
Dalston, R. M. (1975). Acoustic characteristics of English /w, r, l/ spoken correctly by young children and adults.Journal of the Acoustical Society of America,57, 462–469.
Article PubMed Google Scholar
Espy-Wilson, C. Y. (1992). Acoustic measures for linguistic features distinguishing the semi-vowels /w j r l/ in American English.Journal of the Acoustical Society of America,92, 736–757.
Article Google Scholar
Gordon, P. C., Eberhardt, J. L., &Rueckl, J. G. (1993). Attentional modulation of the phonetic significance of acoustic cues.Cognitive Psychology,25, 1–42.
Article PubMed Google Scholar
Green, D. M., &Swets, J. A. (1966).Signal detection theory and psychophysics. New York: Wiley.
Google Scholar
Lively, S. E., Logan, J. S., &Pisoni, D. B. (1993). Training Japanese listeners to identify English /r/ and /l/: II. The role of phonetic environment and talker variability in learning new perceptual categories.Journal of the Acoustical Society of America,94, 1242–1255.
Article PubMed Google Scholar
Logan, J. S., Lively, S. E., &Pisoni, D. B. (1991). Training Japanese listeners to identify English /r/ and /l/: A first report.Journal of the Acoustical Society of America,89, 874–886.
Article PubMed Google Scholar
Luce, R. D. (1959).Individual choice behavior. New York: Wiley.
Google Scholar
MacKain, K., Best, C., &Strange, W. (1981). Categorical perception of English /r/ and /l/ by Japanese bilinguals.Applied Psycholinguistics,2, 369–390.
Article Google Scholar
Macmillan, N. A., &Creelman, C. D. (1991).Detection theory: A user’s guide. New York: Cambridge University Press.
Google Scholar
Massaro, D. W. (1987).Speech perception by ear and eye: A paradigm for psychological inquiry. Hillsdale, NJ: Erlbaum.
Google Scholar
Massaro, D. W. (1989). Testing between the TRACE model and the fuzzy logical model of perception.Cognitive Psychology,21, 398–421.
Article PubMed Google Scholar
Massaro, D. W., &Oden, G. C. (1980). Speech perception: A framework for research and theory. In N. J. Lass (Ed.),Speech and language: Advances in basic research and practice (Vol. 3, pp. 129–165). New York: Academic Press.
Google Scholar
Massaro, D. W., &Oden, G. C. (1995). Independence of lexical context and phonological information in speech perception.Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1053–1064.
Article Google Scholar
McClelland, J. L. (1991). Stochastic interactive processes and the effect of context on perception.Cognitive Psychology,23, 1–44.
Article PubMed Google Scholar
McClelland, J. L., &Rumelhart, D. E. (1981). An interactive activation model of context effect in letter perception. Part 1. An account of basic findings.Psychological Review,88, 375–407.
Article Google Scholar
Miller, J. L. (1977). Nonindependence of feature processing in initial consonants.Journal of Speech & Hearing Research,20, 519–528.
Google Scholar
Miyawaki, K., Strange, W., Verbrugge, R., Liberman, A.M., Jenkins, J. J., &Fujimura, O. (1975). An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English.Perception & Psychophysics,18, 331–340.
Google Scholar
Nearey, T. M. (1990). The segment as a unit of speech perception.Journal of Phonetics,18, 347–373.
Google Scholar
O’Connor, J. D., Gerstman, L. J., Liberman, A. M., Delattre, P. C., &Cooper, F. S. (1957). Acoustic cues for the perception of initial /w, j, r, l/ in English.Word,13, 24–43.
Google Scholar
Oden, G. C. (1979). A fuzzy logical model of letter identification.Journal of Experimental Psychology: Human Perception & Performance,5, 336–352.
Article Google Scholar
Oden, G. C., &Massaro, D. W. (1978). Integration of featural information in speech perception.Psychological Review,85, 172–191.
Article PubMed Google Scholar
Olive, J. P., Greenwood, A., &Coleman, J. (1993).Acoustics of American English: A dynamic approach. New York: Springer-Verlag.
Google Scholar
Pisoni, D. B., Lively, S. E., &Logan, J. S. (1994). Perceptual learning of nonnative speech contrasts: Implications for theories of speech perception. In J. C. Goodman & H. C. Nusbaum (Eds.),The development of speech perception: The transition from speech sounds to spoken words (pp. 121–166). Cambridge, MA: MIT Press.
Google Scholar
Pitt, M. A. (1995a). Data fitting and detection theory: Reply to Massaro and Oden (1995).Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1065–1067.
Article Google Scholar
Pitt, M. A. (1995b). The locus of the lexical shift in phoneme identification.Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1037–1052.
Article Google Scholar
Polka, L., &Strange, W. (1985). Perceptual equivalence of acoustic cues that differentiate /r/ and /l/.Journal of the Acoustical Society of America,78, 1187–1197.
Article PubMed Google Scholar
Repp, B. H. (1983). Trading relations among acoustic cues in speech perception are largely a result of phonetic categorization.Speech Communication,2, 341–362.
Article Google Scholar
Sheldon, A., &Strange, W. (1982). The acquisition of /r/ and /l/ by Japanese learners of English: Evidence that speech production can precede perception.Applieds Psycholinguistics,3, 243–261.
Article Google Scholar
Strange, W. (1995). Cross-language studies of speech perception: A historical review. In W. Strange (Ed.),Speech perception and linguistic experience (pp. 3–45). Baltimore, MD: York.
Google Scholar
Strange, W., &Dittmann, S. (1984). Effects of discrimination training of the perception of /r-l/ by Japanese adults learning English.Perception & Psychophysics,36, 131–145.
Google Scholar
Underbakke, M., Polka, L., Gottfried, T. L., &Strange, W. (1988). Trading relations in the perception of /r/-/l/ by Japanese learners of English.Journal of the Acoustical Society of America,84, 90–100.
Article PubMed Google Scholar
Vance, T. J. (1987).An introduction to Japanese phonology. Albany: State University of New York Press.
Google Scholar
Werker, J. F. (1994). Cross-language speech perception: Developmental change does not involve loss. In J. C. Goodman & H. C. Nusbaum (Eds.),The development of speech perception: The transition from speech sounds to spoken words (pp. 93–120). Cambridge, MA: MIT Press.
Google Scholar
Wickens, T. D. (1989).Multiway contingency tables analysis for the social sciences. Hillsdale, NJ: Erlbaum.
Google Scholar
Yamada, R. A. (1995). Age of acquisition of second language speech sounds: Perception of American English. In W. Strange (Ed.),Speech perception and linguistic experience (pp. 305–320). Baltimore, MD: York.
Google Scholar
Yamada, R. A., &Tohkura, Y. (1991). Perception of American English /r/ and /l/ by native speakers of Japanese. In Y. Tohkura, E. Vatikiotis-Bateson, & Y. Sagisaka (Eds.),Speech perception, production and linguistic structure (pp. 155–174). Tokyo: Ohmsha.
Google Scholar
Yamada, R. A., &Tohkura, Y. (1992). The effects of experimental variables on the perception of American English /r/ and /l/ by Japanese listeners.Perception & Psychophysics,52, 376–392.
Google Scholar

Download references

Author information

Yiu-Fai Yung
Present address: SAS Institute, 27513, Cary, NC

Authors and Affiliations

Department of Psychology, University of North Carolina, Chapel Hill, 27599-3270, NC
Peter C. Gordon, Lisa Keyes & Yiu-Fai Yung

Authors

Peter C. Gordon
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Keyes
View author publications
You can also search for this author in PubMed Google Scholar
Yiu-Fai Yung
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter C. Gordon.

Additional information

The research reported here was supported by Grant IIS-9811129 from the National Science Foundation.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gordon, P.C., Keyes, L. & Yung, YF. Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli. Perception & Psychophysics 63, 746–758 (2001). https://doi.org/10.3758/BF03194435

Download citation

Received: 05 June 1998
Accepted: 23 August 2000
Issue Date: May 2001
DOI: https://doi.org/10.3758/BF03194435

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli

Abstract

Article PDF

Similar content being viewed by others

A chimpanzee recognizes varied acoustical versions of sine-wave and noise-vocoded speech

Degraded and computer-generated speech processing in a bonobo

Non-native speech recognition sentences: A new materials set for non-native speech perception research

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli

Abstract

Article PDF

Similar content being viewed by others

A chimpanzee recognizes varied acoustical versions of sine-wave and noise-vocoded speech

Degraded and computer-generated speech processing in a bonobo

Non-native speech recognition sentences: A new materials set for non-native speech perception research

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation