Abstract
The perception of the distinction between /r/ and /l/ by native speakers of American English and of Japanese was studied using natural and synthetic speech. The American subjects were all nearly perfect at recognizing the natural speech sounds, whereas there was substantial variation among the Japanese subjects in their accuracy of recognizing /r/ and /l/ except in syllable-final position. A logit model, which additively combined the acoustic information conveyed byF1-transition duration and byF3-onset frequency, provided a good fit to the perception of synthetic /r/ and /l/ by the American subjects. There was substantial variation among the Japanese subjects in whether theF1 andF3 cues had a significant effect on their classifications of the synthetic speech. This variation was related to variation in accuracy of recognizing natural /r/ and /l/, such that greater use of both theF1 cue and theF3 cue in classifying the synthetic speech sounds was positively related to accuracy in recognizing the natural sounds. However, multiple regression showed that use of theF1 cue did not account for significant variance in natural speech performance beyond that accounted for by theF3 cue, indicating that theF3 cue is more important than theF1 cue for Japanese speakers learning English. The relation between performance on natural and synthetic speech also provides external validation of the logit model by showing that it predicts performance outside of the domain of data to which it was fit.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Best, C. T. (1994). The emergence of native-language phonological influences in infants: A perceptual assimilation model. In J. C. Goodman & H. C. Nusbaum, (Eds.),The development of speech perception: The transition from speech sounds to spoken words (pp. 167–224). Cambridge, MA: MIT Press.
Best, C. T., &Strange, W. (1992). Effects of phonological and phonetic factors on cross-language perception of approximants.Journal of Phonetics,20, 305–330.
Bradlow, A. R., Akahane-Yamada, R., Pisoni, D.B., &Tohkura, Y. (1999). Training Japanese listeners to identify English /r/ and /l/: Long-term retention of learning in perception and production.Perception & Psychophysics,61, 977–985.
Bradlow, A. R., Pisoni, D. B., Akahane-Yamada, R., &Tohkura, Y. (1997). Training Japanese listeners to identify English /r/ and /l/: IV. Some effects of perceptual learning on speech production.Journal of the Acoustical Society of America,101, 2299–2310.
Crowther, C. S., Batchelder, W. H., &Hu, X. (1995). A measurementtheoretic analysis of the fuzzy logic model of perception.Psychological Review,102, 396–408.
Dalston, R. M. (1975). Acoustic characteristics of English /w, r, l/ spoken correctly by young children and adults.Journal of the Acoustical Society of America,57, 462–469.
Espy-Wilson, C. Y. (1992). Acoustic measures for linguistic features distinguishing the semi-vowels /w j r l/ in American English.Journal of the Acoustical Society of America,92, 736–757.
Gordon, P. C., Eberhardt, J. L., &Rueckl, J. G. (1993). Attentional modulation of the phonetic significance of acoustic cues.Cognitive Psychology,25, 1–42.
Green, D. M., &Swets, J. A. (1966).Signal detection theory and psychophysics. New York: Wiley.
Lively, S. E., Logan, J. S., &Pisoni, D. B. (1993). Training Japanese listeners to identify English /r/ and /l/: II. The role of phonetic environment and talker variability in learning new perceptual categories.Journal of the Acoustical Society of America,94, 1242–1255.
Logan, J. S., Lively, S. E., &Pisoni, D. B. (1991). Training Japanese listeners to identify English /r/ and /l/: A first report.Journal of the Acoustical Society of America,89, 874–886.
Luce, R. D. (1959).Individual choice behavior. New York: Wiley.
MacKain, K., Best, C., &Strange, W. (1981). Categorical perception of English /r/ and /l/ by Japanese bilinguals.Applied Psycholinguistics,2, 369–390.
Macmillan, N. A., &Creelman, C. D. (1991).Detection theory: A user’s guide. New York: Cambridge University Press.
Massaro, D. W. (1987).Speech perception by ear and eye: A paradigm for psychological inquiry. Hillsdale, NJ: Erlbaum.
Massaro, D. W. (1989). Testing between the TRACE model and the fuzzy logical model of perception.Cognitive Psychology,21, 398–421.
Massaro, D. W., &Oden, G. C. (1980). Speech perception: A framework for research and theory. In N. J. Lass (Ed.),Speech and language: Advances in basic research and practice (Vol. 3, pp. 129–165). New York: Academic Press.
Massaro, D. W., &Oden, G. C. (1995). Independence of lexical context and phonological information in speech perception.Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1053–1064.
McClelland, J. L. (1991). Stochastic interactive processes and the effect of context on perception.Cognitive Psychology,23, 1–44.
McClelland, J. L., &Rumelhart, D. E. (1981). An interactive activation model of context effect in letter perception. Part 1. An account of basic findings.Psychological Review,88, 375–407.
Miller, J. L. (1977). Nonindependence of feature processing in initial consonants.Journal of Speech & Hearing Research,20, 519–528.
Miyawaki, K., Strange, W., Verbrugge, R., Liberman, A.M., Jenkins, J. J., &Fujimura, O. (1975). An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English.Perception & Psychophysics,18, 331–340.
Nearey, T. M. (1990). The segment as a unit of speech perception.Journal of Phonetics,18, 347–373.
O’Connor, J. D., Gerstman, L. J., Liberman, A. M., Delattre, P. C., &Cooper, F. S. (1957). Acoustic cues for the perception of initial /w, j, r, l/ in English.Word,13, 24–43.
Oden, G. C. (1979). A fuzzy logical model of letter identification.Journal of Experimental Psychology: Human Perception & Performance,5, 336–352.
Oden, G. C., &Massaro, D. W. (1978). Integration of featural information in speech perception.Psychological Review,85, 172–191.
Olive, J. P., Greenwood, A., &Coleman, J. (1993).Acoustics of American English: A dynamic approach. New York: Springer-Verlag.
Pisoni, D. B., Lively, S. E., &Logan, J. S. (1994). Perceptual learning of nonnative speech contrasts: Implications for theories of speech perception. In J. C. Goodman & H. C. Nusbaum (Eds.),The development of speech perception: The transition from speech sounds to spoken words (pp. 121–166). Cambridge, MA: MIT Press.
Pitt, M. A. (1995a). Data fitting and detection theory: Reply to Massaro and Oden (1995).Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1065–1067.
Pitt, M. A. (1995b). The locus of the lexical shift in phoneme identification.Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1037–1052.
Polka, L., &Strange, W. (1985). Perceptual equivalence of acoustic cues that differentiate /r/ and /l/.Journal of the Acoustical Society of America,78, 1187–1197.
Repp, B. H. (1983). Trading relations among acoustic cues in speech perception are largely a result of phonetic categorization.Speech Communication,2, 341–362.
Sheldon, A., &Strange, W. (1982). The acquisition of /r/ and /l/ by Japanese learners of English: Evidence that speech production can precede perception.Applieds Psycholinguistics,3, 243–261.
Strange, W. (1995). Cross-language studies of speech perception: A historical review. In W. Strange (Ed.),Speech perception and linguistic experience (pp. 3–45). Baltimore, MD: York.
Strange, W., &Dittmann, S. (1984). Effects of discrimination training of the perception of /r-l/ by Japanese adults learning English.Perception & Psychophysics,36, 131–145.
Underbakke, M., Polka, L., Gottfried, T. L., &Strange, W. (1988). Trading relations in the perception of /r/-/l/ by Japanese learners of English.Journal of the Acoustical Society of America,84, 90–100.
Vance, T. J. (1987).An introduction to Japanese phonology. Albany: State University of New York Press.
Werker, J. F. (1994). Cross-language speech perception: Developmental change does not involve loss. In J. C. Goodman & H. C. Nusbaum (Eds.),The development of speech perception: The transition from speech sounds to spoken words (pp. 93–120). Cambridge, MA: MIT Press.
Wickens, T. D. (1989).Multiway contingency tables analysis for the social sciences. Hillsdale, NJ: Erlbaum.
Yamada, R. A. (1995). Age of acquisition of second language speech sounds: Perception of American English. In W. Strange (Ed.),Speech perception and linguistic experience (pp. 305–320). Baltimore, MD: York.
Yamada, R. A., &Tohkura, Y. (1991). Perception of American English /r/ and /l/ by native speakers of Japanese. In Y. Tohkura, E. Vatikiotis-Bateson, & Y. Sagisaka (Eds.),Speech perception, production and linguistic structure (pp. 155–174). Tokyo: Ohmsha.
Yamada, R. A., &Tohkura, Y. (1992). The effects of experimental variables on the perception of American English /r/ and /l/ by Japanese listeners.Perception & Psychophysics,52, 376–392.
Author information
Authors and Affiliations
Corresponding author
Additional information
The research reported here was supported by Grant IIS-9811129 from the National Science Foundation.
Rights and permissions
About this article
Cite this article
Gordon, P.C., Keyes, L. & Yung, YF. Ability in perceiving nonnative contrasts: Performance on natural and synthetic speech stimuli. Perception & Psychophysics 63, 746–758 (2001). https://doi.org/10.3758/BF03194435
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03194435