Elsevier

Journal of Phonetics

Volume 40, Issue 4, July 2012, Pages 582-594
Journal of Phonetics

Two ways to listen: Do L2-dominant bilinguals perceive stop voicing according to language mode?

https://doi.org/10.1016/j.wocn.2012.05.005Get rights and content

Abstract

How listeners categorize two phones predicts the success with which they will discriminate the given phonetic distinction. In the case of bilinguals, such perceptual patterns could reveal whether the listener's two phonological systems are integrated or separate. This is of particular interest when a given contrast is realized differently in each language, as is the case with Greek and English stop-voicing distinctions. We had Greek–English early sequential bilinguals and Greek and English monolinguals (baselines) categorize, rate, and discriminate stop-voicing contrasts in each language. All communication with each group of bilinguals occurred solely in one language mode, Greek or English. The monolingual groups showed the expected native-language constraints, each perceiving their native contrast more accurately than the opposing nonnative contrast. Bilinguals' category-goodness ratings for the same physical stimuli differed, consistent with their language mode, yet their discrimination performance was unaffected by language mode and biased toward their dominant language (English). We conclude that bilinguals integrate both languages in a common phonetic space that is swayed by their long-term dominant language environment for discrimination, but that they selectively attend to language-specific phonetic information for phonologically motivated judgments (category-goodness ratings).

Highlights

► Greek–English bilinguals and monolinguals categorized and discriminated stop-voicing contrasts. ► Monolinguals perceived native contrasts more accurately than nonnative. ► Bilinguals' category-goodness ratings differed, consistent with the language mode. ► But their discrimination performance was unaffected and biased toward their dominant language (English). ► Bilinguals integrate both languages, but are swayed by their dominant language for discrimination.

Introduction

When a perceiver is bilingual, performance on cross-language speech perception tasks may indicate whether the first language (L1) and second language (L2) are integrated or kept separate. Performance equal to that of native, monolingual listeners in each language has been interpreted as evidence that bilinguals have developed separate L1 and L2 phonetic categories, whereas intermediate discrimination ability has been taken to indicate that the bilinguals rely on a set of merged categories for both languages (Flege, 2002).

Bilinguals may differ in their perception from monolinguals because of their L1–L2 sensitivity and the interaction that occurs between their languages. Even fluent, early bilinguals who use their L2 daily may show some L1 influence on their perception of the L2. For example, Spanish–Catalan1 bilinguals retain perceptual dominance from their L1 (Spanish) when categorizing and discriminating Catalan contrasts that Spanish lacks, despite years of daily experience with both languages in a bilingual environment (Pallier et al., 2001, Sebastián-Gallés and Soto-Faraco, 1999). Conversely, however, there is also much evidence that learning an L2 may change the way the listener perceives the L1 (Flege, 1993, Flege et al., 2003, Flege et al., 1999).

Several theoretical models have been proposed to account for L2 users' perception of speech segments. The Speech Learning Model (Flege, 1995, Flege, 2002) was designed to account for L2 speech production, but assumes that it is strongly influenced by perceptual biases and therefore seeks to explain the perceptual reorganization that occurs due to L2 acquisition. The Native Language Magnet model (Kuhl, 1992, 1993) seeks to explain how early attunement to the native language (L1) warps the perceptual space, such that learning after early childhood is influenced by that L1. The Perceptual Assimilation Model (PAM; Best, 1994, Best, 1995) accounts for naive listeners' discrimination of unfamiliar nonnative contrasts, and its extension to L2 perceptual learning, PAM-L2 (Best & Tyler, 2007) predicts the success with which learners should discriminate L2 contrasts. According to PAM-L2, L2 phones are first assimilated into already existing L1 categories or dissimilated from existing categories and established as new categories firstly on a phonetic level. As the learner's L2 vocabulary expands they attune to the higher-order organization of the L2 phonology, and phones come to be discriminated on the basis of meaningful categorial differences that are lexically relevant in the L2.

However, none of the aforementioned models were designed to specifically address the performance of early fluent bilinguals. Here, our purpose is not to contrast the various models of nonnative speech perception, but rather to focus on testing PAM-L2 predictions, which appear to be best suited to the bilingual case as only that model addresses and distinguishes between two levels in cross-language perception: the phonetic level of sensitivity to gradient, physical details and the abstract phonological level of perceiving meaningful distinctions.

The original PAM framework addresses nonnative speech perception by naive adults. But whereas monolingual listeners assimilate unfamiliar nonnative phones to L1 categories, bilingual listeners face a different challenge, as their perceptual system must accommodate to two known languages (Grosjean, 2008, Romaine, 1989). When perceiving speech in a foreign language, the naive monolingual listener often (but not always) perceptually assimilates nonnative phones to existing native language categories. Based on the possible assimilation patterns to the L1, PAM predicts the success with which naive listeners will discriminate nonnative contrasts. According to PAM predictions, two nonnative phones perceived as exemplars of two different native categories will result in two-category (TC) assimilation, and discrimination will be excellent. When one nonnative phone is perceived as speech, but not as reliably belonging to any single native category, and another is assimilated to a native category, that nonnative contrast will form an uncategorized–categorized pair (UC). Two nonnative phones perceived as equally good or poor exemplars of a native category will result in single category (SC) assimilation, and discrimination will be poor. Category goodness (CG) assimilation will result when two nonnative phones are assimilated to the same native category but one is perceived as a better exemplar of that category than the other. The size of this category-goodness difference will result in discrimination ranging from good to very good (for descriptions of all possible assimilation types see Best, 1995).

PAM-L2 extends the original PAM to explain effects of L2 perceptual learning and predict the success with which learners will discern L2 contrasts depending on how they are assimilated to the L1 on the basis of both phonetic and phonological similarities, and predicts on this basis which new L2 phonological categories will be established. Phonetic categories describe clusters of individual phones, which differ from each other in a gradient fashion. Phonological categories are comprised of one or more phonetic category and specify the linguistic function of phonetic segments within the lexicon of a language, that is, which segments are distinctive and which are not. Bilinguals may thus perceive phonetic differences between L1 and L2 segments (e.g., Greek versus English realizations of /p/) while simultaneously recognizing properties that indicate a shared (or ‘common’) phonological contrast (e.g., /b/ vs. /p/). That is, a common L1/L2 phonological category can include two distinct regions of phonetic space for each language, and those language-specific phonetic categories are free to evolve without necessarily influencing one another.

From a PAM-L2 perspective, language users never stop learning, and yet, there is a qualitative difference between individuals who are actively acquiring an L2 versus stable bilinguals (early state vs. end state). PAM-L2 focuses on learners who are actively acquiring their L2, and identifies factors that are important in the formation of L2 speech categories (Best & Tyler, 2007). For instance, the contribution of age of L2 learning is modulated by the relative quantity and quality of input from native L2 speakers (Flege and Liu, 2001, Jia and Aaronson, 2003, Jia et al., 2006), length of residence in an L2-speaking environment, and relative L1:L2 usage (Flege, 1999, Flege, 2002, Flege and MacKay, 2004). Based on these assumptions, it follows that early bilinguals who have used their L2 so extensively that they have become L2-dominant should be the most likely to have developed L2 categories because they have strong L2 biases on all of these factors. Specifically, the type of L2-dominant bilingual we focus on here acquired both their L1 and L2 from native speakers, they were born and have lived the entirety of their lives in the country in which their L2 is spoken, and their L1:L2 usage ratio is heavily skewed toward the L2 as it is used in a much wider variety of social situations. Thus, this type of bilingualism could be highly informative to our understanding of L2 speech category formation. However, little to no research has focused on this type of bilingual. Most bilingual and L2-learner speech perception research has instead focused on other questions suited to the populations that have much more often been studied: L1-dominant bilinguals, and bilinguals living in more balanced bilingual environments where both languages are spoken widely and on a daily basis.

While researchers have interpreted bilinguals' performance in speech perception tasks as evidence for their stable phonological organization for the two languages, bilingual performance on such tasks may instead depend on what language-specific information a bilingual attends to. This may be especially important to consider in light of possible differences between L2-dominant bilinguals and L1-dominant or bilingual-environment bilinguals. The Language Mode framework (Grosjean, 1998, Grosjean, 2001, Grosjean, 2008) posits that, depending on the communicative situation, just one of a bilingual's languages may be activated (called monolingual mode, although the other language is never completely deactivated), or both languages may be activated (bilingual mode), thereby directing attention to language-appropriate information. As we will be comparing (L2-dominant) bilinguals with monolinguals of each language, we instead refer to monolingual mode as unilingual mode to avoid confusion. These language modes are thought to influence all aspects of language processing (Grosjean, 1998); however, research has barely touched on the perceptual effects of language mode in bilinguals. Although clear language mode effects have been observed for speech production (Antoniou et al., 2010, Antoniou et al., 2011, Hazan and Boulakia, 1993, Magloire and Green, 1999), language mode effects on speech perception are less clear.

Despite the lack of a theory of segmental speech perception that explicitly considers the contribution of the situational language context in bilinguals, the effect of language mode on perception has been investigated in a number of studies. Language mode has not always resulted in shifts in perception. In one study, Canadian-French–English bilinguals identified synthetic /p, t, k/ in separate French and English unilingual language modes (Caramazza, Yeni-Komshian, Zurif, & Carbone 1973). The bilinguals were unaffected by language mode and showed intermediate identification patterns relative to the monolingual French and English listeners, in both English and French language modes. Similarly, language mode failed to influence the identification of Spanish–English bilinguals' discrimination of /b/–/p/ (Williams, 1977). Their category boundaries were at an intermediate point relative to those of monolingual listeners of Spanish and English. These combined findings suggested that bilinguals differ from monolinguals in their perceptual sensitivity.

Yet, some other studies have demonstrated language mode effects on bilinguals' perceptual sensitivity, as evident in phoneme category shifts for both language modes. For example, Dutch–English bilinguals shifted their phonemic boundaries (by ∼5 ms) when identifying tokens from a 16-step voice onset time (VOT) /da/–/ta/ continuum in Dutch versus in English (Flege & Eefting, 1987). Similarly, English–Spanish early bilinguals identified a series of naturally produced syllables which varied in VOT from /ba/ to /pa/ (Elman, Diehl, & Buchwald, 1977). The bilinguals perceived a reliably greater number of the test items as /ba/ in the English than in the Spanish unilingual mode. Importantly for the present study, the magnitude of this perceptual switching effect was more pronounced for bilinguals who were more proficient in the L2. Consistent with these observations, Hazan and Boulakia (1993) reported that ‘stronger’ French–English bilinguals showed phoneme boundary shifts due to language mode (∼5 ms). Therefore, despite their L2 proficiency later in life, bilinguals showed identification patterns that reflected their dominant language, as defined by those researchers. However, we must note that in that study ‘language dominance’ was equated with ‘strength of bilingualism’, and was determined from each bilingual's language of exposure during their early years. Thus, it did not necessarily represent their dominant language at the time of testing.

In a more recent study, Spanish–English early bilinguals (who started learning their L2 between 3 and 12 years of age) categorized stimuli from a /ɡa/–/ka/ continuum in separate language modes (Garcia-Sierra, Diehl, & Champlin, 2009). The bilinguals' category boundaries shifted toward the voicing lead (negative VOT) end of the continuum in Spanish mode and toward lag VOT values of English in English mode. Despite the small shift in boundary with the two language modes, the bilinguals nonetheless showed an English-like phonemic boundary overall, and this was observed even though 12 of the 15 bilinguals had acquired Spanish as their L1. All had reported that they were more proficient at speaking, reading, writing and understanding in English (6.5 out of 7) than in Spanish (5.6). All were students at the University of Texas and reported that they used English (60%) more than Spanish (40%) on a daily basis, with their teachers, classmates and friends. Therefore, it is likely that the bilinguals were L2 dominant,2 and this was reflected in their perceptual performance.

This review of the bilingual perception literature demonstrates that language mode influences bilinguals' perception, and language dominance appears to be an important factor that mediates whether language mode effects will emerge. However, none of these perceptual studies explicitly controlled for the environment in which the bilinguals acquired their two languages, or their language dominance at the time of testing. Therefore, in order to clearly understand the influence of language dominance and language mode on bilingual perception, it is necessary to use strict selection criteria and test the perception abilities of bilinguals who are clearly L2 dominant, and to do so in L2 versus L1 language modes. If both language mode and language dominance affect bilinguals' speech perception, theories of L2 segmental perception (e.g., PAM-L2) will need to be extended to account for these factors.

Section snippets

The present study

To test the effects of language mode and language dominance on native speech perception, we recruited L2-dominant, early bilinguals who were born in Australia, acquired their L1 (Greek) from their native-speaking parents and family members in Australia, and first began speaking their L2 (English) when they attended school. These participants became L2-dominant and now have larger L2 vocabularies and use their L2 in a wider variety of social contexts, but maintain their L1 in certain contexts on

Participants

Twenty Australian-English monolinguals (Mage=23.4 years), 20 Greek monolinguals (Mage=26.1 years), and 40 Greek–English bilinguals participated. The bilinguals were randomly assigned to two groups that were placed in different unilingual language modes (English mode Mage=25.9 years; Greek mode Mage=25.0 years; 10 males, 10 females in each group). The English monolinguals were undergraduate students at the University of Western Sydney and participated in exchange for course credit. The Greek

Categorization of word-initial stops (CV)

Mean percentage categorization of initial-position bilabial and coronal stops are presented in Table 1, Table 2, respectively. Italics indicate the speakers' intended category. In past work, assimilation patterns have been computed for individual listeners, but this was not feasible here due to the modest number of categorization trials. Due to the relatively small number of response options, we adopted a categorization criterion of 70%, rather than 50% or 90%, both of which have been applied

Experiment 2: Word-medial stop-voicing contrasts in stressed syllables

We presented English and Greek voiced and voiceless stops in intervocalic (VˈCV) context to test whether Greek listeners are sensitive to voicing differences between stops in medial position. We presented word-medial stops as the onsets of stressed syllables in iambic disyllables (second syllable stress) because they have similar VOTs to initial-position stops, allowing straightforward comparison with the results of Experiment 1.8

Method

The participants were the same groups of monolingual and bilingual listeners tested in Experiment 1. The experimental testing procedure was identical to that employed in Experiment 1. The medial stops were presented randomly among the word-initial stops reported in Experiment 1.

We controlled the variability regarding the prenasalization of Greek medial stops by testing the listeners' perception of stops in medial intervocalic context in iambic (second syllable stress) disyllables (VˈCV). The

Categorization and rating of intervocalic stops in stressed position (VˈCV)

As in Experiment 1, monolinguals consistently categorized stop consonants of their native language. Categorization and goodness rating data for intervocalic bilabial and coronal stops are presented in Table 4, Table 5, respectively.

When presented with nonnative stops, unlike in Experiment 1, and consistent with our predictions, English monolinguals met the categorization criterion for the intervocalic Greek stops. In contrast, Greek monolinguals failed to reliably categorize English /aˈda/

General discussion

These experiments demonstrate that Greek–English L2-dominant early bilinguals are very accurate, relative to monolinguals in either language, in identifying L1 and L2 phones in two phonetic environments (Experiment 1: word-initial, Experiment 2: word-medial stress-initial). We found no evidence that the bilinguals confuse the phonological status in each language of the phonetically similar short-lag stops of Greek (which they identified as phonologically voiceless, appropriate to Greek) and

Acknowledgments

This research was supported by a UWS Postgraduate Research Award, Marcs Institute Fieldwork Travel grant, NIH grant DC00403 (PI: C. T. Best), and Australian Research Council grant DP0880913 (CI: M. Tyler).

References (48)

  • N. Sebastián-Gallés et al.

    Online processing of native and non-native phonemic contrasts in early bilinguals

    Cognition

    (1999)
  • Arvaniti, A. (1999). Greek voiced stops: Prosody, syllabification, underlying representations or selection of the...
  • A. Arvaniti et al.

    Variation in voiced stop prenasalization in Greek

    Glossologia

    (2000)
  • A. Arvaniti et al.

    Early Modern Greek /b d ɡ/: Evidence from Rebetika and folk songs

    Journal of Modern Greek Studies

    (2004)
  • Best, C. T. (1994). Learning to perceive the sound pattern of English. In C. Rovee- Collier, & L. P. Lipsitt (Eds.),...
  • C.T. Best

    A direct realist view of cross-language speech perception

  • C.T. Best et al.

    Nonnative and second-language speech perception: Commonalities and complementarities

  • R.L. Bundgaard-Nielsen et al.

    Vocabulary size matters: The assimilation of L2 Australian English vowels to L1 Japanese vowel categories

    Applied Psycholinguistics

    (2011)
  • A. Caramazza et al.

    The acquisition of a new phonological contrast: The case of stop consonants in French–English bilinguals

    Journal of the Acoustical Society of America

    (1973)
  • F. Cox et al.

    Australian English

    Journal of the International Phonetic Association

    (2007)
  • J.L. Elman et al.

    Perceptual switching in bilinguals

    Journal of the Acoustical Society of America

    (1977)
  • J.E. Flege

    Production and perception of a novel, second-language phonetic contrast

    Journal of the Acoustical Society of America

    (1993)
  • J.E. Flege

    Second language speech learning: Theory, findings, and problems

  • J.E. Flege

    The relation between L2 production and perception

  • Cited by (74)

    • The role of L2 experience in L1 and L2 perception and production of voiceless stops by English learners of Spanish

      2021, Journal of Phonetics
      Citation Excerpt :

      This difference may be explained in terms of the possible problems with language mode in the Spanish task alluded to above. The activation of the English phonological system under the two task conditions may have resulted in similar identification responses across tasks (but cf. Antoniou, Tyler, & Best, 2012 for a lack of language mode effect in L1 and L2 stop discrimination). Alternatively, it is possible that the two dimensions require different control mechanisms and that are acquired – at least to a certain extent – independently (De Leeuw et al., 2019).

    • A non-contrastive cue in spontaneous imitation: Comparing mono- and bilingual imitators

      2021, Journal of Phonetics
      Citation Excerpt :

      As post-onset f0 is contrastive in Seoul Korean, the bilingual listeners are better at detecting the acoustic difference involving the phonetic property. This further supports the claim that the influence of bilingual speakers' L1 may differ depending on the task (e.g., Antoniou et al., 2012). In the oddity discrimination task, which draws the listeners' attention to acoustic details, the bilingual speakers are influenced by the cue-weighting of their native, and presumably dominant, language (see section 2.1.2).

    • Recognition of code-mixed gated words: Phonotactic features of Spanish and English

      2021, Revista de Logopedia, Foniatria y Audiologia
      Citation Excerpt :

      Hence, features such as consonant voicing and vowel tenseness may aid in bilingual word identification. Code switching and word identification are areas that have been under-investigated (Antoniou, Best, Tyler, & Kroos, 2011; Antoniou, Tyler, & best, 2012; Bürki-Cohen, Grosjean, & Miller, 1989; Li, 1996). Antoniou et al. (2011) stated that, “…code-switching provides the unique opportunity of observing the immediate phonetic effects of L1–L2 interaction” (p. 558).

    • CONCEPTUALLY CUED PERCEPTUAL CATEGORIZATION in ADULT L2 LEARNERS

      2021, Studies in Second Language Acquisition
    View all citing articles on Scopus
    View full text