Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Bilingualism and working memory performance: Evidence from a large-scale online study

  • Karolina M. Lukasik ,

    Roles Formal analysis, Investigation, Writing – original draft

    klukasik@abo.fi

    Affiliation Department of Psychology, Åbo Akademi University, Turku, Finland

  • Minna Lehtonen,

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliations Department of Psychology, Åbo Akademi University, Turku, Finland, Cognitive Brain Research Unit, Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland, MultiLing Center for Multilingualism in Society across the Lifespan, Department of Linguistics and Scandinavian studies, University of Oslo, Oslo, Norway

  • Anna Soveri,

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliation Turku Brain and Mind Center, University of Turku, Turku, Finland

  • Otto Waris,

    Roles Conceptualization, Data curation, Methodology, Software, Writing – review & editing

    Affiliation Department of Psychology, Åbo Akademi University, Turku, Finland

  • Jussi Jylkkä,

    Roles Writing – review & editing

    Affiliation Department of Psychology, Åbo Akademi University, Turku, Finland

  • Matti Laine

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    Affiliations Department of Psychology, Åbo Akademi University, Turku, Finland, Turku Brain and Mind Center, University of Turku, Turku, Finland

Abstract

The bilingual executive advantage (BEA) hypothesis has attracted considerable research interest, but the findings are inconclusive. We addressed this issue in the domain of working memory (WM), as more complex WM tasks have been underrepresented in the previous literature. First, we compared early and late bilingual vs. monolingual WM performance. Second, we examined whether certain aspects of bilingual experience, such as language switching frequency, are related to bilinguals’ WM scores. Our online sample included 485 participants. They filled in an extensive questionnaire including background factors such as bilingualism and second language (L2) use, and performed 10 isomorphic verbal and visuospatial WM tasks that yielded three WM composite scores (visuospatial WM, verbal WM, n-back). For verbal and visuospatial WM composites, the group comparisons did not support the BEA hypothesis. N-back analysis showed an advantage of late bilinguals over monolinguals and early bilinguals, while the latter two groups did not differ. This between-groups analysis was followed by a regression analysis relating features of bilingual experience to n-back performance, but the results were non-significant in both bilingual groups. In sum, group differences supporting the BEA hypothesis were limited only to the n-back composite, and this composite was not predicted by bilingualism-related features. Moreover, Bayesian analyses did not give consistent support for the BEA hypothesis. Possible reasons for the failure to find support for the BEA hypothesis are discussed.

Introduction

Bilinguals’ possible advantage over monolinguals in executive tasks (bilingual executive advantage, BEA) has been a topic of intensive research interest and growing controversy especially during the past decade [16]. Earlier studies appeared to paint a relatively consistent picture where bilinguals performed better than monolinguals on a range of mostly nonverbal tasks calling for executive functions, such as conflict resolution [7,8] and task switching [9]. BEA has been assumed to stem from the lifelong experience of managing two languages that requires inhibition of the currently irrelevant language, switching between languages, and monitoring the language context to choose the appropriate target language [10]. As bilingual experience varies, e.g., depending on age of acquisition of the second language, language proficiency and time spent using both languages, so may also its effects on executive function [11].

However, the earlier studies supporting the BEA hypothesis have been challenged by several more recent reports that have failed to replicate these findings [1215], and the existence of BEA has been questioned [3,4]. The field has also evidenced a publication bias, with null or negative findings being published less often than positive ones [6]. After controlling for publication bias, a recent extensive meta-analysis on 152 studies on adults found no systematic support for BEA in any of the six studied executive domains (inhibitory control, set shifting, monitoring, working memory, attention, verbal fluency) [16].

One important cognitive domain where the putative BEA has received increasing research interest during the last decade is working memory (WM). It is a temporary memory system that acts as a constantly updated “mental platform” for information relevant for current activities and goals that we are focusing at. WM skills have been linked to many important aspects of everyday life, ranging from language learning and academic achievement to mental health (for example, see [17,18]). In the area of bilingualism, a possible enhancement of WM could stem from the constant “juggling” of two languages in the mind [19] which calls for WM resources and their efficient allocation [20]. From this perspective, bilinguals’ lifelong practice of managing two languages could enhance WM capacity or efficiency.

As with other cognitive domains, results from individual studies contrasting bilinguals’ and monolinguals’ WM performance have so far been inconsistent. In recent years, four meta-analyses have addressed the putative bilingual advantage in WM [2123, 16]. Adesope and colleagues [21], investigating the effects of bilingualism on cognition in children and young adults, pooled together various WM measures from four studies and found a moderate BEA effect (g = .48). Grundy and Timmer [22] reported a small to medium population effect (ρ = .2) in their meta-analysis of 27 studies conducted on both children and adults, leading the authors to conclude that bilinguals have a small advantage over monolinguals in WM. Grundy and Timmer also performed moderator analyses on age group (children, young adults, older adults), task type (verbal vs. nonverbal), and task language for the verbal WM tasks (dominant L1 vs. second language, L2). These analyses indicated the greatest WM advantage for children as compared to young and older adults. The analysis showed no effect of task type on the magnitude of the BEA, but a greater advantage when the bilinguals were completing a verbal WM task in their L1 than in L2. Grundy and Timmer [22] attribute the larger BEA in children’s WM performance to the high neurocognitive demands that children face when learning a new language. However, they did not study whether the outcomes depended on matching of the bilingual and monolingual samples on socioeconomic status (SES) or education. This issue is of particular importance, as SES and education have been shown to influence WM performance [24,25], but unfortunately, studies comparing monolingual and bilingual groups do not always mention whether the groups were matched according to these pertinent background factors. The meta-analysis by von Bastian and colleagues found a small but positive effect (g = .11) of bilingualism on WM performance. However, they also report great heterogeneity in the data, but no task- or background-related moderators that could account for the variability [23].

The fourth and most recent meta-analysis by Lehtonen and colleagues [16] analyzed 251 effect sizes from four separate WM paradigms (complex span, simple span, letter-number sequencing, and n-back) and found a very small positive effect of bilingualism on WM performance (Hedge’s g = .07), which disappeared after correcting for the observed publication bias. It also became nonsignificant when including only original studies that had explicitly matched the participant groups, e.g., with regard to education. The comparison between verbal and nonverbal WM tasks revealed that the effect size was smaller for verbal tasks (Hedge’s g = .01) than for nonverbal tasks (Hedge’s g = .30). However, when correcting for bias in this analysis, even the nonverbal advantage vanished. In the moderator analysis on the role of L2 age of acquisition (AoA), it was found that early acquirers (i.e., AoA = < 6 years) showed a small WM-related bilingual advantage (Hedge’s g = .23, p < .01) that late acquirers did not exhibit (Hedge’s g = .02, p = .735), but also this advantage disappeared when correcting for publication bias. No moderating effects on WM were found for L2 proficiency.

All in all, three of the four earlier meta-analyses [2123] suggested that there is a small-to-moderate positive effect of bilingualism on WM. Two of those studies [21, 22] also found evidence of a publication bias, while von Bastian and colleagues [23] report no asymmetry between the publication of positive and negative results. In turn, the most recent and largest meta-analysis to date by Lehtonen and colleagues [16] found no positive effect in adults after correcting for the asymmetry between positive and negative results. However, one should note that the previous studies on WM and thus the data available for meta-analyses have been dominated by simple span tasks, and more taxing and thereby potentially more sensitive WM tasks (n/back, complex span, letter-number sequencing) have been utilized in fewer studies. Moreover, WM studies using verbal tasks may have been affected by bilinguals’ general disadvantage on verbal measures when compared with monolingual speakers [26,27]. Thus, further research with a wider range of WM tasks is warranted.

It is important to highlight that bilingualism represents a spectrum rather than a categorical variable [11,28,29]. Balanced bilingualism, or equilingualism, is a rare phenomenon, and language usage can change throughout a person’s lifetime in a dynamic fashion. Therefore, in this study, we defined bilingualism in a broad sense as the ability to speak two languages–regardless of AoA, proficiency, and recent language use–while acknowledging that these factors, among others, characterize individual bilingual experience. This large individual variability in bilinguals also opens the way to explore the associations between features of bilingual experience and level of executive function. The latter correlative approach can be seen as a complement to bilingual-monolingual contrasts that directly address BEA. If some features of bilingual experience indeed enhance executive function, it should be possible to find positive correlations between those features and executive task performance. For example, one could hypothesize that earlier AoA would predict better executive task performance, as a person would have had longer practice in using that language and, by extension, more executive training [30]. Another general feature of bilingual experience, L2 proficiency, might also modulate level of executive task performance: it has been suggested that higher L2 proficiency could pose more demands on interference management and lead to better EF [31, 32], or that lower L2 proficiency would necessitate stronger inhibition of L1 [3335]. With regard to more qualitative features of bilingual experience that might enhance executive functions, one prominent candidate is the frequency of switching between two languages [36], as it arguably requires suppressing one language and activating the other [9, 37]. Thus, bilinguals who switch more often might show better executive task performance on, e.g., complex WM tasks. This idea, however, has been challenged by studies that found no link between frequency of language switching and the commonly used measures of resolving incongruency on executive tasks [15, 38], and it has been suggested that language switching does not interact with general executive mechanisms in a consistent fashion [39].

Thus far, possible associations between features of bilingual experience and WM performance have been explored only in a limited number of studies. In 6-year-old bilingual children, Blom and colleagues [40] found that higher bilingual proficiency was related to a better performance on a backward digit recall task. Tse and Altarriba [41] found that higher L2 proficiency in bilingual children predicted better verbal WM. While the child research cited above provides some support to the BEA hypothesis, a study by Soveri and colleagues [36] with bilingual adults failed to find significant correlations between the n-back effect and frequency of language switching (as measured by the Bilingual Switching Questionnaire developed by Rodríguez-Fornells and colleagues [42]), L2 AoA, or frequency of everyday use of both languages. Neither did Jylkkä and colleagues [39] find support to the BEA hypothesis in the domain of WM: in their study of adult bilinguals, higher rates of a subtype of language switches, unintended switches, were associated with worse WM updating as measured by n-back. Unintended switches may simply reflect lapses in cognitive control. All in all, there is conflicting evidence concerning the relationships between bilingual experience and executive functions including WM. However, the limited number of existing studies prompts further experiments, especially because such relationships, if they are found, could provide hints on the underlying cognitive mechanisms of the putative BEA.

The present study

In the present study, we set out to investigate the associations between bilingualism and WM in large, diverse, and well-matched groups of monolinguals, early bilinguals, and late bilinguals. We measured WM with three composite scores that reflect the previously reported latent structure of WM in this sample [43]. To the best of our knowledge, this is the first study that combines the use of an extensive WM task battery and composite scores based on the latent structure of the WM tasks in a study of BEA. This is relevant because individual WM tasks tend to have low intercorrelations and cannot be used interchangeably [44, 45]. We employed two types of analyses. First, we tested directly the BEA hypothesis by comparing WM performance in monolinguals, early bilinguals, and late bilinguals. The BEA hypothesis predicts that bilinguals would outperform monolinguals on WM measures. The difference might be larger for early bilinguals whose longer bilingual experience would have provided more executive practice in managing, switching and monitoring two languages; on the other hand, some studies suggest that for late and unbalanced bilinguals, controlling both languages is more taxing on the EF[33, 37]. Second, we explored whether certain bilingualism-related factors (L2 AoA, L2 proficiency, amount of language switching) would predict WM performance within the bilinguals. If BEA stems from bilingual experience, some crucial aspects of that experience should correlate with measures of executive performance such as success on WM tasks.

Methods

Ethics statement

The study was approved by the Joint Ethics Committee at the Departments of Psychology and Logopedics, Åbo Akademi University. Informed consent was obtained from all participants, participation was anonymous, and all participants were informed of their right to withdraw from the study at any time. They received a $10 payment for their participation.

Participants and procedure

We recruited the participants through the Amazon Mechanical Turk crowdsourcing site [46]and the data (questionnaires and WM measures) were collected using an in-house Java-based test platform. This study used the same data as Waris and colleagues [43]). Overall, 711 U.S. American adult participants completed the study. We excluded 43 participants due to having reported using external tools during task performance, missing data, or taking over 24 hours to complete the study. As depressive symptoms may affect WM performance (see for[47]), we excluded the 136 participants whose scores on the Quick Inventory of Depressive Symptomatology (QIDS;[48]) corresponded to moderate, severe or very severe symptom occurrence, as well as 16 participants who had missing data on the QIDS. Furthermore, we excluded 13 participants whose scores were outliers in the WM tasks according to Mahalanobis distance [49]. At this point, the sample consisted of 503 participants.

We further excluded participants who declared that English was their L2, as we wanted the participants to complete tasks in their first language [16, 22]. We also excluded participants who gave “Latin” (or nonsensical answers) as their L2; however, language proficiency in either L1 or L2 was not an exclusion criterion. These criteria resulted in excluding 7 people. We then excluded 11 participants with missing data on either L2 AoA or L2 proficiency. The final sample comprised 485 participants.

Out of the 485 participants, 265 spoke a second language. 115 reported having learned L2 in the first 12 years of their life, which identified them as early bilinguals. The 150 participants who had acquired their L2 after the age of 12 were classified as late bilinguals. Demographic characteristics of the three language groups (monolinguals, early bilinguals and late bilinguals) are given in Table 1.

thumbnail
Table 1. Summary of the background characteristics of the sample.

https://doi.org/10.1371/journal.pone.0205916.t001

For WM assessment, we used ten tasks: simple span tasks (backward and forward), complex span tasks, running memory tasks, and 2-back tasks. Each task had two isomorphic versions (numerical-verbal and visuospatial; for details, see [43]). Based on the factor analyses by Waris and colleagues [43], we calculated three composite accuracy scores reflecting the following latent factors: verbal WM (4 tasks), visuospatial WM (4 tasks), and n-back (2 tasks). The scores were Z-transformed and then summed and averaged.

To investigate the language background of the participants, we used several questions (see Table 2). First, the participants were asked to list all the languages they had learned or studied (including the native language) as well as AoA and self-reported proficiency (on a Likert-type scale ranging from 1—Beginner, to 6—Native-level mastery) for each language. Finally, one question assessed bilingual language switching frequency, tapping the overall number of switches (On average, I switch between different languages [x] times a day. The response was given on a Likert-type 1–5 scale where 1 = 0–2 switches, 2 = 3–10 switches, 3 = 11–30 switches, 4 = 31–60 switches, and 5 = over 60 daily switches.

thumbnail
Table 2. Bilingual language experience variables in the two bilingual groups.

https://doi.org/10.1371/journal.pone.0205916.t002

L2 AoA, self-reported proficiency, percentage of L2 use in the last two years, and switching are reported in Table 2. The most common second languages were Spanish (141 speakers), French (41 speakers) and German (20 speakers).

Results

Comparison of background factors between language groups

We investigated whether there were significant differences in the background factors age, education, childhood socioeconomic status and gender between the three groups. A main effect of age was found [F(2,484) = 3.85, p < .05]: mean age in the monolingual group was significantly higher than in the early bilingual group (p < .01), but there was no significant difference between early and late bilingual groups, or between late bilinguals and monolinguals. We also noted a main effect of education [F(2,484) = 6.16, p < .01]. Post-hoc tests revealed that the monolingual group differed significantly from both the early and late bilingual groups, being on average less educated (p < .05). There were no significant differences between the two bilingual groups. The three groups did not differ significantly as regards childhood socioeconomic status [F(2,482) = .994, p = .37]. There was no significant difference between the groups as regards gender [χ2(4) = 3.23, p = .52].

Sample matching

Due to the significant differences between groups in background variables, we decided to use a matching procedure to ensure that the covariates were distributed equally between the three language groups. We used the MatchIt package for statistical software R [50, 51]. MatchIt enables researchers to match two samples within one study according to select variables, and allows a variety of matching algorithms to be used. Whereas matching based on Mahalanobis distance or Propensity Score Matching would usually be the method of choice, we could not apply it since the assumption of normal distribution was not met in our data, and thus the use of those standard procedures might increase bias in the data rather than limit it [52]. Therefore, we applied a so-called genetic matching procedure that uses an evolutionary algorithm to create a set of solutions and determine which one produces the best match[52, 53]. This procedure matches groups on variables of choice by dropping out participants in the groups as well as by ascribing weights to each participant. The weights are used to correct for unequal variances and counteract heteroscedasticity in the data. This method has been successfully used in social sciences, most notably in the re-analysis of the National Supported Work Demonstration Program job training experiment [54, 55].

For the genetic matching procedure, we treated both early and late bilinguals as the reference group, and monolinguals as the matched group. We matched the groups on age and education simultaneously. We did not consider gender or childhood SES as basis for matching, since there was no difference on those variables between the unmatched groups. MatchIt allows the user to specify the parameters of matching, and we chose to discard data from both reference and matched groups if it ensured the best fit. The matching resulted in leaving out 44 participants from the monolingual group and ascribing weights (later used in analyses as weighted least squares) to each participant. Thus, the final matched sample consisted of 176 monolinguals, 115 early bilinguals and 150 late bilinguals. The three groups did not differ significantly on any of the background variables (p’s > .1).

Descriptive data

The means and standard deviations on the three WM composite measures for the monolingual, early bilingual and late bilingual groups are reported in Table 3. To see if education and childhood SES correlated with WM performance in our sample [24, 25], we investigated Pearson’s correlations between age, education, childhood SES and the three WM composite scores (see Table 4). Age correlated negatively with the visuospatial WM and n-back composites. Education or childhood SES did not correlate significantly with any of the WM measures.

thumbnail
Table 3. Mean WM performance on the three composite scores in the monolingual, early bilingual and late bilingual groups.

All scores have been z-transformed, then summed and averaged. Higher positive values indicate better performance.

https://doi.org/10.1371/journal.pone.0205916.t003

thumbnail
Table 4. Pearson’s correlations between age, education, childhood SES and WM scores in the matched sample (n = 441).

https://doi.org/10.1371/journal.pone.0205916.t004

Differences in language proficiency and language use between early and late bilinguals

We tested whether the two bilingual groups were equal as regards language proficiency, frequency of L2 use and language switching. T-tests between early and late bilinguals revealed significant differences between these two groups. Early bilinguals were significantly more proficient in their L2 (t(263) = 6.94, p < .001). They also used their L2 significantly more often (t(262) = 4.36, p < .001), and declared more frequent switching between languages (t(140) = 4.19, p < .001).

Visuospatial WM performance in the three language groups

To explore possible differences on visuospatial WM between the three language groups, we conducted an ANCOVA with the visuospatial WM composite as the dependent variable. Age, education and childhood SES served as covariates, and language group (monolingual; early bilingual; late bilingual) as the independent variable. Weights ascribed by the MatchIt algorithm were used as weighted least squares. The analysis revealed a significant main effect of language group [F(2,435) = 6.91, p < .01; partial η2 = .031], stemming from higher accuracy scores in the two bilingual groups compared to the monolinguals. Age was a significant covariate (p < .01), with higher age being related to a lower visuospatial WM performance. Pairwise comparisons revealed a significant mean difference between monolinguals and early bilinguals (p < .01) and between monolinguals and late bilinguals (p < .01). The difference between early and late bilinguals was not statistically significant (p > .05). The group differences are shown in Fig 1. In order to obtain a more informative measure of how strongly the data supported the hypothesis of a difference between the language groups, we conducted a Bayesian ANCOVA. The Bayes Factor for language group compared to a null model with just the covariates was BF10 = 2.46, indicating that the data provided only anecdotal evidence for the hypothesis that there is a difference in visuospatial WM between the language groups [56].

thumbnail
Fig 1. Mean visuospatial WM performance in the three language groups.

Bars represent standard errors.

https://doi.org/10.1371/journal.pone.0205916.g001

Verbal WM performance in the three language groups

We conducted an ANCOVA with the verbal WM composite as the dependent variable, age, education and childhood SES as covariates, and language group (monolingual; early bilingual; late bilingual) as the independent variable. As previously, MatchIt weights were used as weighted least squares. Childhood SES was a significant covariate (p < .05): lower childhood SES was associated with better WM performance. However, a closer examination revealed that this effect was driven by the disproportionate distribution of responses across SES categories. The covariate education was near-significant (p = .057); higher education tended to be associated with better verbal WM performance. The effect of language group was not significant, (F(2,435) = 1.89, p = .15). Bayesian ANCOVA yielded a Bayes Factor of BF10 = .04 for the effect of language group, that is, there was substantial evidence for the null hypothesis that there is no difference between the language groups (BF01 = 25).

N-back task performance in the three language groups

ANCOVA with the n-back task composite as the dependent variable, age and education as covariates, and language group (monolingual; early bilingual; late bilingual) as the independent variable, using MatchIt weights as weighted least squares, showed a significant effect of language group [F(2,435) = 7.36, p < .01; partial η2 = .033]. This stemmed from an advantage of late bilinguals over monolinguals and early bilinguals. Monolinguals did not differ significantly from early bilinguals (p > .05), but they did differ from late bilinguals (p < .01). The difference between early and late bilinguals was significant as well (p < .01). Moreover, age was a significant covariate (p < .01), with lower age being related to a better performance in the n-back tasks. The group differences are shown in Fig 2. Bayesian ANCOVA yielded a Bayes Factor BF10 = 10.98 for language group as compared to a null model with just the covariates, indicating substantial support for the hypothesis that the language groups differ in the n-back task.

thumbnail
Fig 2. Differences between language groups in the n-back performance.

Bars represent standard errors.

https://doi.org/10.1371/journal.pone.0205916.g002

The n-back composite analyzed above included both verbal and visuospatial n-back tasks. Given that there was some indication for larger differences between the language groups in visuospatial than in verbal WM, we ran additional analyses separately for these two n-back tasks. For both tasks, ANCOVAs revealed a significant main effect of language group (verbal n-back task: F(2,435) = 4.15, p < .01; visuospatial n-back task: F(2,435) = 5.73, p < .01), and age was a significant covariate, with younger participants achieving higher scores. In the verbal n-back task, late bilinguals were significantly more accurate than monolinguals (p < .05) and early bilinguals (p < .05), while monolinguals did not differ significantly from early bilinguals (p > .05). In the corresponding Bayesian ANCOVA, the Bayes Factor for language group was BF10 = 0.28, giving moderate evidence for null hypothesis. Likewise, for the visuospatial n-back task late bilinguals achieved significantly higher accuracy than early bilinguals (p < .05) and monolinguals (p < .01). There was no significant difference between early bilinguals and monolinguals (p > .05). In the Bayesian ANCOVA, the Bayes Factor for language group was BF10 = 8.26.

Overall, these results suggest that bilingualism is associated with higher scores in the n-back and visuospatial WM tasks, but the Bayesian analyses supported group differences only for the visuospatial n-back performance. Moreover, in the n-back task, the bilingual advantage was specifically due to higher performance of late bilinguals compared to the other groups. In the next step, we employed a within-group approach to study whether some aspects of bilingual experience were associated with n-back performance, the only WM composite that showed consistent evidence for language group differences. If the BEA hypothesis holds, one would expect that critical aspects of bilingual experience would be associated with performance on an executively loaded measure (here the n-back composite) that at group level has shown a bilingual advantage.

Associations between bilingual experience and WM performance

In hierarchical regression analyses of predictors of n-back performance, we used a within-subjects design and considered all bilingual participants (both early and late) as one group, and AoA as a continuous variable. Model 1 was our baseline model, including the three general background variables age, education, and childhood socioeconomic status. Model 2 included the additional bilingualism-related factors L2 AoA, L2 proficiency and frequency of language switching, and it was compared to Model 1 which served as the null model in this comparison. Table 5 summarizes both models.

thumbnail
Table 5. Hierarchical regression models for general background variables and bilingual language features as predictors of n-back performance.

https://doi.org/10.1371/journal.pone.0205916.t005

Model 1 was not significant (adjusted R2 = .015, F(3,261) = 2.314, p = .076). Bayesian analysis supported the null model (BF₁₀ = .16).

Model 2 was not significant either (adjusted R2 = .025, F(3,258) = 1.91, p = .13). As in Model 1, there was a negative correlation between age and n-back performance (standardized β = -.17, p = .01). Model 2 was not supported by Bayesian analysis either; it did not have a higher fit than Model 1 (BF₁₀ = .35).

We also conducted the same hierarchical regression analysis for n-back performance in late bilinguals only, prompted by the ANCOVA results where late bilinguals outperformed the other groups. Neither model was significant or supported by Bayesian analysis [Model 1: adjusted R2 = .005, F(3,146) = 1.275, p = .285; BF₁₀ = 0.08; Model 2: adjusted R2 = 0, F(6,143) = .979, p = .44; BF₁₀ = 0.01].

Discussion

We investigated the BEA hypothesis in the domain of working memory. WM was measured by three WM composites that were derived from a previous latent factor analysis [43]. We applied two complementary analysis approaches. First, we compared monolingual, early bilingual and late bilingual groups to each other on the three WM measures. Second, we examined whether the WM measure showing a significant bilingual advantage (n-back composite) was associated with some key features of bilingual experience (L2 AoA, L2 proficiency, frequency of language switching) within the bilingual participants. The study was motivated by the fact that most of the existing studies on bilingualism and WM have employed primarily simple span tasks and small samples, and the groups have not necessarily always been fully matched on relevant background variables. We addressed these issues by recruiting a large online sample, matching mono- and bilingual participants on several background variables, using an extensive WM test battery that included more complex tasks, and basing our WM measures on the latent structure of the tasks.

The group comparisons between monolinguals, early bilinguals and late bilinguals showed statistically significant effects for the n-back and visuospatial WM performance, but not for the verbal WM composite. On the other hand, Bayes factors provided evidence for a group difference only in the n-back task, while in the verbal WM composite there was evidence for a lack of group difference and in the visuospatial WM composite there was no evidence for either hypothesis. For the n-back, we observed higher scores in the late bilingual group, while early bilinguals were comparable to monolinguals. In visuospatial WM, the difference between early and late bilinguals was not statistically significant, and both the early and late bilingual groups performed better than the monolingual group. It is, however, important to highlight that the observed effect size was quite small (η2 = .02) and the Bayes factor did not support the group difference (BF10 = 2.46).

Despite this variability, we can conclude that the present results are broadly in line with earlier meta-analyses that have reported small advantages on WM in bilinguals [21, 22], albeit these effects did not survive correction for publication bias in the most recent extensive meta-analysis by Lehtonen and colleagues [16]. Also in line with Lehtonen et al. [16], the effect size appeared larger in nonverbal than in verbal WM. A possible reason for this is that bilinguals, who by definition have had less exposure to each language, show some disadvantages in the verbal domain when compared with monolinguals.

As the present correlative findings cannot establish causality, it could also be that the better WM updating performance (as reflected by the comparatively higher n-back performance) in our late bilinguals reflects higher baseline executive skills. According to this account, well-developed executive abilities would have enabled more efficient L2 learning in our late bilinguals. It has been shown that WM performance is positively correlated with novel word learning (for an overview, see [57]). In their meta-analysis, Linck and colleagues [58] focused specifically on late bilinguals and proficient L2 learners, showing that L2 processing and proficiency measures were positively associated with WM. They hypothesized that people with greater WM capacity are more likely to succeed in learning a second language. In a pre-post longitudinal study with adult L2 learners in classroom settings, Linck and Weiss [59] indeed found that greater WM resources predicted L2 proficiency at posttest 8 weeks later. It could also be that there were some aspects of language learning in the late bilingual group (for example, distinctive language strategies) which promoted WM updating. Furthermore, according to the controlled dose hypothesis recently suggested by Paap ([60], in press), late bilinguals might be experiencing a “boost” in executive performance due to cognitive demands of L2 use which has not yet been automatized (see also [61]). Considering the possible influence of non-linguistic factors, our study would have benefited from employing a non-verbal intelligence measure, as WM updating tasks have been shown to correlate positively with Raven’s matrices scores [45]. Probing the participants’ non-verbal intelligence would have allowed us to control for baseline differences more comprehensively.

We followed up the only clearcut group difference, the late bilinguals’ advantage on the n-back composite, by examining whether some key features of bilingual experience (L2 AoA, L2 proficiency and frequency of language switching) together with demographic factors would predict n-back performance within the bilingual group. Our results were negative for all three variants as the regression models failed to significantly account for variation in n-back performance. Thus, either our measures of bilingual experience were faulty, or the observed group difference in n-back stems from some other, uncontrolled factors than bilingual experience.

Calvo and colleagues [62] have argued that bilingualism-related WM benefits may be specific to some components of WM only, being easily overshadowed by the great variability in WM tasks and the common use of verbal WM measures which can be disadvantageous to bilinguals. However, this idea does not quite fit to our finding that the only clearcut group difference concerned an updating composite including both verbal and visuospatial task variants, and that the variability on this composite was not explained by bilingualism-related factors. Yet another possible variant of the BEA hypothesis is that the type of bilingual experience may be crucial for the emergence of BEA. According to the Adaptive Control hypothesis put forth by Green and Abutalebi [63], the executive load (and thus the executive practice provided by bilingualism) depends on the type of communicative context that a bilingual person usually lives in. More specifically, they argue that a dual-language context where each language is used with different communication partners would tax executive functions most heavily. In contrast, dense code-switching and also single-language context would involve less executive load. Thus, it is possible that many of our bilinguals had not lived in the cognitively most demanding dual-language communicative context. It is also possible that the responses to our language switching question reflect a mixture of switches in dense code-switching and dual-language contexts, thus weakening our chances for observing a correlation between WM and switching frequency.

In summary, despite some group differences favoring bilinguals, our study failed to find consistent support for the BEA hypothesis in the domain of working memory. A major difficulty in this venue of research is that the original hypothesis does not delineate the exact conditions of bilingual experience that are required for BEA to emerge. So far, most studies on bilingualism have focused on comparing bilinguals to monolinguals, with less attention given to individual differences within the bilingual groups. If bilingual experience enhances EF, it is important to specify what aspects of this experience are beneficial. Future studies will show whether a more stringent version of the BEA hypothesis, such as the one based on the Adaptive Control model by Green and Abutalebi [63], would provide consistent, replicable effects. Meanwhile, the null hypothesis should prevail.

Supporting information

Acknowledgments

Matti Laine is supported by the Academy of Finland (grant #260276) and the Åbo Akademi University Endowment (the BrainTrain project). Minna Lehtonen is supported by Academy of Finland (grant #288880).

References

  1. 1. Costa A, Hernández M, Costa-Faidella J, Sebastián-Gallés N. On the bilingual advantage in conflict processing: Now you see it, now you don’t. Cognition. 2009 Nov 1;113(2): 135–49. pmid:19729156
  2. 2. Kousaie S, Phillips NA. Conflict monitoring and resolution: Are two languages better than one? Evidence from reaction time and event-related brain potentials. Brain Research. 2012 Mar 29;1446: 71–90. pmid:22356886
  3. 3. Paap KR, Greenberg ZI. There is no coherent evidence for a bilingual advantage in executive processing. Cognitive Psychology. 2013 Mar 1;66(2): 232–58. pmid:23370226
  4. 4. Paap KR. The role of componential analysis, categorical hypothesising, replicability and confirmation bias in testing for bilingual advantages in executive functioning. Journal of Cognitive Psychology. 2014 Apr 3;26(3): 242–55.
  5. 5. Paap KR, Sawi O. Bilingual advantages in executive functioning: problems in convergent validity, discriminant validity, and the identification of the theoretical constructs. Frontiers in Psychology. 2014 Sep 9;5:962. pmid:25249988
  6. 6. De Bruin A, Treccani B, Della Sala S. Cognitive advantage in bilingualism: An example of publication bias? Psychological Science. 2015 Jan;26(1): 99–107. pmid:25475825
  7. 7. Bialystok E, Viswanathan M. Components of executive control with advantages for bilingual children in two cultures. Cognition. 2009 Sep 1;112(3): 494–500. pmid:19615674
  8. 8. Bialystok E. Reshaping the mind: the benefits of bilingualism. Canadian Journal of Experimental Psychology/Revue canadienne de psychologie expérimentale. 2011 Dec;65(4):229. pmid:21910523
  9. 9. Prior A, MacWhinney B. A bilingual advantage in task switching. Bilingualism: Language and Cognition. 2010 Apr;13(2): 253–62.
  10. 10. Rodriguez‐Fornells A, De Diego Balaguer R, Münte TF. Executive control in bilingual language processing. Language Learning. 2006 Jul 1;56(s1):133–90.
  11. 11. Luk G, Bialystok E. Bilingualism is not a categorical variable: Interaction between language proficiency and usage. Journal of Cognitive Psychology. 2013 Aug 1;25(5): 605–21. pmid:24073327
  12. 12. Duñabeitia JA, Hernández JA, Antón E, Macizo P, Estévez A, Fuentes LJ, Carreiras M. The inhibitory advantage in bilingual children revisited: Myth or reality? Experimental Psychology. 2014;61(3): 234. pmid:24217139
  13. 13. Paap KR, Liu Y. Conflict resolution in sentence processing is the same for bilinguals and monolinguals: The role of confirmation bias in testing for bilingual advantages. Journal of Neurolinguistics. 2014 Jan 1;27(1): 50–74.
  14. 14. von Bastian CC, Souza AS, Gade M. No evidence for bilingual cognitive advantages: A test of four hypotheses. Journal of Experimental Psychology: General. 2016 Feb;145(2): 246. pmid:26523426
  15. 15. Paap KR, Myuz HA, Anders RT, Bockelman MF, Mikulinsky R, Sawi OM. No compelling evidence for a bilingual advantage in switching or that frequent language switching reduces switch cost. Journal of Cognitive Psychology. 2017 Feb 17;29(2): 89–112.
  16. 16. Lehtonen M, Soveri A, Laine A, Järvenpää J, de Bruin A, Antfolk J. Is bilingualism associated with enhanced executive functioning in adults? A meta-analytic review. Psychological Bulletin. 2018 Mar 1. pmid:29494195
  17. 17. Titz C, Karbach J. Working memory and executive functions: effects of training on academic achievement. Psychological Research. 2014 Nov 1;78(6): 852–68. pmid:24389706
  18. 18. Andreotti C, Thigpen JE, Dunn MJ, Watson K, Potts J, Reising MM, Robinson KE, Rodriguez EM, Roubinov D, Luecken L, Compas BE. Cognitive reappraisal and secondary control coping: associations with working memory, positive and negative affect, and symptoms of anxiety/depression. Anxiety, Stress & Coping. 2013 Jan 1;26(1): 20–35. pmid:22091879
  19. 19. Kroll JF, Dussias PE, Bogulski CA, Kroff JR. Juggling two languages in one mind: What bilinguals tell us about language processing and its consequences for cognition. In: Psychology of learning and motivation 2012 Jan 1 (Vol. 56, pp. 229–262). Academic Press.
  20. 20. Thorn AS, Gathercole SE. Language-specific knowledge and short-term memory in bilingual and non-bilingual children. The Quarterly Journal of Experimental Psychology: Section A. 1999 Apr 1;52(2): 303–24.
  21. 21. Adesope OO, Lavin T, Thompson T, Ungerleider C. A systematic review and meta-analysis of the cognitive correlates of bilingualism. Review of Educational Research. 2010 Jun;80(2): 207–45.
  22. 22. Grundy JG, Timmer K. Bilingualism and working memory capacity: A comprehensive meta-analysis. Second Language Research. 2017 Jul;33(3): 325–40.
  23. 23. von Bastian CC, De Simoni C, Kane M, Carruth N, Miyake A (November 2017). Does being bilingual entail advantages in working memory? A meta analysis. Presented at the 58th Annual Meeting of Psychonomic Society, Vancouver.
  24. 24. Evans GW, Schamberg MA. Childhood poverty, chronic stress, and adult working memory. Proceedings of the National Academy of Sciences. 2009 Apr 21;106(16): 6545–9. pmid:19332779
  25. 25. Hackman DA, Farah MJ. Socioeconomic status and the developing brain. Trends in Cognitive Sciences. 2009 Feb 1;13(2): 65–73. pmid:19135405
  26. 26. Gollan TH, Montoya RI, Fennema-Notestine C, Morris SK. Bilingualism affects picture naming but not picture classification. Memory & Cognition. 2005 Oct 1;33(7): 1220–34.
  27. 27. Luo L, Craik FI, Moreno S, Bialystok E. Bilingualism interacts with domain in a working memory task: Evidence from aging. Psychology and Aging. 2013 Mar;28(1): 28. pmid:23276212
  28. 28. Kroll JF, Bialystok E. Understanding the consequences of bilingualism for language processing and cognition. Journal of Cognitive Psychology. 2013 Aug 1;25(5): 497–514. pmid:24223260
  29. 29. Kaushanskaya M, Prior A. Variability in the effects of bilingualism on cognition: It is not just about cognition, it is also about bilingualism. Bilingualism: Language and Cognition. 2015 Jan;18(1): 27–8.
  30. 30. Luk G, Bialystok E, Craik FI, Grady CL. Lifelong bilingualism maintains white matter integrity in older adults. Journal of Neuroscience. 2011 Nov 16;31(46): 16808–13. pmid:22090506
  31. 31. Rosselli M, Ardila A, Jurado MB, Salvatierra JL. Cognate facilitation effect in balanced and non-balanced Spanish–English bilinguals using the Boston Naming Test. International Journal of Bilingualism. 2014 Dec;18(6): 649–62.
  32. 32. Yow WQ, Li X. Balanced bilingualism and early age of second language acquisition as the underlying mechanisms of a bilingual executive control advantage: why variations in bilingual experiences matter. Frontiers in Psychology. 2015 Feb 26;6: 164. pmid:25767451
  33. 33. Green DW. Mental control of the bilingual lexico-semantic system. Bilingualism: Language and Cognition. 1998 Aug;1(2): 67–81.
  34. 34. Linck JA, Hoshino N, Kroll JF. Cross-language lexical processes and inhibitory control. The mental lexicon. 2008 Jan 1;3(3): 349–74. pmid:19907674
  35. 35. Laine M, Lehtonen M. Cognitive consequences of bilingualism: where to go from here? Language, Cognition and Neuroscience. 2018 Apr 13: 1–8.
  36. 36. Soveri A, Rodriguez-Fornells A, Laine M. Is there a relationship between language switching and executive functions in bilingualism? Introducing a within group analysis approach. Frontiers in Psychology. 2011 Aug 1;2:183. pmid:21869878
  37. 37. Meuter RF, Allport A. Bilingual language switching in naming: Asymmetrical costs of language selection. Journal of Memory and Language. 1999 Jan 1;40(1): 25–40.
  38. 38. Paap KR, Sawi OM, Dalibar C, Darrow J, Johnson HA. The brain mechanisms underlying the cognitive benefits of bilingualism may be extraordinarily difficult to discover. AIMS Neuroscience. 2014 Dec 18;1(3):245–56.
  39. 39. Jylkkä J, Soveri A, Wahlström J, Lehtonen M, Rodríguez-Fornells A, Laine M. Relationship between language switching experience and executive functions in bilinguals: an Internet-based study. Journal of Cognitive Psychology. 2017 May 19;29(4): 404–19.
  40. 40. Blom E, Küntay AC, Messer M, Verhagen J, Leseman P. The benefits of being bilingual: Working memory in bilingual Turkish–Dutch children. Journal of Experimental Child Psychology. 2014 Dec 1;128: 105–19. pmid:25160938
  41. 41. Tse CS, Altarriba J. The effects of first-and second-language proficiency on conflict resolution and goal maintenance in bilinguals: Evidence from reaction time distributional analyses in a Stroop task. Bilingualism: Language and Cognition. 2012 Jul;15(3): 663–76.
  42. 42. Rodriguez-Fornells A, Kramer U, Lorenzo-Seva U, Festman J, Münte TF. Self-assessment of individual differences in language switching. Frontiers in Psychology. 2012 Jan 10;2:388. pmid:22291668
  43. 43. Waris O, Soveri A, Ahti M, Hoffing RC, Ventus D, Jaeggi SM, Seitz AR, Laine M. A latent factor analysis of working memory measures using large-scale data. Frontiers in Psychology. 2017 Jun 28;8:1062. pmid:28701976
  44. 44. Schmiedek F, Lövdén M, Lindenberger U. A task is a task is a task: putting complex span, n-back, and other working memory indicators in psychometric context. Frontiers in Psychology. 2014 Dec 23;5:1475. pmid:25566149
  45. 45. De Simoni C., & von Bastian C. C. (2018). Working memory updating and binding training: Bayesian evidence supporting the absence of transfer. Journal of Experimental Psychology: General, 147(6), 829. pmid:29888939
  46. 46. Crump MJ, McDonnell JV, Gureckis TM. Evaluating Amazon's Mechanical Turk as a tool for experimental behavioral research. PloS one. 2013 Mar 13;8(3):e57410. pmid:23516406
  47. 47. Salazar-Villanea M, Liebmann E, Garnier-Villarreal M, Montenegro-Montenegro E, Johnson DK. Depressive symptoms affect working memory in healthy older adult Hispanics. Journal of Depression & Anxiety. 2015 Oct;4(4). pmid:27104091
  48. 48. Rush AJ, Trivedi MH, Ibrahim HM, Carmody TJ, Arnow B, Klein DN, Markowitz JC, Ninan PT, Kornstein S, Manber R, Thase ME. The 16-Item Quick Inventory of Depressive Symptomatology (QIDS), clinician rating (QIDS-C), and self-report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biological Psychiatry. 2003 Sep 1;54(5): 573–83. pmid:12946886
  49. 49. Field A. Discovering statistics using IBM SPSS statistics. Sage; 2013.
  50. 50. Ho DE, Imai K, King G, Stuart EA. Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Political Analysis. 2007 Jan 31;15(3): 199–236.
  51. 51. Ho DE, Imai K, King G, Stuart EA. MatchIt: nonparametric preprocessing for parametric causal inference. Journal of Statistical Software. 2011 Jun 14;42(8):1–28.
  52. 52. Diamond A, Sekhon JS. Genetic matching for estimating causal effects: A general multivariate matching method for achieving balance in observational studies. Review of Economics and Statistics. 2013 Jul 1;95(3): 932–45.
  53. 53. Stuart EA. Matching methods for causal inference: A review and a look forward. Statistical Science. 2010 Feb 1;25(1):1. pmid:20871802
  54. 54. LaLonde RJ. Evaluating the econometric evaluations of training programs with experimental data. The American Economic Review. 1986 Sep 1: 604–20.
  55. 55. Dehejia R. Practical propensity score matching: a reply to Smith and Todd. Journal of Econometrics. 2005 Mar 1;125(1–2): 355–64.
  56. 56. Jeffreys H. Theory of probability. Oxford: Oxford University Press; 1961.
  57. 57. Szmalec A, Brysbaert M, Duyck W. Working memory and (second) language processing. In: Memory, language, and bilingualism: Theoretical and applied approaches 2012 (pp. 74–94). Cambridge University Press.
  58. 58. Linck JA, Osthus P, Koeth JT, Bunting MF. Working memory and second language comprehension and production: A meta-analysis. Psychonomic Bulletin & Review. 2014 Aug 1;21(4): 861–83. pmid:24366687
  59. 59. Linck J. A., & Weiss D. J. (2015). Can working memory and inhibitory control predict second language learning in the classroom?. SAGE Open, 5(4), 2158244015607352.
  60. 60. Paap KR. The bilingual advantage debate: Quantity and quality of the evidence. In: Schwieter JW (Ed.), The Handbook of the Neuroscience of Multilingualism. Wiley-Blackwell, in press.
  61. 61. Chein JM, Schneider W. The brain’s learning and control architecture. Current Directions in Psychological Science. 2012 Apr;21(2): 78–84.
  62. 62. Calvo N, Ibáñez A, García AM. The impact of bilingualism on working memory: a null effect on the whole may not be so on the parts. Frontiers in Psychology. 2016 Feb 25;7:265. pmid:26941704
  63. 63. Green DW, Abutalebi J. Language control in bilinguals: The adaptive control hypothesis. Journal of Cognitive Psychology. 2013 Aug 1;25(5): 515–30. pmid:25077013