Speaking, or language production, is a fundamental aspect of communication that involves several processes: activating semantic information, selecting the correct lexical information from the mental lexicon, retrieving phonological information, phonetic encoding, and articulation (Burke & Shafto, 2008; Levelt, 1999; Levelt, Roelofs, & Meyer, 1999; Martin, 2003; Schwartz, Dell, Martin, Gahl, & Sobel, 2006). Although these are conceptually distinct processes, many word production models suggest that these processes are highly interactive (e.g., Dell, 1986). Moreover, language production almost always involves general cognitive abilities such as planning, paying attention, response selection, and monitoring output. However, the interaction between language production and executive components and their neural bases remain unclear. The goal of the current study was to examine these issues using functional magnetic resonance imaging (fMRI) with a phonological go/no-go picture-naming paradigm.

The go/no-go paradigm has been widely used as a way to measure aspects of cognitive control such as response inhibition or conflict monitoring. In this task, participants are asked to respond to a particular stimulus or type of stimulus (go trials) while withholding their responses to other types of stimuli (no-go trials). Researchers have identified a right lateralized frontoparietal network that is sensitive to these inhibition and conflict monitoring processes (no-go trials; Bellgrove, Hester, & Garavan, 2004; de Zubicaray, Andrew, Zelaya, Williams, & Dumanoir, 2000; Niendam et al., 2012; Nieuwenhuis, Yeung, Van Den Wildenberg, & Ridderinkhof, 2003; Simmonds, Pekar, & Mostofsky, 2008). This network includes the lateral prefrontal cortex including superior, middle, and inferior frontal gyri, which support aspects of cognitive control; the anterior cingulate cortex, which supports performance monitoring; the bilateral parietal regions, which guide attention; and the presupplementary motor area, supporting response inhibition (Botvinick, Braver, Barch, Carter, & Cohen, 2001; Botvinick, Cohen, & Carter, 2004; MacDonald, Cohen, Stenger, & Carter, 2000; Ridderinkhof, Van Den Wildenberg, Segalowitz, & Carter, 2004). Additionally, the dorsal anterior cingulate and presupplementary motor cortices have a role in heteromodal processing in which information from unimodal regions (e.g., somatosensory regions, motor regions) is integrated (Mayer et al., 2012; Ridderinkhof, Ullsperger, Crone, & Nieuwenhuis, 2004; Ridderinkhof, Van Den Wildenberg, et al., 2004). Task difficulty can be manipulated by altering the proportion of go or no-go trials to create a response bias. For example, a run that contains a majority of go trials (e.g., 75%) generates a prepotent response tendency that would need to be inhibited during no-go trials. Functional MRI studies have localized these response-inhibition effects to supplementary motor cortices extending to the anterior cingulate cortex, along with the right dorsal lateral prefrontal cortex (e.g., Criaud & Boulinguez, 2013; Nieuwenhuis et al., 2003). The go/no-go paradigm has also been incorporated within the language domain to study the temporal properties of syntactic, semantic, and phonological processes in speech production, using event-related potential (ERP) indices such as the N200 or lateralized readiness potential (LRP; Rodriguez-Fornells, Schmitt, Kutas, & Münte, 2002; Schmitt, Rodriguez-Fornells, Kutas, & Münte, 2001; Van Turennout, Hagoort, & Brown, 1997). Additionally, Rodriguez-Fornells et al. (2005) used a cross-language go/no-go paradigm in bilinguals to explore the interference of phonological information from the nontarget language using functional MRI. They found that cognitive control regions such as left middle frontal gyrus were crucial in inhibiting the production of the nontarget language.

In contrast to cognitive control manipulations, language tasks typically engage a left-lateralized frontal-temporal-parietal network (Geranmayeh et al., 2012; Geranmayeh, Wise, Mehta, & Leech, 2014; Price, 2010). Specifically, semantic processing and conceptually driven lexical selection involve regions supporting perceptual or feature-based aspects (e.g., ventral and lateral temporal cortices) as well as regions that support semantic integration (e.g., anterior temporal lobe, angular gyrus; Binder, Desai, Graves, & Conant, 2009; Mirman et al., 2015; Pobric, Jefferies, & Ralph, 2007; Price, 2010; Visser, Jefferies, & Ralph, 2010). Lexical processing is supported by the posterior middle temporal gyrus (Hickok & Poeppel, 2007; Indefrey & Levelt, 2004), as well as regions that are involved in lexical selection, such as the left inferior frontal gyri (Hirshorn & Thompson-Schill, 2006; Price, 2010). Neural structures supporting phonological encoding and retrieval include left posterior superior and middle temporal cortices, as well as left supramarginal and posterior aspects of inferior frontal gyri (Indefrey & Levelt, 2000; Poldrack et al., 2001; Poldrack et al., 1999). Finally, articulatory processes involve the left anterior insula, and several motor regions, including the bilateral premotor, motor, and presupplementary motor regions (Brown et al., 2009; Chang, Kenney, Loucks, Poletto, & Ludlow, 2009; Price, 2010), as well as subcortical regions such as the left putamen and caudate (Chang et al., 2009; Price, 2010; Seghier & Price, 2009). Although distinct regions have been attributed to specific processes, the brain functions as a network. For example, Hickok and Poeppel (2007) and others (Friederici & Alter, 2004) have proposed two pathways for language: a ventral pathway, supporting early stages of processing and semantics, and a dorsal pathway, involved in language production and syntactic processing.

These previous lines of research on cognitive control and language production largely reflect isolated investigations into one or the other domain. However, language production often involves cognitive control.Footnote 1 For instance, many behavioral and neuroimaging studies have demonstrated that language production involves conflict monitoring and control (Freund, Gordon, & Nozari, 2016; Gauvin, De Baene, Brass, & Hartsuiker, 2016; Hanley, Cortis, Budd, & Nozari, 2016; Nozari, Dell, & Schwartz, 2011; Nozari & Novick, 2017; Riès, Janssen, Dufau, Alario, & Burle, 2011; Shitova, Roelofs, Schriefers, Bastiaansen, & Schoffelen, 2017; Van Maanen & Van Rijn, 2010), engaging language-related regions such as the left middle temporal cortex (de Zubicaray, McMahon, & Howard, 2015; de Zubicaray, Wilson, McMahon, & Muthiah, 2001) and domain-general regions such as the presupplementary motor area, anterior cingulate, and ventrolateral prefrontal cortex (Nozari & Novick, 2017). Language production also involves inhibitory control (Shao, Meyer, & Roelofs, 2013; Shao, Roelofs, Acheson, & Meyer, 2014; Shao, Roelofs, & Meyer, 2012; Tydgat, Diependaele, Hartsuiker, & Pickering, 2012; Tydgat, Stevens, Hartsuiker, & Pickering, 2011) and other types of control processes (Nozari, Freund, Breining, Rapp, & Gordon, 2016). Additionally, other neuroimaging studies have reported that the dorsal anterior cingulate cortex and regions in the lateral prefrontal cortex, such as the inferior frontal gyrus, assist in aspects of cognitive control required during language production (Fedorenko, Duncan, & Kanwisher, 2012; Piai, Riès, & Swick, 2016; Piai et al., 2013). Language and general cognitive control networks have also been found to overlap in other brain regions, such as the left parietal lobe (Geranmayeh et al., 2014). Moreover, studies on individuals with aphasia have reported that domain-general cognitive control regions, such as the dorsal anterior cingulate cortex, presupplementary motor area, and superior frontal gyrus, contribute significantly to patients’ recovery from aphasia (Brownsett et al., 2013; Geranmayeh, Chau, Wise, Leech, & Hampshire, 2017). Furthermore, studies have found that manipulating executive demands modulated brain activation associated with language production. For instance, increasing lexical selection demands engaged the left inferior frontal gyrus (Kan & Thompson-Schill, 2004; Novick, Trueswell, & Thompson-Schill, 2010; Schnur et al., 2009) and dorsal anterior cingulate cortex (Piai et al., 2013). The resolution of lexical-semantic conflict involved the left middle temporal cortex (de Zubicaray et al., 2015; de Zubicaray et al., 2001). Other studies suggest that phonological competition involves the left posterior superior temporal gyrus, extending to the supramarginal and precentral gyri (e.g., Peramunage, Blumstein, Myers, Goldrick, & Baese-Berk, 2011).

In the present study, we wanted to explore the role of cognitive control in phonological aspects of language production, and their neural bases. To investigate this, we examined how varying cognitive control demands for response inhibition and conflict monitoring affected language production by incorporating the go/no-go paradigm with a commonly used language production task: picture naming. Phonological characteristics of the target photograph’s name were used to make go (overt naming) or no-go (withhold naming) decisions. In both trial types, participants were required to process phonological aspects of the photographs to arrive at correct decisions. This task is similar to the phoneme-monitoring task in which participants monitor specific phonemes across words to explore the incremental nature of phonological encoding (Jansma & Schiller, 2004; Sasisekaran & Luc, 2006; Schiller, Jansma, Peters, & Levelt, 2006; Wheeldon & Levelt, 1995). Therefore, we anticipated that this task involved a higher level of monitoring processes compared with a typical picture-naming task. We manipulated the proportion of go and no-go trials to alter the difficulty in picture naming and response inhibition. We expected that such increases in naming difficulty would elicit more activation in language regions that are sensitive to task demands, such as the left inferior frontal gyrus, posterior superior temporal sulcus, supramarginal gyrus, and precentral gyrus (Novick et al., 2010; Peramunage et al., 2011; Roskies, Fiez, Balota, Raichle, & Petersen, 2001; Schnur et al., 2009). As naming difficulty increased, we also expected to observe greater involvement of domain-general control regions, such as the anterior cingulate cortex and lateral prefrontal cortex. Similarly, the difficulty in withholding responses to no-go trials should also change with the number of no-go trials in a run. For the no-go trials, we hypothesized that inhibition failures and activation in the anterior cingulate and right inferior and middle frontal gyri would change as a function of cognitive control demands, considering the sensitivity of these regions to task manipulations (Criaud & Boulinguez, 2013; Nieuwenhuis et al., 2003). Moreover, conjunction analyses across trial types can identify brain regions that reflect general task difficulty whether the picture names were overly produced or not. Based on the previous literature, we expected that the lateral prefrontal cortex, including superior and middle frontal gyri, and the anterior cingulate cortex would be involved in such domain-general roles.

We were also interested in the relationship between behavioral performance and brain activation in response to task difficulty. Increased task difficulty has typically been associated with increased brain activation, across a broad range of cognitive tasks (Banich, 1998; Barch et al., 1997; Dove, Pollmann, Schubert, Wiggins, & von Cramon, 2000; Rypma & D’Esposito, 1999; Rypma, Prabhakaran, Desmond, Glover, & Gabrieli, 1999; E. E. Smith & Jonides, 1999). However, linking such changes in functional activation to behavior is critical for interpreting activation changes. If increases in activation are associated with declines in behavioral performance, these increases could potentially be interpreted as less efficient processing. On the other hand, if increases in activation are associated with equivalent or improved performance, such increases may serve a beneficial or compensatory role. Moreover, by examining the patterns of such brain–behavior activations across trial types, we can identify whether such relationships are specific to language production or extend more broadly to cognitive control in general.

Method

Participants

Twenty individuals (10 females; mean age = 22.65 years, age range: 18–34 years, SD = 4.30) participated in this study. All participants had normal or correct-to-normal vision and reported no known psychiatric or neurological illnesses. They were all native English speakers with little knowledge of another language. Before the MRI session, all participants underwent psychometric testing to assess language and basic cognitive skills such as speed, executive functions, and working memory (see Table 1). All participants gave written, informed consent, and all procedures were approved by the Institutional Review Board at the Pennsylvania State University.

Table 1 Participant demographic and neuropsychological testing information

Stimuli and procedure

Participants completed a phonological go/no-go picture-naming task in the MRI scanner. Photographs were presented, and participants were instructed to overtly name the photograph as quickly and accurately as possible. Task difficulty was manipulated via the proportion of trials that needed to be named or inhibited across three conditions: all go, go bias, and no-go bias (see Fig. 1 for an illustration of the task). In the all go condition, participants were required to name all of the photographs (100% go trials). In the go bias condition, participants were instructed to name the photograph only if the name of the photograph started with a consonant (75% go trials; e.g., nose) but withhold their response if the name started with a vowel (25% no-go trials; e.g., ear). In the no-go bias condition, the go cue was reversed. Participants were instructed to name the photograph only if the name started with a vowel (25% go trials; e.g., apple) but withhold their response if the name started with a consonant (75% no-go trials; e.g., chair). Before scanning, participants practiced overt picture naming while minimizing head movement in a simulation scanner. In the scanner, participants always performed the all go runs first, prior to being informed about the go/no-go manipulation to minimize the possible influence of go/no-go task demands. After the all go condition, participants underwent a practice run and then completed the other two conditions, whose order was counterbalanced across participants. Stimuli were not repeated across practice runs or conditions.

Fig. 1
figure 1

Task design. Overview of the phonological go/no-go picture-naming task. Examples of go and no-go trials for each of the three conditions: all go, go bias, and no-go bias. Correct names to the two no-go trials (withhold naming, in green) are ant and cat, respectively. Naming difficulty (go trials, shown in red) increased from the all go condition to the go bias condition to the no-go bias condition. Cognitive control demands in terms of conflict monitoring and response inhibition (no-go trials) increased from the no-go bias condition to the go bias condition. (Color figure online)

Photographs were taken from two normed databases (Brodeur, Guérard, & Bouras, 2014; Moreno-Martínez & Montoro, 2012) and depicted a broad range of common objects. Additionally, an independent group of 21 healthy, native English-speaking adults was recruited to norm the selected stimuli. Items were included if naming consistency was 67% or higher. A total of 330 colored photographs, 110 unique items per condition (all go, go bias, no-go bias) were selected for the MRI experiment. For the go bias and no-go bias conditions, these trials were further divided into 82 trials (75%) of the biased trial type (e.g., go trials in the go bias runs) and 28 trials (25%) of the nonbiased trial type (e.g., no-go trials in the go bias runs). The linguistic characteristics of the object names were obtained from the English Lexicon Project (ELP; Balota et al., 2007). Stimuli across the three conditions were similar in terms of word length, F(2, 327) = .75, p = .47; word frequency, F(2, 327) = .17, p = .84; number of phonemes, F(2, 327) = .30, p = .74; number of syllables, F(2, 327) = .33, p = .72; and reaction time (RT) and accuracy in the ELP lexical-decision task, F1(2, 327) = .72, p1 = .49; F2(2, 327) = 1.43, p2 = .24; and the ELP word-naming task, F1(2, 327) = .44, p1 = .65; F2(2, 327) = .69, p2 = .50. See Supplemental Table 1 for additional details.

For each trial, a color photograph of an object on a white background was presented, and the participant was instructed to respond with its name or withhold a response based on the specific condition requirements. Participants were asked to use one word to name the photograph and not to say anything else. Photographs (duration = 2 s) were presented with a variable interstimulus interval (ISI; range: 1–12 s, average ISI = 3.40 s) to optimize the hemodynamic response (Optseq2; Dale, 1999). Participants completed six runs, two runs per condition in the scanner. Overt verbal responses in the scanner were recorded and filtered using a dual-channel, MR-compatible, fiber optic microphone system (Optoacoustics Ltd., Or-Yehuda, Israel). After the scan, an additional debriefing task was included in which participants were asked to name all the photographs from the go bias and no-go bias conditions, to verify their identification and naming of the photographs, and to confirm that they knew the name of the objects when a response was withheld (e.g., in the no-go trials).

Acquisition of MRI data

MRI data were collected on a 3T Siemens Prisma Fit MRI scanner with a 20-channel head coil. We collected a sagittal T1 weighted localizer image to define a volume for data collection and higher order shimming. The anterior and posterior commissures were identified for slice selection and shimming. T1 weighted structural images were then collected using a magnetization-prepared rapid acquisition gradient echo (MP-RAGE) sequence (repetition time [TR] = 2,300 ms; echo time [TE] = 2.28 ms; inversion time [TI] = 900 ms; flip angle = 8°; echo spacing = 7 ms; acceleration factor = 2; field of view [FOV] = 256 mm2; voxel size = 1 × 1 × 1 mm; 160 contiguous slices).

Functional images sensitive to blood-oxygen-level-dependent (BOLD) contrast were collected using an echo-planar imaging (EPI) sequence (TR = 2,500 ms; TE = 25 ms; flip angle = 90°; echo spacing = 0.49 ms; FOV = 240 mm2; voxel size = 3 × 3 × 3 mm; 41 contiguous axial slices, parallel to the AC–PC line, interleaved acquisition). Two additional volumes were acquired and deleted at the beginning of each functional run to reach steady state equilibrium. Each of six functional runs was approximately 122 volumes (309 s) in length.

Behavioral data analyses

Responses were coded based on both the recordings from the scanner session and the debriefings after the scanner session. In the all go condition, responses were marked as correct if the participant provided the exact target name (e.g., chicken for “chicken”) or its plural form (e.g., chickens for “chicken”), or an acceptable alternative word that corresponded to the photograph (e.g., hen for “chicken”). Responses were marked as incorrect if the response did not match the photograph (e.g., ice for “tea”), or if no response was provided.

For go trials in the other conditions, responses were similarly coded; however, acceptable alternative words that matched the photograph also had to have the same onset category (vowel/consonant) as the target word (e.g., raven for “crow” but not spaceman for “astronaut”). For no-go trials in both the go bias condition and the no-go bias condition, they were marked as correct if no response was provided in the scanner and the response in the debriefing task indicated that they knew the name of the object in the photograph.

An error analysis was also conducted across three types of errors: incorrect responses as defined above; commission errors (failures to inhibit a response during a no-go trial when the response matched the photograph and its onset category); and omission errors (no response was provided in the scanner for a go trial). Errors were treated as categorical variables and analyzed separately using generalized logistic mixed-effect modeling, employing the glmer function in the lme4 package (Bates, Mächler, Bolker, & Walker, 2014) in the R environment (Venables & Smith, 2006). For each trial type, errors of that type were coded as ones and other trials were coded as zeros. We obtained p values for regression coefficients using the lmerTest package (Kuznetsova, Brockhoff, & Christensen, 2017). This approach has the advantage of taking into account individual data points, allowing intercepts and slopes to be random across participants and allowing different items to have random intercepts. As recommended by Barr, Levy, Scheepers, and Tily (2013), linear mixed-effect models generalize best when they include the maximal random effects structure justified by the design. Therefore, for each type of error, a full model including the task condition as the independent variable and the error as the dependent variable was run while allowing different participants to have random intercepts and slopes and different items to have random intercepts. Additionally, for commission errors, a model adding the error rates in the AY condition in the AX-CPT (an index of inhibition ability) as the participant level variable was also conducted to further validate our manipulation.

Error rates were also calculated for each type of error. The overall error rate (overall ER) was calculated by dividing the total number of errors by the total number of trials in that condition. The incorrect response rate in each condition was calculated by dividing the number of incorrect responses by the total number of trials. Note that because commission errors could only occur on no-go trials, and the number of no-go trials varied across conditions, the commission error rate (commission ER) in each condition was calculated by dividing the number of commission errors by the number of no-go trials in that condition. Similarly, because omission errors could only be observed on go trials, and the number of go trials in each condition varied, the omission error rate (omission ER) in each condition was calculated by dividing the number of omission errors by the number of go trials in that condition.

Reaction times (RTs) to go trials were calculated using customized PRAAT scripts. The PRAAT scripts identified response onsets by searching the recordings for pitch deviations from the pitch of the filtered auditory signal. These onsets were then manually verified by using both the audio and visual speech stream. The reaction times were calculated as the difference between the photograph onsets (from E-Prime output) and the response onsets. Only trials with correct responses and reaction times within 2.5 standard deviations were included in further analyses. Reaction times were also analyzed using mixed-effect regression modeling, employing the lmer function in the lme4 package (Bates et al., 2014) in the R environment (Venables & Smith, 2006). Task condition was included as the categorical independent variable, and reaction time was included as the continuous dependent variable, while allowing different participants to have random intercepts and slopes and different items to have random intercepts.

Functional MRI data analyses

A quality assurance tool was used to analyze data quality, including the number of potentially clipped voxels, mean signal fluctuation to noise ratio (SFNR), and per-slice variation (Glover et al., 2012). Additionally, both the structural and the functional images were visually inspected for artifacts and blurring. The skull and other coverings were stripped from the structural brain images using Optimized Brain Extraction for Pathological Brains (optiBET; Lutkenhoff et al., 2014). We used FSL (Version 5.0.9), with FEAT (fMRI expert analysis tool) Version 6.0 (S. M. Smith et al., 2004; Woolrich, Behrens, Beckmann, Jenkinson, & Smith, 2004) to carry out preprocessing and statistical analyses. Preprocessing steps included motion correction (FSL MCFLIRT), B0 unwarping with field mapping, slice timing correction, spatial smoothing (FWHM = 5 mm), high-pass filtering, coregistration (first to the participants’ own brain-extracted structural image, then to the MNI space using the MNI 152 T1 2-mm template), and normalization. We used a double-gamma hemodynamic response function to model the BOLD signal for each event, and only correct trials were included in the analyses. We conducted first-level analyses on each participant’s individual runs. To remove the residual effects of motion that remained after motion correction, standard motion parameters were included in the first-level models as confound EVs (Explanatory Variables). We then combined analyses from the previous step across runs and participants in group-level analyses using FMRIB’s local analysis of mixed effects (FLAME 1+2; Beckmann, Jenkinson, & Smith, 2003; Woolrich et al., 2004). Within FSL, we first identified regions that were responsive to our basic trial types (go trials in all go runs, go trials in the biased runs, no-go trials in the biased runs) compared with the implicit baseline using each participant’s individual runs as inputs. We then compared functional activation differences across task conditions for both go trials (all go < go bias < no-go bias) and no-go trials (no-go bias < go bias). All significant activations were determined using a two-step process in which Z (Gaussianized T/F) statistical images were initially thresholded at the voxel level (p < .01). Clusters of identified voxels were then corrected for multiple comparisons (p < .05, corrected) based on Gaussian random field theory (Worsley, 2001) in which each cluster’s estimated significance level was compared with the cluster probability threshold, and then only clusters whose estimated significance exceeded the threshold were included in the results (Hayasaka & Nichols, 2003). Additionally, results from comparisons between conditions were masked to ensure that significant differences were based on significant positive hemodynamic responses (e.g., go trials: all go < go bias < no-go bias, masked with significant activation of go trials in the no-go bias condition).

To assess brain–behavior relations, we conducted three correlational analyses on go trials and no-go trials separately. First, we correlated participants’ overall reaction times and brain activation to all of the go trials in a whole-brain analysis (all go, go bias, no-go bias). This approach allowed us to examine the relationship between overall speed and functional activation across individuals. We hypothesized that brain regions that showed a positive correlation (i.e., longer reaction times associated with more activation on go trials) may indicate less efficient processing across individuals, while regions that showed a negative correlation (i.e., shorter reaction times associated with more activation on go trials) would reflect a potentially beneficial recruitment. Second, we correlated reaction time and brain activation for go trials as a function of naming difficulty in a second whole-brain analysis. That is, we correlated changes in RT from the easier condition (all go) to the harder condition (no-go bias) with changes in brain activation between these two conditions. Positive correlations (i.e., larger reaction time increases associated with larger increases in activation on go trials) would indicate larger individual change score differences and functional activation as a function of naming difficulty (e.g., reflecting reduced processing efficiency when difficulty increased across individuals), while negative correlations (i.e., smaller reaction time increases associated with larger increases in activation on go trials) would suggest a potential beneficial function of increases in activation across individuals. Third, we calculated correlations between reaction time and activation to go trials within each condition to compare the strength of these correlations and to observe if the correlation strength changed as a function of task difficulty (all go < go bias < no-go bias). We hypothesized that the strength of both positive and negative brain–behavior correlations to go trials may increase as task difficulty increased.

Three similar correlation analyses were also conducted on commission error rate and brain activation for no-go trials, again using whole-brain analyses. All reported brain regions were identified according to the Harvard-Oxford Structural Atlas (Desikan et al., 2006). Reported coordinates are in MNI space, and results are overlaid on a representative brain in MNI space.

Additionally, to estimate the potential effect of power differences due to the unbalanced number of critical trials across runs, we randomlyFootnote 2 selected an equal number of critical trials from all conditions for both go and no-go trials and statistically compared the results from this analysis to the main analysis that included all correct trials (e.g., we selected an equal number of go trials from the all go condition and the go bias condition to match the number of go trials in the no-go bias condition, and selected an equal number of no-go trials from the no-go bias condition to match the number of no-go trials in the go bias condition).

Results

Behavioral results

Error rates

A mixed-logistic regression was conducted on the number of overall errors to explore the differences across the three conditions (all go, go bias, no-go bias). Results showed that the overall error rates were not significantly different across the three conditions (all go vs. go bias: β = −.47, SE = .35, p = .18; all go vs. no-go bias: β = −.55, SE = .32, p = .09; go bias vs. no-go bias: β = −.08, SE = .35, p = .82). See Table 2 for behavioral data.

Table 2 Behavioral results (number of errors, reaction times, and error rates)

We also conducted a mixed-logistic regression on the number of commission errors between the go bias and no-go bias conditions. The results showed that the number of commission errors was significantly higher in the go bias condition than in the no-go bias condition (β = −3.24, SE = .95, p < .001), suggesting that cognitive control demands in terms of conflict monitoring and response inhibition were higher during the go bias condition, consistent with the intended effect of our response-bias manipulation. Additionally, the number of commission errors in the go bias condition was also significantly positively predicted by the error rates on the AY condition of the AX-CPT, a measure of inhibition (Braver et al., 2007), suggesting that these two measures of inhibition were consistent across individuals (β = 2.49, SE = 1.09, p = .02). Moreover, the mixed-logistic regression on omission errors showed that the go bias condition elicited fewer omission errors compared with the no-go bias condition (β = 1.77, SE = .52, p < .001), confirming that our manipulation biased participants toward not naming during the no-go bias condition.

Reaction times

A generalized linear mixed-effect model was conducted on go trial RTs to explore the differences across the three conditions (all go, go bias, no-go bias). Results showed that the RTs in the no-go bias condition were significantly longer than the all go condition (β = .21, SE = .04, p < .001) and the go bias condition (β = .17, SE = .03, p < .001), while the RTs in the all go condition and the go bias condition were not significantly different from each other (β = .05, SE = .03, p = .13). These results confirmed the gradual increase of reaction times as the number of go trials decreased and suggest that the no-go bias condition was the most difficult and had the highest language production demands.

Neuroimaging results

Head motion

Because participants produced overt speech in the scanner, we conducted a repeated-measures ANOVA on the average amount of head motion across three conditions (all go: 0.19 mm, go bias: 0.19 mm, no-go bias: 0.24 mm). First, all estimates of motion were less than the standard inclusion criteria for task-based fMRI studies (i.e., < ½ voxel, < 1.5 mm). Moreover, the main effect of condition was only marginally significant, F(2, 38) = 3.6, p = .06, and trending in the opposite direction than one would anticipate. Further pairwise comparisons did not show any significant differences between any of the conditions (ps > .1). This suggests that the potential effects of motion were comparable across conditions.

Activation to basic trial types

The go trials in the all go condition elicited activation in established regions that are involved in language processing, including the bilateral inferior frontal gyri, bilateral superior temporal gyri, and other motor and visual regions such as the right cerebellum and bilateral lateral occipital cortex (see Supplemental Table 2 and Fig. 2a). Go trials in the bias conditions (collapsing across go bias and no-go bias) elicited activation in similar regions,Footnote 3 but patterns of activation were more extensive and also included bilateral frontal pole (see Supplemental Table 2 and Fig. 2b). The no-go trials elicited activation in established regions that are involved in cognitive control, such as the bilateral frontal cortices (e.g., frontal pole, middle and inferior frontal gyri), and in visual regions such as the bilateral lateral occipital cortex (see Supplemental Table 2 and Fig. 2c). We were specifically interested in the brain responses to go and no-go trials as a function of task difficulty. Therefore, analyses comparing activation differences across different conditions on the two trial types are reported below.

Fig. 2
figure 2

FMRI activation to basic trial types. Overview of the regions in which there was significant activation in (a) go trials in the all go condition, (b) go trials in the go bias and no-go bias conditions, and (b) no-go trials in the go bias and no-go bias conditions. Slices are depicted in increments of 10 mm, starting at z = −5 and ending at z = 45. (Color figure online)

Go trials comparison

To explore how the brain responded to changes in language production difficulty, a parametric linear trend analysis was conducted for go trials across the three conditions (all go < go bias < no-go bias). Results revealed that go trials in the no-go bias condition elicited greater activation than go trials in other conditions in the bilateral frontal pole, right superior and inferior frontal gyri (pars triangularis, pars opercularis), bilateral orbitofrontal cortex, bilateral insula, bilateral temporal pole, right posterior superior and middle temporal gyri, left anterior cingulate, bilateral posterior cingulate gyrus, bilateral supramarginal and bilateral precuneus (see Table 3 and Fig. 3a).

Table 3 FMRI activation as a function of task difficulty for go trials and no-no trials
Fig. 3
figure 3

FMRI activation as a function of task difficulty. Shown is an overview of regions that comprise (a) increased naming difficulty (go trials): all go < go bias < no-go bias, and (b) increased cognitive control demands in terms of response inhibition (no-go trials): no-go bias < go bias. (Color figure online)

No-go trials comparison

To explore how the brain responded to changes in cognitive control aspects of language production (i.e., no-go trials), a direct comparison was made between no-go trials in the go bias condition and no-go trials in the no-go bias condition. No-go trials in the go bias condition elicited greater activation than no-go trials in the no-go bias condition in the right frontal pole; right superior frontal gyrus; bilateral orbitalfrontal cortex, which extended to bilateral insula and inferior frontal gyri; bilateral cingulate cortex, left middle temporal gyrus, and left posterior supramarginal gyrus (see Table 3 and Fig. 3b). Additionally, a conjunction analysis was conducted to identify the overlap between these go trial and no-go trial comparisons to identify regions that reflected general difficulty across both language production and response inhibition. Regions that were responsive to both types of task difficulty included the right superior and inferior frontal gyri, right frontal orbital cortex, left supramarginal gyrus, and left insula.

Brain–behavior correlations

Because we were also interested in how these patterns of brain activation related to behavioral performance in response to task difficulty, we conducted correlational analyses between behavioral performance and brain activation on go trials and no-go trials separately, specifically examining effects of task difficulty.

Go trials: Brain activation–reaction time

First, correlation analyses were conducted between overall reaction time and brain activation across participants for all of the go trials in three conditions (all go, go bias, no-go bias). Across all of the go trials, positive correlations between RT and activation were found in the bilateral orbital frontal cortex, extending subcortically, and the left occipital pole, extending to left lingual gyrus and the occipital portion of fusiform gyrus. These findings suggest that individuals with slower response times had more activation in these regions. On the other hand, we also found negative correlations in which individuals with faster RTs had more activation in the right precentral and postcentral gyri, extending anteriorly to the right superior frontal gyrus and posteriorly to the right supramarginal gyrus, and in right precuneus, extending to the bilateral lingual gyrus. These results suggest that individuals who had greater recruitment of sensorimotor, frontal control regions, and some higher order visual processing regions had better naming performance (see Table 4 and Fig. 4).

Table 4 Correlation between reaction time and fMRI activation across all of the go trials (no-go bias, go bias, and all go)
Fig. 4
figure 4

Correlation between reaction time and fMRI activation across all go trials (no-go bias, go bias, and all go). Regions in which longer reaction times were associated with increases in activation are shown in red, and regions in which faster reaction times were associated with increases in activation are shown in blue. Slices are depicted in increments of 10 mm, starting at z = −5 and ending at z = 65. (Color figure online)

To examine the relationship between RT and fMRI activation in response to naming difficulty (all go < go bias < no-go bias), we further conducted correlational analyses comparing changes in RT and brain activation from the all go condition to the no-go bias condition. Positive correlations (larger increases in reaction times associated with larger increases in activation) were found in the right frontal pole and left middle frontal gyrus. This result suggests that those individuals who were slowed the most across conditions also had larger increases in activation as a function of increased task difficulty in these regions, suggesting less efficient neural recruitment. On the other hand, these patterns could also reflect participants’ attempts to adapt to the changes in task difficulty. Negative correlations (smaller increases in reaction time correlated with larger increases in activation as a function of naming difficulty) were found in the left precentral and postcentral gyri (extending to the left supplementary motor cortex and left superior frontal gyrus), bilateral lingual gyrus, and cerebellum. Consistent with the results of the negative correlations across all of the go trials, these negative correlation results suggest that as production difficulty increased, increased recruitment of sensorimotor and frontal regions was associated with better naming performance across individuals (see Supplemental Table 3 and Supplemental Fig. 2).

Additionally, we were interested in whether the strength of these brain–behavioral correlations changed as a function of increased naming difficulty (all go < go bias < no-go bias). However, no differences in the strength of correlations across the different conditions were found.

No-go trials: Brain activation–commission error rate

Similar to the analyses with the go trials, we conducted three types of brain–behavior correlation analyses on no-go trial performance and activation. The commission error rate was used as the index of behavioral performance for no-go trials because it represents the ability to inhibit responses correctly. First, correlation analyses were conducted between commission error rate and brain activation across all of the no-go trials in both conditions (collapsing across go bias and no-go bias). No significant positive correlation was found. Negative correlations (i.e., lower commission error rates associated with increased activation to no-go trials across individuals) were found in the right anterior cingulate, which extended into the right precentral, postcentral, and supplementary motor cortices, and in the bilateral cerebellum. These results indicate that increases in activation in sensorimotor and cognitive control regions was associated with participants who were more successful in response inhibition (see Table 5 and Fig. 5).Footnote 4

Table 5 Correlation between commission error rate and fMRI activation across all of the no-go trials (go bias and no-go bias)
Fig. 5
figure 5

Correlations between commission error rate and fMRI activation across all no-go trials (go bias and no-go bias). Regions in which lower commission error rates were associated with increases in activation are shown in blue. Axial view slices are depicted in increments of 10, starting at z = −5 and ending at z = 65. Sagittal view slides are depicted in increments of 10, starting at x = 6 and ending at x = 36. No significant positive correlations were found. (Color figure online)

A second set of commission ER–fMRI activation correlation analyses were conducted on no-go trials to examine potential correlations between changes in commission error rate and changes in fMRI activation from the no-go bias condition to the go bias condition (no-go bias < go bias). However, no significant positive or negative brain–behavior correlations were found.

Lastly, we also explored whether the strength of correlations between commission error rate and fMRI activation differed between the two bias conditions. As there were no significant positive correlations, we limited our analysis to the negative correlations. The negative brain–commission ER correlation was stronger in the go bias condition than in the no-go bias condition in the left postcentral gyrus, extending to the left posterior superior temporal gyrus and left planum temporale (see Supplemental Table 4).

Power analysis

To rule out the potential effect of power differences due to the unbalanced number of critical trials across runs, we randomly selected an equal number of critical trials from all conditions and compared fMRI activation between an analysis including all critical trials and one including equal numbers of trials across conditions. For both go trials and no-go trials, there were no significant differences between these analyses. Furthermore, we compared two similar sets of analyses of brain activation as a function of increased task difficulty for both go trials and no-go trials (e.g., go trials: all go < go bias < no-go bias; no-go trials: no-go bias < go bias). There were no significant differences between the analysis including all critical trials and the one including equal numbers of trials across conditions (see Supplemental Fig. 1). These results suggest that differences in the number of trials per condition were not driving any differences that we observed.

Discussion

The primary goal of this experiment was to investigate the influence of cognitive control (in terms of response inhibition and conflict monitoring) on the phonological aspects of language production by using a phonological go/no-go picture-naming task. We manipulated task difficulty for both naming pictures and inhibiting responses by varying the proportions of go trials (overt naming) and no-go trials (withhold naming). Previously, researchers have used language go/no-go paradigms to study the temporal and spatial properties of speech production (e.g., Rodriguez-Fornells et al., 2002; Rodriguez-Fornells et al., 2005; Schmitt et al., 2001; Van Turennout et al., 1997). However, here we used this paradigm to test the interaction between language production and cognitive control. We predicted that as language-production difficulty increased, increased activation would be observed in not only language related regions—such as the left inferior frontal gyrus, posterior superior temporal sulcus, supramarginal gyrus, and precentral gyrus—but also in regions related to cognitive control, such as the anterior cingulate cortex and lateral and superior prefrontal cortex. Moreover, we examined whether the engagement of such regions was specific to language production or whether it was related to domain-general cognitive control. We were also interested in the relationships between behavioral performance and fMRI activation, the effect of task difficulty on these relations, and whether these brain–behavior relations were specific to language production, cognitive control processes such as conflict monitoring and response inhibition, or both.

In general, we predicted that increases in difficulty in both language production and response inhibition would elicit increases in brain activity and declines in behavioral performance, as suggested by previous studies (de Zubicaray et al., 2015; de Zubicaray et al., 2001; Kan & Thompson-Schill, 2004; Novick et al., 2010; Piai et al., 2013; Schnur et al., 2009). As predicted, behavioral results showed that go trial reaction times in the no-go bias condition, where naming should be the most difficult, were significantly longer compared with go trials in the other conditions. We also found that increased difficulty in naming pictures (go trials: all go < go bias < no-go bias) was associated with more activation in the right prefrontal regions (such as the frontal pole, superior and inferior frontal gyri, and orbitofrontal cortex), anterior and posterior cingulate gyri, right posterior superior and middle temporal gyri, and bilateral supramarginal gyri (see Table 3 and Fig. 3a for more details). These results are consistent with previous studies focusing on brain mechanisms of cognitive control and language production (Botvinick et al., 2004; Geranmayeh et al., 2012; Geranmayeh et al., 2014; Indefrey & Levelt, 2004; MacDonald et al., 2000).

The prefrontal cortex and cingulate have well-established roles in cognitive control and conflict monitoring in general (Botvinick et al., 2004; MacDonald et al., 2000), and also in language control during production (Abutalebi et al., 2008; Abutalebi & Green, 2008; Fedorenko & Thompson-Schill, 2014; Kan & Thompson-Schill, 2004; Piai et al., 2016; Riès et al., 2011; Schnur et al., 2009). Within these regions, the role of the right inferior frontal gyrus in inhibition, particularly as measured by the go/no-go paradigm, has also been well documented (Aron, Robbins, & Poldrack, 2004). Although these right prefrontal regions are not part of the core language-production network, their activation suggests that regions related to general cognitive control were recruited to a greater extent in response to increased naming difficulty.

In addition to cognitive control regions, increased naming difficulty for go trials also elicited activation in both the typical language network (e.g., left insula, temporal pole, supramarginal gyrus) and homologous right hemisphere regions. The left insula and supramarginal gyrus have been associated with language-production processes and phonological processing (Hickok & Poeppel, 2007; Indefrey & Levelt, 2000, 2004; Poldrack et al., 2001; Poldrack et al., 1999), whereas the temporal cortex has long been associated with semantic processes. Interestingly the increases in homologous right hemisphere activation in response to increased naming difficulty have also been previously observed by others (e.g., Bench et al., 1993; de Zubicaray et al., 2001). Taken together, the changes we observed in language-production regions and general-control regions indicated that the control process in language production involved both domain-specific and domain-general resources.

The phonological go/no-go picture-naming paradigm also allowed us to examine behavioral and brain responses to changes in conflict monitoring and response inhibition via the no-go trials. Response inhibition demands were the highest in the condition with the fewest no-go trials (go bias condition), and consistent with this, participants exhibited a more commission errors in this condition compared with the condition with lower response inhibition demands (no-go bias condition). Moreover, the average number of errors in the go bias condition was significantly positively correlated with the error rate on a separate behavioral assessment of inhibitory control (i.e., the AY condition of the AX-CPT; Braver et al., 2007), again, consistent with the idea that commission errors reflects inhibition and conflict monitoring demands. In general, no-go trials engaged bilateral frontal regions that have been associated with cognitive control, such as the bilateral frontal pole and middle and inferior frontal gyri (Simmonds et al., 2008), and the bilateral lateral occipital cortex that is associated with processing visual objects (Grill-Spector, Kourtzi, & Kanwisher, 2001; Grill-Spector et al., 1999). This pattern of results suggests that response inhibition demands may modulate the recruitment of basic sensory processing regions. Moreover, we also found significant differences in no-go trial activation as a function of the changing response inhibition demands. Increases in response inhibition demands (i.e., no-go trials: no-go bias condition < go bias condition) were associated with more extensive fMRI activation in the bilateral orbitofrontal cortex, which extended to the bilateral superior and inferior frontal gyri, bilateral anterior cingulate cortex, and left posterior supramarginal gyrus, which is consistent with previous studies focusing on brain networks of cognitive control (Botvinick et al., 2001; Botvinick et al., 2004; MacDonald et al., 2000; Nieuwenhuis et al., 2003; Simmonds et al., 2008). Specifically, in the go/no-go paradigm, the role of anterior cingulate cortex in conflict monitoring has been well established (Botvinick et al., 2001; Botvinick et al., 2004; MacDonald et al., 2000). Additionally, the anterior cingulate is one of several domain-general control regions that is involved in speech production, not only in healthy participants (Piai et al., 2013) but also in aphasia recovery (Brownsett et al., 2013; Geranmayeh et al., 2017). In the current task, the anterior cingulate cortex could be involved in monitoring the conflict between the response bias to name pictures and the trial-level demand to not name the picture. This conflict was stronger in the go bias condition compared with the no-go bias condition, and correspondingly the anterior cingulate cortex showed more activation to no-go trials during the go bias condition. The greater recruitment of the left supramarginal gyrus, which has an established role in phonological processing (Indefrey & Levelt, 2000; Poldrack et al., 2001; Poldrack et al., 1999), to no-go trials in the go bias condition, may reflect a greater focus on phonological aspects of target names, particularly when naming must be inhibited (i.e., during no-go trials). Furthermore, as reflected in the conjunction analysis, the right superior and inferior frontal gyri and left supramarginal gyrus were sensitive to increases in task difficulty during both go and no-go trials, suggesting that recruitment of these regions may reflect task difficulty in general, whether or not the item was overtly produced.

To better understand the nature of these fMRI activation patterns, we also examined the relationships between behavioral performance and fMRI activation on both go trials and no-go trials. Across all of the go trials, individuals with longer reaction times showed increases in activation in the bilateral orbitofrontal cortex, right frontal pole, left middle frontal gyrus, and left occipital regions (see Table 4 and Supplemental Table 3). This suggests that participants who named pictures the slowest engaged additional cognitive control and visual regions outside of core language regions. On the other hand, individuals who had better naming performance on naming trials (i.e., faster RTs) had increased activation in sensorimotor regions such as the bilateral precentral and postcentral gyri, as well as the right superior frontal gyrus, right supramarginal gyrus, and bilateral precuneus. The association between efficient behavioral performance and increased activation, across individuals, in the context of increasing naming difficulty, suggests a potential compensatory function of these sensorimotor and frontal regions in language production.

Similarly, across all the no-go trials, individuals who were more successful inhibitors (i.e., lower commission error rates) had more activation in core cognitive control regions, such as the right anterior cingulate gyrus (Botvinick et al., 2004; MacDonald et al., 2000) and sensorimotor regions such as the right precentral and postcentral gyri and supplementary motor cortex. Additionally, we found that this negative brain–behavior correlation was stronger when response inhibition demands increased (from no-go bias condition to the go bias condition). These effects were found in the left postcentral gyrus, extending to the left posterior superior temporal gyrus and left planum temporale, suggesting that individuals who engaged these regions to a greater extent were more successful at behavioral response inhibition. It is noteworthy that these sensory and motor regions, such as the right precentral gyrus, were associated with not only successful naming but also successful inhibition of naming, as suggested in the conjunction analysis. This result suggests that similar regions may contribute to successful language production performance regardless of whether an item is overtly produced.

Although our results highlight the interplay of cognitive control and language production processes, there are a few caveats to our results. First, although the phonological go/no-go paradigm allowed us to examine this interplay of processes, cognitive control demands were likely much higher in this context compared with naturalistic language production. Moreover, a necessity of the go/no-go paradigm is that the number of critical trials across different conditions must be unbalanced to manipulate response biases, which inevitably leads to differences in power across conditions. However, the conditions with fewer critical trials (e.g., go trials in the no-go bias condition, or no-go trials in the go bias condition) always elicited more extensive activation than conditions with more critical trials (e.g., no-go trials in the no-go bias condition, or go trials in the go bias condition). This suggests that any power differences may result in an underestimation of potential effects. Additionally, to assess for the potential influence of these effects, we randomly selected equal numbers of critical trials from all conditions. A comparison of brain activation from our main analysis and this subset analysis revealed that there were no statistically significant differences between these analyses (see Supplemental Fig. 1).

In conclusion, our results provide new behavioral and neural evidence on the effect of task difficulty on executive aspects of language production. Overall, as task demands increased for both go trials and no-go trials, behavioral performance declined (i.e., longer reaction times to go trials, more commission errors to no-go trials) and brain activation increased. The increased activation involved not only language-specific but also domain-general neural resources. Brain regions such as the right superior and inferior frontal gyri and left supramarginal gyrus were recruited during all trials and may reflect general effects of task difficulty, as well as aspects of phonological processing that are required during both overt language production and the inhibition of language production. Critically, the increased activation in sensorimotor regions, such as the right precentral gyrus and frontal regions such as the right superior frontal gyrus, potentially helped participants maintain their behavioral performance, not only during language production but also as response inhibition demands increased. Overall, these results suggest that cognitive control demands affect language production, and that successfully coping with increases in task difficulty relies on both language specific and domain-general cognitive control regions.