Introduction

Research on emotion regulation (ER)—the processes influencing which, when, and how emotions are experienced and expressed—has established that the ability to cope adaptively with emotionally challenging situations is vital for both physical and mental health (Gross, 2008, 2015). The process model of ER, proposed by Gross (1998), distinguishes four families of ER strategies based on the emotion generation stage they are primarily affecting. Situational strategies, which seek to change emotion generation at the situation stage, by selecting which situations are encountered or by modifying the situation; Attentional Deployment strategies, which seek to change emotion generation by changing the aspects to which individuals attend to in a situation; Cognitive strategies, which involve changing emotion generation at the appraisal stage, by modifying the way individuals look at/construe the situation; finally, Response Modulation strategies, which seek to change emotion at the response stage, by modifying aspects of the interoceptive or exteroceptive world through overt or covert behavior.

Although significant progress has been made in elucidating the neurobehavioral mechanisms of ER over the past decade (e.g., reviewed in Dolcos et al., 2017; Ochsner, Silvers, & Buhle, 2012; Sheppes, Suri, & Gross, 2015), prior neuroimaging studies have most commonly focused on examining cognitive strategies, especially cognitive reappraisal, which involves attempts to change the meaning of stimuli or situations, and a response modulation strategy, such as expressive suppression, which involves attempts to decrease emotionally expressive behaviorFootnote 1 (Gross, 2008). Recent investigations have begun to reveal the efficacy of the attentional deployment family of ER strategies, typically involving shifts in attention away from emotional aspects of the stimulus, or away from the stimulus altogether, in order to alter emotional responses (Gross, 2008; Sheppes et al., 2014). In particular, one type of attentional deployment strategies, called focused attention (FA), has been proven quite effective in reducing the negative impact of emotional memory encoding (Dolcos et al., 2020b) and retrieval (Denkova, Dolcos, & Dolcos, 2015; Iordan, Dolcos, & Dolcos, 2019). Despite its promise as an effective ER strategy, however, it remains unclear how engaging FA influences the immediate subjective experience of emotions, and how the engagement of this particular ER strategy is linked to changes in activity in regions typically associated with emotion processing and regulation. We investigated the neural mechanisms associated with the impact of self-guided FA on subjective emotional experience using functional magnetic resonance imaging (fMRI) in a sample of healthy adults. Clarifying the neurobehavioral mechanisms of self-guided FA can improve our understanding of alterations in affective disorders, which are typically characterized by excessive attention to negative emotional stimuli and debilitating rumination on negative memories.

Attentional Deployment as an Emotion Regulation Strategy

Available evidence confirms the efficacy of attentional deployment strategies while also identifying their unique advantages over different types of ER strategies. Recent studies have demonstrated the effectiveness of specific types of attentional deployment strategies in regulating emotions (Wadlinger & Isaacowitz, 2011; Webb, Miles, & Sheeran, 2012), both in promoting adaptive emotional responses to negative stimuli presented in laboratory settings (Ferri, Schmidt, Hajcak, & Canli, 2013; Urry, 2010) and in improving the efficacy of longer-term fear reduction interventions (Johnstone & Page, 2004; Oliver & Page, 2003, 2008). Several fMRI studies also have begun to reveal the neural mechanisms associated with the impact of engaging such strategies (Dolcos et al., 2020b; Dorfel et al., 2014; Ferri et al., 2013; Kanske, Heissler, Schonfelder, Bongers, & Wessa, 2011; McRae et al., 2010; for recent meta-analysis, see Morawetz et al., 2017). Notably, attentional deployment strategies such as FA may be considered more efficient than other strategies such as cognitive reappraisal in controlling emotional responses, because their fast deployment enables intervention during earlier stages of the emotion-generative sequence (Paul, Simon, Kniesche, Kathmann, & Endrass, 2013; Thiruchselvam, Blechert, Sheppes, Rydstrom, & Gross, 2011). In fact, current evidence suggests that attentional deployment is effective in modulating emotional responses even when it is engaged substantially after the onset of emotion-eliciting stimuli (Sheppes & Meiran, 2007). This makes attentional deployment particularly useful in real-life situations in which individuals may unexpectedly encounter highly emotional stimuli. In addition, attentional deployment may also be a useful tool for those who have limited cognitive resources. When given a choice to use different ER strategies in down-regulating emotional responses to negative stimuli, those participants exhibiting lower executive control abilities were more likely to choose an attentional deployment strategy over reappraisal (Scheibe, Sheppes, & Staudinger, 2015).

Among the ER strategies typically classified as being part of the attentional deployment family, FA may be especially well suited for prompt regulation of emotional responses in the natural environment. As an attention-based ER strategy, previous studies have demonstrated the efficacy of cognitive distraction, which involves the engagement of cognitive tasks to distract from and mitigate the impact of negative emotions (Kanske et al., 2011; McRae et al., 2010; Nolen-Hoeksema, Wisco, & Lyubomirsky, 2008; Sheppes, Scheibe, Suri, & Gross, 2011). For instance, in the face of an anxiety-inducing stimulus (e.g., roller-coaster ride), one might choose to close his eyes and think about the time they felt happy and remember all the details associated with the event, to distract themselves from emotions triggered by the imminent rush of adrenaline. However, it is important to note that using cognitive distraction as an ER strategy typically requires access to a sufficiently demanding secondary task, which may not be readily available in everyday life. FA, involving attentional shifts between emotional and non-emotional aspects within the experienced context, may therefore be a more practical means to regulate emotions in a variety of real-world situations.

Self-guided Focused Attention as an Emotion Regulation Strategy

Current evidence regarding the impact of FA on subjective emotional experience and its associated neural mechanisms is still scarce (Ferri et al., 2013; Urry, 2010). The few studies examining the neurobehavioral mechanisms of FA instructed participants to process emotional scenes by artificially introducing effects that do not exist in real life, thus reducing the ecological validity of the manipulations. For instance, in one study participants were asked to view emotional images under an arousing or a non-arousing focus, the latter of which was achieved by prompting them to look at and pay attention to non-arousing areas marked by a circle superimposed on each image (Ferri et al., 2013). In another study, participants were prompted to look at the targeted (e.g., nonemotional) component of each image within which all other (e.g., emotional) areas were blurred (Urry, 2010). Both of these manipulations resulted in reliable changes in subjective emotional experience by directing participants to look at certain highlighted aspects of the stimulus after its onset (i.e., externally-guided FA), but again these approaches have limited applicability beyond the laboratory setting. Altogether, converging evidence suggests that attentional deployment strategies, including cognitive distraction and externally-guided FA, may be highly useful, but a more ecologically valid manipulation of FA may be to instruct participants to guide their attention to focus on different aspects of an emotion-eliciting stimulus in a self-guided manner (Dolcos et al., 2020a).

To our knowledge, no published study has investigated the neural mechanisms associated with the effect of self-guided FA on subjective emotional experience associated with the viewing of emotion-eliciting stimuli. However, the aforementioned investigations of externally-guided FA (Ferri et al., 2013; Urry, 2010) and of cognitive distraction (Kanske et al., 2011; McRae et al., 2010), along with previous studies from our group examining self-guided FA in autobiographical recollection (Denkova, Dolcos, & Dolcos, 2013a, 2013b; Denkova et al., 2015; Iordan et al., 2019), all demonstrated that the engagement of these attention-based ER strategies is associated with modulation of activity in brain regions typically involved in top-down control and bottom-up emotion processing. Specifically, externally guided focus on non-arousing aspects of emotional images (Ferri et al., 2013) was associated with increased activity in regions including the dorsolateral prefrontal cortex (dlPFC) and the lateral parietal cortex (LPC), typically implicated in top-down executive control, coupled with decreased activity in regions including the amygdala (AMY) and visual association areas, typically activated during basic emotion processing. In addition, the engagement of self-guided FA during autobiographical recollection showed that instructing participants to focus away from emotional aspects and on other nonemotional contextual aspects (e.g., when/where the event happened, who else was involved) was also associated with increased activity in regions associated with ER and regions involved in processing contextual details (parahippocampal place area [PPA]), coupled with decreased activity in the AMY (Denkova et al., 2015; Iordan et al., 2019).

Overall, while differences can be noted, these findings are consistent with available evidence concerning the neural correlates of other ER strategies, such as cognitive reappraisal and emotional suppression (Dorfel et al., 2014; Kohn et al., 2014), suggesting that various ER strategies are in part subserved by a common network of brain regions, with top-down control regions likely modulating activity in bottom-up emotional and perceptual regions. Notably, these opposing patterns of neural activity in regions associated with top-down vs. bottom-up processing are overall consistent with the involvement of large-scale neural systems associated with executive control (e.g., dlPFC, LPC, as part of a dorsal executive system) vs. emotion processing (e.g., AMY, as part of a ventral affective system), linked to the processing of and coping with emotional distraction (Anticevic, Repovs, & Barch, 2010; Dolcos et al., 2006; Dolcos & McCarthy, 2006; for reviews, see Dolcos, Iordan, & Dolcos, 2011; Iordan, Dolcos, & Dolcos, 2013). That is, reduced distraction is associated with incresed activity in executive regions and reduced activity in basic emotion processing regions. It should be noted that the dorso-ventral dissociation linked to top-down vs. bottom-up processing is not absolute, as some of the ventral PFC regions, both lateral (ventrolateral PFC, vlPFCFootnote 2) and medial (ventromedial PFC, vmPFC), also are involved in top-down processing of external (Iordan et al., 2013) and internal emotional stimuli (Denkova et al., 2015; Iordan et al., 2019).

The Present Study

Despite the potential advantages of self-guided FA over other ER strategies, it remains unclear how engaging this strategy influences the subjective experience of emotions associated with processing of external emotional stimuli, and how the engagement of this particular ER strategy is linked to changes in brain activity in regions typically associated with emotion processing and regulation. Clarification of this issue is important in further understanding the utility of self-guided FA as an ER strategy and the associated neural mechanisms with increased specificity. To fill this important gap in the literature, the present study investigated whether the subjective emotional experience associated with viewing external emotional stimuli (i.e., images) is mitigated by engaging self-guided FA. The associated neural mechanisms were investigated by recording fMRI data during an emotional rating task, during which participants were shown composite negative and neutral images with distinguishable foreground (FG) and background (BG) areas and were instructed to focus either on the FG or BG content of the images.

Based on the available evidence reviewed above, we formulated the following hypotheses regarding the impact of self-guided FA on the subjective experience of emotions and the associated neural mechanisms. First, we expected that focusing on the contextual aspects and away from the emotional aspects of negative images would reduce the subjective emotional experience associated with the viewing of these images. Second, at the neural level, we expected that focusing on the contextual aspects of negative images would be associated with increased activity in fronto-parietal regions (dlPFC, lateral parietal cortex [LPC]), implicated in top-down attentional/executive control, and decreased activity in the AMY, implicated in bottom-up emotion processing. Third, aside from this dissociation between emotion control and emotion processing regions, we also tested the hypothesis that focusing on the contextual aspects of negative images would be associated with differential activity in category-selective higher-order visual processing areas, such as the PPA, which has been identified in studies of self-guided FA with internal stimuli (Denkova et al., 2015; Iordan et al., 2019).

Methods

Participants

A total of 24 adult females participated in this study (Mage = 34.0, SDage = 4.76). Only females were recruited because they were part of a larger study investigating mother-child dyads. All participants were healthy, right-handed, native English speakers (88% white, 8% hispanic, 4% black), with no recent history of psychiatric or neurological conditions. Data from one participant were excluded from the behavioral analysis due to software malfunction during data collection, and data from three additional participants also were excluded from the eye-tracking, behavioral, and fMRI analyses due to outlier responses in their emotional ratings (based on a z-standardization criterion of >2.5) (Seo, 2006). Hence, the behavioral analyses were performed on data from 20 participants (Mage = 33.85, SDage = 4.82), and the eye-tracking and fMRI analyses were performed on data from 21 participants (Mage = 34.24, SDage = 4.90). All participants provided written informed consent under a protocol approved by the Institutional Review Board at the University of Illinois. Each participant was financially compensated for the time spent in the study ($15/hour for time spent in the MRI scanner and $10/hour for time spent outside of the scanner), over two sessions lasting approximately 2 hours each. During the first session, participants performed an emotional rating task in the MRI scanner, and the second session involved behavioral assessments, which also included a surprise memory test (see below).

Emotional Rating Task with FA Manipulation

Functional MRI and eye-movement data were recorded while participants viewed and rated a total of 90 composite images (60 negative and 30 neutral). Each composite image was created by overlaying a negative or neutral FG component upon a visually complex BG component. The FG components were extracted from images part of the International Affective Picture System (IAPS; Lang, Bradley, & Cuthbert, 2008), the Geneva Affective Picture Database (GAPED; Dan-Glauser & Scherer, 2011), the Military Affective Picture System (MAPS; Goodman, Katz, & Dretsch, 2016), the Nencki Affective Picture System (NAPS; Marchewka, Zurawski, Jednorog, & Grabowska, 2014), and the Emotional Picture Set (EmoPicS; Wessa et al., 2010). Images from these sources, along with domain-free online image databases, were used as BG components. Negative and neutral composite images were matched for human presence, animacy, FG location (i.e., top, bottom, left, right), complexity, brightness, and contrast (all p values > 0.05). To obtain data regarding the basic affective properties of these images, we conducted a validation study using an independent sample of participants (N = 16; 10 females; Mage = 19.30, SDage = 1.30), who were asked to passively view and rate these images in terms of perceived valence, arousal, and visual complexity on 9-point Likert scales. Based on the data obtained from this validation sample, we made sure that our negative and neutral images were significantly different both in terms of valence and arousal. Specifically, negative images were reliably more negatively valenced (MValence = 2.46, SDValence = 0.79) and more arousing (MArousal = 4.95, SDArousal = 1.05) than the neutral images (MValence = 4.79, SDValence = 0.48; MArousal = 2.17, SDArousal = 0.46) (ps < 0.001). We also made sure that the images assigned to the FG and BG conditions did not differ in these properties (all ps > 0.05).

The pool of 90 images was divided into sets of images that were randomly assigned to five study runs, counterbalanced across image type and attentional cue categories. The run orders were randomly assigned to the participants. To avoid negative mood induction, the order of trials was counterbalanced within each run, such that no more than three images of the same emotional category or cue type were presented consecutively. Each image was presented for 4 s and then was removed to minimize possible confounding effects of eye movements associated with prolonged scanning of images (Figure 1). Approximately 4 days later, participants returned for a surprise memory test. Analyses of the data acquired during this follow-up session have been reported elsewhere (Dolcos et al., 2020b).

Figure 1.
figure 1

Diagram of the Emotional Rating Task with FA Manipulation. Participants were asked to view each image as cued by a preceding instruction screen (FG Focus or BG Focus). After viewing each image, participants rated their emotional experience on a 5-point scale (1 = “Not Negative at All”; 5 = “Very Negative”).

Participants were asked to view each image under different attentional manipulation conditions, cued by the preceding instruction screen. The cues, presented for 0.5 s, directed them to focus either on the image foreground (FG Focus), which was emotional or neutral, or on the image background (BG Focus), which was always neutral. Following the cue, each negative and neutral image was presented for 4 s. Half of both negative and neutral images were preceded by FG Focus cues and the other half by BG Focus cues, and the focus cues were counterbalanced across participants. Following image presentation, participants were asked to rate their subjective emotional experience triggered by the images on a 5-point scale (1 = Not Negative at All, 5 = Very Negative). All responses were made on a response pad attached to the participant’s right hand. The trial ended with an inter-trial interval of 9.5 s, allowing the hemodynamic response to return to baseline.

Eye-Tracking Data Acquisition

To assess participants’ gaze patterns during the emotional rating task, eye positions and movements were recorded from each participant’s right eye using an MR-compatible model (with a long-range mount) of the Eyelink1000 system (SR Research, ON, Canada), at a sampling rate of 1,000 Hz. A pseudorandom 9-point calibration was performed at the beginning of the experimental session and after every other experimental block. The size of the LCD monitor was 23.25 (height) by 41.33 (width) inches, which corresponded to approximately 8.94 (height) and 15.89 (width) degrees of visual angle at the participant’s viewing distance (to the mirror) of approximately 3 inches.

fMRI Data Acquisition

Scanning was conducted on a 3 T Siemens MAGNETOM scanner with a 64-channel head coil, at the Beckman Institute’s Biomedical Imaging Center. After the sagittal localizer and the 3D MPRAGE anatomical images (TR = 2000 ms; TE = 2.25 ms; flip angle = 8°; FOV = 230 × 230 mm2, matrix size = 256 × 256 mm2; slice thickness = 1 mm; volume size = 172 slices; voxel size = 1 mm isotropic), 5 blocks/runs of whole-brain EPI functional images were acquired axially with a simultaneous multi-slice (SMS) sequence (TR = 1,500 ms, TE = 30 ms; flip angle = 40°; FOV = 230 × 230 mm2; matrix size = 144 × 144 mm2; slice thickness = 1.6 mm; volume size = 76 slices; multi-band acceleration factor = 4, voxel size = 1.6 mm isotropic; phase encoding direction from anterior to posterior).

Manipulation Check and Behavioral Data Analyses

To confirm that participants oriented their gaze as instructed, we calculated the proportion of time spent on fixations within the FG and BG areas for each trial using the EyeLink Data Viewer. These data were compared across Image Type (Negative vs. Neutral) and FA Manipulation (FG Focus vs. BG Focus) conditions. Similarly, to assess the behavioral impact of the FA manipulation on the subjective experience of emotion, repeated-measures analyses of variance (ANOVAs) and paired t-tests were performed to compare emotional ratings across Image Type (Negative vs. Neutral) and FA Manipulation (FG Focus vs. BG Focus) conditions. Behavioral and eye-tracking data were analyzed using the SPSS software (IBM Corp. 2017. Version 25.0); the effect sizes for the paired t-tests (Cohen’s drm) were calculated using the equations detailed by Lakens (2013).

fMRI Data Analyses

Preprocessing of the fMRI data was performed using SPM12 (Wellcome Department of Cognitive Neurology, London, UK). Functional images were first corrected for acquisition order and realigned to correct for motion artifacts. Next, the high-resolution anatomical image was co-registered with the first functional image for each participant, and functional images were spatially normalized (resampled to 2-mm isotropic voxels) to the Montreal Neurological Institute (MNI152) template. Lastly, the functional images were spatially smoothed using a 6-mm Gaussian kernel full-width-at-half-maximum (FWHM), to increase the signal-to-noise ratio. For subject-level analyses, evoked hemodynamic responses during the image presentation period in each trial were modeled by convolution with a canonical hemodynamic response function. These data were then analyzed in a general linear model framework using the attentional manipulation and the image type condition, as well as the six motion parameters as regressors.

For group-level analyses, targeted voxel-wise random-effects analyses were conducted to identify brain regions showing differential brain activation linked to the image type and attentional manipulation conditions. First, for replication purposes, given the present manipulation involving processing of emotional and neutral stimuli, brain regions involved in processing emotional visual stimuli were identified by contrasting the condition that produced the strongest emotional responses with the neutral condition (Emotional FG Focus > Neutral). Then, to investigate the effects of FA manipulation, and paralleling the behavioral comparisons, further targeted contrasts were performed as follows. The brain regions showing reduced response as a result of engaging FA as an ER strategy were identified by the contrasts between the Emotional FG Focus minus Emotional BG Focus (EmoFG > EmoBG). To make sure that this contrast identifies brain regions showing a reduction in activity linked to the FA manipulation in regions that are also engaged by emotion processing, the EmoFG > EmoBG contrast was masked with the functional map identified by the contrast mentioned above for specifying brain regions involved in processing emotional visual stimuli (EmoFG > Neu). Finally, brain regions showing increased response as a result of engaging FA to focus away from the negative content of images were identified by the reverse contrast (EmoBG > EmoFG). An intensity threshold of p < 0.005 and an extent threshold of 46 contiguous voxels were used for the group-level targeted contrasts (e.g., EmoFG > EmoBG, EmoBG > EmoFG). These correspond to a threshold corrected for multiple comparisons at p < 0.05, as determined by a Monte Carlo Simulation (1000 iterations; Slotnick, 2017; Slotnick, Moo, Segal, & Hart, 2003). In the tables, we also denote results that survive a threshold cluster-corrected for false discovery rate at p < 0.05 (Benjamini & Hochberg, 1995; Genovese, Lazar, & Nichols, 2002). Additionally, an exploratory extent threshold of 30 contiguous voxels was employed, and notations were provided in the tables to indicate such exceptions (Allard & Kensinger, 2014). Finally, in-house custom MATLAB scripts developed at Duke University and adapted for the purpose of the present investigation also were used to extract the time course of fMRI responses, as previously employed in published work (Denkova et al., 2010; Dolcos, Diaz-Granados, Wang, & McCarthy, 2008; Dolcos et al., 2013; Dolcos & McCarthy, 2006; Iordan & Dolcos, 2017; Iordan et al., 2019; Morey et al., 2009).

Results

Manipulation Check: Eye-tracking Results

Supporting successful engagement of FA, eye-tracking data showed that participants directed their gaze toward the target components of both emotional and neutral images. Specifically, participants spent more time gazing within the FG areas when instructed to do so (FG Focus) than when instructed to focus on the BG areas (BG Focus), for both emotional (t(20) = 8.33, p < 0.001, drm = 2.60) and neutral (t(20) = 10.01, p < 0.001, drm = 3.51) images. Similarly, participants spent more time gazing within the BG areas when instructed to do so (BG Focus) than when instructed to focus on the FG areas (FG Focus), for both emotional (t(20) = 9.57, p < 0.001, drm = 3.25) and neutral (t(20) = 10.10, p < 0.001, drm = 3.74) images. Analyses of eye-tracking data also showed that participants were less able to focus on the neutral background when the corresponding foreground was negative. This effect was revealed by a repeated-measures ANOVA, with Image Type (negative, neutral) and Attentional Focus (FG focus, BG focus) as factors, showing a marginally significant interaction (F(1, 20) = 4.29; p = 0.052, ηp2 = 0.18). Post hoc t-tests showed that gaze time within the FG area was significantly greater for negative than for neutral images under the BG focus (t(20) = 2.91, p = 0.009), which may reflect increased difficulty to keep the attentional focus away from the emotional content of images (Dolcos et al., 2020a). Findings from fMRI data analyses provided further evidence that participants complied with task instructions (see below).

Behavioral Results: Reduced Subjective Emotional Experience following Self-Guided FA

Confirming our first prediction, there was an overall reduction in the emotional ratings for the BG Focus compared with FG Focus condition, and this was driven by reduced ratings for the negative images (Figure 2; Table 1). A repeated-measures ANOVA revealed a main effect of Image Type [Negative vs. Neutral; F(1, 19) = 577.98, p < 0.001, ηp2 = 0.97] and a main effect of Attentional Focus [FG Focus vs. BG Focus; F(1, 19) = 212.51, p < 0.001, ηp2 = 0.92]. There also was a significant Image Type × Attentional Focus interaction [F(1, 19) = 260.37, p < 0.001, ηp2 = 0.93], which was driven by a significant reduction in ratings under BG Focus compared with FG Focus for negative images [t(19) = −17.32, p < 0.001, drm = 4.66], but not for neutral images [t(19) = −0.04, p = 0.97, drm = −0.01].

Figure 2.
figure 2

Reduced Emotional Ratings following FA Instructions. Instructions to focus on non-emotional contextual content (BG Focus) resulted in reduced emotional ratings, and this effect was driven by the emotional pictures. FG, Foreground; BG, Background; NS, not significant. Error bars indicate the standard error of the emotional rating means for each condition. ***p < 0.001.

Table 1. Eye-tracking and behavioral results

fMRI Results: Opposing Patterns of Response in Executive, Affective, and Perceptual Brain Regions, following FA

First, for replication purposes, given the present manipulation involving processing of emotional and neutral stimuli, brain regions involved in processing emotional visual stimuli were identified by contrasting the condition associated with increased emotion processing (Emotional FG Focus) and the neutral pictures condition (EmoFG > Neu). Consistent with previous studies, this analysis identified a host of brain regions that are part of the emotion processing network, such as the AMY and the ventrolateral PFC (vlPFC), as well as brain regions involved in visual processing that are also responsive to the emotional content of stimuli, such as the temporo-occipital cortex and the fusiform gyrus (Table 2).

Table 2. Brain regions showing sensitivity to the emotional content of images

Dissociable activity between executive vs. affective processing regions

Confirming our second and third hypotheses, focusing on the contextual aspects of negative images was associated with increased activity in regions, typically implicated in top-down executive control, including the dlPFC and LPC, and with decreased activity in regions typically implicated in bottom-up emotion processing, including the AMY. Focusing on the contextual aspects was also associated with decreased activity in bilateral vlPFC (BA 45). Dissociable activity by FA also was identified in occipital regions, with superior areas showing increased activity and inferior occipital areas showing decreased activity when focusing on non-emotional contextual details of images (Figure 3; Tables 3-4).

Figure 3.
figure 3

Dissociable activity in executive vs. emotion processing regions by the engagement of self-guided FA. Focusing on contextual aspects of negative images (BG Focus) was associated with increased activity in regions including the dlPFC, LPC, and superior occipital cortex (SOC), coupled with decreased activity in regions including the vlPFC, lateral occipital cortex (LOC), and amygdala (not shown). These activation maps were identified based on the Emotional-BG > Emotional-FG and Emotional-FG > Emotional-BG contrasts, respectively (see Methods section); for illustration purposes, a voxel-wise intensity threshold of p < 0.01 (uncorrected) was used. Time courses represent changes in the average fMRI signal extracted from peak and neighboring voxels; given the similarity in response in homologous regions across the two hemispheres, the displayed fMRI signals were averaged from the left and right activation clusters. Notably, increased activity for the BG condition is equivalent to decreased activity for the FG condition. Hence, displaying the cold-colored clusters as reflecting the EmoFG < EmoBG contrast allows consistency with the neuroimaging literature which typically shows increases using warm colors and decreases via cold colors. L, Left Hemisphere; R, Right Hemisphere; FG, Foreground; BG, Background; dlPFC, Dorsolateral Prefrontal Cortex; vlPFC, Ventrolateral Prefrontal Cortex; LPC, Lateral Parietal Cortex; SOC, Superior Occipital Cortex; LOC, Lateral Occipital Cortex; PPA, Parahippocampal Place Area.

Table 3. Brain regions showing increased activity when focusing on the contextual aspects of the negative images
Table 4. Brain regions showing increased activation when focusing on the emotional aspects of the negative images

Dissociable activity in higher-order visual processing regions

In addition to the dissociation between brain regions typically involved in executive vs. emotion processing, the engagement of self-guided FA was also linked to dissociable patterns of activity in regions involved in category-selective higher-order visual processing, with increased vs. decreased activity in the parahippocampal vs. fusiform gyri, respectively, when focusing on non-emotional contextual details of images (Figure 4). This provides further evidence that our task manipulation was indeed successful, as the BGs of images mainly consisted of scenes, and the FGs mainly consisted of faces.

Figure 4.
figure 4

Dissociable activity in perceptual processing regions by the engagement of self-guided FA. Focusing on the contextual aspects of negative images was associated with increased vs. decreased activity in the parahippocampal place area (PPA) vs. fusiform face area (FFA) and temporo-occipital cortex (TOC), respectively. Additional dissociable activity was also observed in the medial posterior regions between the precuneus (PreC)/superior parietal cortex (SPC) and posterior cingulate cortex (PCC). These activation maps were identified based on the Emotional-BG > Emotional-FG and Emotional-FG > Emotional-BG contrasts, respectively (see Methods section); for illustration purposes, a voxel-wise intensity threshold of p < 0.01 (uncorrected) was used. Time courses represent changes in the average fMRI signal extracted from peak and neighboring voxels; given the similarity in response in homologous regions across the two hemispheres, the displayed fMRI signals were averaged from the left and right activation clusters. Notably, increased activity for the BG condition is equivalent to decreased activity for the FG condition. Hence, displaying the cold-colored clusters as reflecting the EmoFG < EmoBG contrast allows consistency with the neuroimaging literature which typically shows increases using warm colors and decreases via cold colors. L, Left Hemisphere; R, Right Hemisphere; FG, Foreground; BG, Background.

Discussion

The present report investigated the neural mechanisms associated with the impact of self-guided FA on the subjective emotional experience triggered by viewing unpleasant pictures. There were three main novel findings. First, focusing on contextual aspects reduced subjective experience of emotions associated with the viewing of unpleasant pictures. Second, at the neural level, this behavioral effect was associated with opposing patterns of brain activity identifying increased vs. decreased responses in regions implicated in attentional/executive control (e.g., dlPFC, LPC) vs. emotion processing (e.g., AMY, vlPFC). Third, dissociable patterns of brain activity were also identified in higher-order visual areas, showing increased vs. decreased responses in parahippocampal vs. fusiform gyri, respectively, when focusing on non-emotional contextual details of images. These findings are discussed in turn below.

Reduced Subjective Emotional Experience following Self-Guided FA

Extending available evidence regarding the effect of FA (Ferri et al., 2013; Urry, 2010), our results showed that focusing on the contextual aspects and away from the emotional aspects of negative (but not neutral) images decreased self-reported emotional experiences associated with the viewing of such stimuli. This is overall consistent with recent theoretical frameworks that highlight the importance of the situational context in the effectiveness of ER strategies (Aldao, 2013; Aldao & Tull, 2015). For example, in one study, participants’ use of adaptive ER strategies was highly variable across different situations (e.g., social vs. achievement-related situations; Aldao & Nolen-Hoeksema, 2012). Our results extend these models by demonstrating the efficacy of focusing on visual contextual details within a specific ER strategy, and its effect in reducing the associated negative emotional responses. These results complement evidence identified by our recent investigations (Denkova et al., 2015; Iordan et al., 2019) by showing that self-guided FA is effective not only in reducing negative emotional experiences associated with the recollection of unpleasant autobiographical memories, but also when employed while viewing unpleasant pictorial stimuli.

The effectiveness of attentional deployment strategies in regulating emotions has been demonstrated previously (Wadlinger & Isaacowitz, 2011; Webb et al., 2012), most commonly in the form of cognitive distraction (e.g., diverting attention to something task-irrelevant; Oliver & Page, 2008). However, some studies also identified the efficacy of externally guided FA in modulating emotional responses to pictorial stimuli (Ferri et al., 2013; Urry, 2010), but these latter studies employed experimental manipulations that involved the addition of guiding visual cues that do not exist in real life, thus reducing the ecological validity of the findings. By overcoming this limitation, and together with the findings from our studies of FA with internal emotional stimulation mentioned above, the present results show that self-guided FA can be a powerful tool in mitigating the subjective experience of negative emotions evoked by both internal and external emotional stimuli. This is important, because this ER strategy, which simply involved shifting of attentional focus within the same stimulus without forcibly directing participants where to look, does not require exhausting processing resources and thus can be easily deployed to cope with unexpected emotional challenges.

Dissociable Activity between Executive vs. Affective Processing Regions, following FA

Linked to these behavioral effects, the engagement of self-guided FA to focus on the contextual aspects of negative images was associated with increased activity in regions including the dlPFC, LPC, and SOC, coupled with decreased activity in regions including the AMY, vlPFC, and LOC. These findings are overall consistent with available evidence identifying the involvement of these regions linked to the engagement of attentional deployment ER strategies, in general (Dorfel et al., 2014; Kanske, Heissler, Schonfelder, Bongers, & Wessa, 2011; McRae et al., 2010), and in externally guided FA, in particular (Ferri et al., 2013). In addition, the observed opposing pattern of activity is also consistent with our prior work on emotional distraction, showing that reduced distraction is associated with incresed activity in executive regions and decreased activity in basic emotion processing regions; these regions are part of large-scale dorsal vs. ventral neural systems, respectively (Dolcos et al., 2006; Dolcos & McCarthy, 2006; Dolcos et al., 2011; Iordan et al., 2013; Iordan et al., 2019). Notably, as previously emphasized (Iordan and Dolcos 2017; Iordan et al., 2019), there are considerable overlaps between these larger neural systems, identified through task manipulations of emotional distraction, and the large-scale functional networks, commonly identified through examination of resting-state functional connectivity—i.e., fronto-parietal/dorsal attention/central-executive vs. salience/ventral attention/cingulo-opercular, respectively (Corbetta and Shulman, 2002; Dosenbach et al., 2007; Dosenbach et al., 2006; Power et al., 2011; Seeley et al., 2007; Yeo et al., 2011). Hence, it is not surprising that voluntarily focusing attention away from emotionally salient information is associated with increased engagement of brain regions involved in top-down attentional control and reduced response in regions involved in bottom-up processing of saliency.

Due to the ability of emotional stimuli to rapidly “capture attention” and reallocate processing resources (Anderson & Phelps, 2001; Pourtois, Schwartz, Seghier, Lazeyras, & Vuilleumier, 2006; Shafer et al., 2012), it is possible that focusing away from the emotional aspects of negative images is effortful, resulting in greater engagement of brain regions associated with strategic top-down control. Voluntarily engaging FA requires increased recruitment of regions, such as the fronto-parietal regions, typically implicated in cognitive/executive control (Seeley et al., 2007; Vincent, Kahn, Snyder, Raichle, & Buckner, 2008; Yeo et al., 2011). Although the current results do not allow for the interpretation of causality, it is possible that these regions that are important for executive control in turn modulate activity in emotion processing regions, such as the AMY and vlPFC, leading to the reduction in subjective emotional experience when viewing negative images by focusing on their context. Notably, previous studies have demonstrated functional heterogeneity in the vlPFC area (Cai et al., 2014; Gordon et al., 2016; Levy & Wagner, 2011; Power et al., 2011; Warren et al., 2014), characterizing this region as a site of cross-modal integration involved in a wide range of affective and cognitive/executive processes. Available evidence points to the vlPFC’s involvement not only in the generation and upregulation of emotion (Lindquist et al., 2012; Morawetz et al., 2017; Phan et al., 2002; Wager et al., 2008) but also in cognitive control and emotion downregulation (Badre & Wagner, 2007; Buhle et al., 2014; Iordan et al., 2013; Kober et al., 2008; Kohn et al., 2014; Ochsner et al., 2012). Therefore, another possibility is that increased vlPFC activity when focusing on the emotional aspects of the images may, in part, also reflect some form of less effortful ER, possibly to avoid negative imagery as a result of having to maintain attention on aversive material. This is consistent with available evidence showing that the vlPFC is involved in spontaneous coping with distracting emotions (Dolcos et al., 2006; Dolcos & McCarthy, 2006).

Interestingly, together with evidence from other FA studies, the present findings also point to a possible dissociation in the PFC recruitment associated with the source of emotional stimuli. Specifically, a recent investigation from our group showed that focusing away from the emotional aspects during negative autobiographical memory recollection was associated with increased activity in the vmPFC, which mediated the link between AMY activity and emotional ratings of negative memories (Denkova et al., 2015). This finding, coupled with the present results involving the dlPFC, suggests the intriguing possibility that the engagement of self-guided FA to regulate emotional responses to internal (e.g., recollected personal memories) vs. external (e.g., viewed pictures) stimuli is subserved by partially dissociable mechanisms involving medial vs. lateral PFC regions, respectively. This is consistent with available evidence identifying associations between intrinsic functional connectivity of the medial vs. lateral anterior PFC and one’s evaluations of their performance on memory vs. perceptual decision tasks, respectively (Baird, Smallwood, Gorgolewski, & Margulies, 2013), thus pointing to differential contributions of the medial and lateral PFC subdivisions to internal and external modes of processing. Future studies of self-guided FA should clarify this issue by directly contrasting brain responses to internal and external stimuli within the same sample.

Dissociable Activity in Higher-Order Visual Processing Regions, following FA

In addition to the executive vs. affective dissociation discussed above, self-guided FA also was associated with opposing patterns of activity in higher-order visual processing regions (i.e., parahippocampal and fusiform gyri). On the one hand, focusing on the contextual aspects was associated with increased activity in the PPA, which is consistent with recent evidence regarding the involvement of this region in processing spatial and non-spatial contextual information (Diana, 2017) and with FA investigations from our group (Denkova et al., 2015; Iordan et al., 2019) identifying the involvement of this region in processing contextual aspects of recollected emotional autobiographical memories. On the other hand, focusing on the emotional aspects of negative images was associated with increased activity in the fusiform gyrus, which is consistent with available evidence identifying a specialized role of this region in face processing (Kanwisher, McDermott, & Chun, 1997) as well as sensitivity to emotional facial expressions (Denkova et al., 2010; Kawasaki et al., 2012).

These findings raise an intriguing possibility that focusing on contextual vs. emotional aspects of negative pictorial material is linked to opposing modulations of category/domain-specific visual processing in these regions. However, because the FGs of the composite images often contained faces and the BGs represented scenes, the current experimental design does not allow us to determine the extent to which the observed dissociation in higher-order visual processing regions reflects top-down influences linked to the engagement of FA or differential specificity in basic perceptual processing of scenes vs. faces, respectively (Anderson, Christoff, Panitz, De Rosa, & Gabrieli, 2003; Epstein & Kanwisher, 1998). More research is needed to clarify this aspect, for instance, by examining how the degree of such dissociations in these higher-order perceptual regions relates to behavioral indices of self-guided FA. Nevertheless, this finding provides compelling evidence that participants did indeed comply with instructions to focus on the foreground or background aspects of images.

Caveats

One caveat of the present study is that the emotional content was always presented in the FG component of the negative images. To completely counterbalance the factors of interest, it would have been more ideal to also manipulate the emotional content of the BG areas. Although this may influence responses in specific brain regions (PPA vs. FFA), it does not limit the overall conclusion of the present study, given that we carefully selected and created our emotional and neutral composite stimuli, to allow a direct comparison of their effects. Other potential limitations are that the present study did not use positive emotional stimuli, and that the subject sample consisted solely of female participants, limiting the generalizability of the findings. Finally, given the relatively small sample of the current study, future studies of FA should aim for larger sample sizes, to reduce susceptibility to Type I and Type II errors (Cremers, Wager, & Yarkoni, 2017). It is worth noting that sample sizes similar to that of the present studies are not uncommon in the broader ER literature (Kanske et al., 2011; McRae et al., 2010) and have yielded robust fMRI findings in previous FA studies (Denkova et al., 2015; Iordan et al., 2019).

Conclusions

The present study identified novel neurobehavioral effects of self-guided FA, shedding light on how simply focusing away from the emotional details of pictorial stimuli toward nonemotional contextual details modulates the subjective experience of emotions. At the behavioral level, self-guided FA reduced the subjective ratings of negative images. Paralleling this behavioral effect, fMRI results showed that focusing on the contextual aspects and away from the emotional aspects of negative images was associated with increased activity in executive regions as well as decreased activity in emotion processing regions. Engaging self-guided FA was also associated with modulations of activity in the parahippocampal and fusiform gyri, possibly reflecting domain-specific visuo-attentional and perceptual processing. Overall, our present results extend current evidence regarding the impact of attentional deployment strategies on subjective emotional experiences associated with processing of external stimuli, by showing that self-guided FA reduces negative emotional experience, possibly through increased involvement of top-down executive regions that modulate activity in bottom-up emotion processing regions. These findings advance our understanding of self-guided FA as an emotion control strategy and provide insights into possible ways of modulating the impact of distressing emotional stimuli to enhance emotional well-being (Dolcos et al., 2020c). Clarification of the neurobehavioral mechanisms of self-guided FA has important implications for understanding healthy functioning and affective disorders in which excessive attention to negative stimuli and rumination on negative memories often are among the core debilitating features.