Cognitive control is the mental process that allows adaptive goal-directed behavior, and its principal function is to contain or inhibit prevalent responses to maintain focus on current goals (Inzlicht & Al-Khindi, 2012; Koechlin et al., 2003; Miller & Cohen, 2001). Convergent evidence from cognitive neuroscience points out that mental conflict generates control efforts (Inzlicht et al., 2015; Schiffer et al., 2015). Conflict monitoring theory suggests that the monitoring system is vital in analyzing the actual representations of action tendencies for potential conflicts. Thus, inhibitory mechanisms may be engaged to override the unwanted bias and promote active goal pursuit (Botvinick, 2007; Shenhav et al., 2016; Yeung, 2014).

Knowing that cognitive control begins with the appearance of conflict, it also is relevant to point out that conflict is not affectively neutral. Conflict represents an aversive event for the organism and includes negative affective states and emotional costs (Dreisbach & Fischer, 2012). Consistent with this view, Inzlicht et al. (2015) suggested that negative affect is an integral, instantiating aspect of cognitive control. Therefore, it states that cognitive control depends on emotion or its properties, such as arousal and valence.

Akin to conflict monitoring, error monitoring detects and signals errors to optimize behaviors across various tasks and situations. Error detection is necessary for adaptive behavioral adjustments (Moser et al., 2013). An organism can use it to inform behavioral strategies to achieve higher accuracy or preserve the executed task speed (Ridderinkhof et al., 2004; Zhou et al., 2019). For instance, error monitoring is correlated with the number of response alternatives in experimental tasks (Maier et al., 2010) and the difficulty of the chosen task. More complicated tasks provoke more errors (Riesel et al., 2013), and increasing the number of alternative responses in such tasks decreases the response monitoring mechanisms (Riesel et al., 2013). That is due to the cognitive overload on the different strategies, neural structures (Prefrontal Cortex, Motor Cortex, Basal Ganglia), and functions (hold and manipulate information) that are involved in error commissioning (Hoffmann & Beste, 2015).

As in conflict, the detection of committed errors is regularly accompanied by some negative emotional response (Hajcak et al., 2004; Spunt et al., 2012). Errors are unexpected and aversive endogenous events threatening the organism (Hajcak et al., 2012) and engage cognitive control to correct behavior and avoid more error commissioning. This threatening nature of errors suggests motivational salience and aversive properties (Jackson et al., 2015; Weinberg et al., 2012). Therefore, error commissioning could not be dissociated from negative affect states and the experience of aversiveness (Shackman et al., 2011). This perspective is consistent with findings linking errors to psychophysiological changes such as Heart Rate Variability (Mackersie & Calderon-Moultrie, 2016), Skin Conductance (Hajcak et al., 2003), and Electromyographic Activity (Dignath et al., 2019; Elkins-Brown et al., 2018; Hajcak & Foti, 2008).

Theoretical Frameworks of Error-Related Negativity

The Error-Related Negativity (ERN) is a component of event-related potentials (ERPs) that reaches maximum negative amplitude in frontocentral regions about 100 ms after an error has occurred in simple tasks of reaction time (Falkenstein et al., 1991; Gehring et al., 1993; Nieuwenhuis et al., 2001). The component marks the moment the brain indicates a motor error was committed, thus allowing the individual to adapt their responses and continue the task. Converging evidence from fMRI, EEG source modeling, and brain lesion research points to the dorsal portion of the anterior cingulate cortex as its site (Hajcak et al., 2012; Moser et al., 2013). The ERN has been identified as a valuable and reliable measure of partial or total detection of errors in healthy participants and individuals with various mental disorders (Maruo et al., 2016; Riesel et al., 2019; Weinberg et al., 2012).

Several cognitive and computational perspectives have attempted to explain the processing of errors and their neural signal, the ERN. According to mismatch theory, the error signal reflects a process that compares the neural representations of goal and actual responses (Bernstein et al., 1995; Coles et al., 2001; Falkenstein et al., 1991). From this perspective, the more significant the mismatch, the larger the ERN amplitude (Bernstein et al., 1995; Falkenstein et al., 1995). The reinforcement learning hypothesis (RFL) (Holroyd & Coles, 2002; Holroyd et al., 2005) suggests that errors are coded at the dopamine neurons from the basal ganglia (at the Ventral Tegmental Area) and alert the daMCC that outcomes of responses are worse than expected. Thus, the ERN is conceptualized as a reinforcement learning signal that trains the daMCC and the motor system. The role of the daMCC is to use the signal to adapt the response selection process for future better outcomes, acting as a control motor filter (Holroyd & Coles, 2002).

Another computational model, the conflict monitoring theory (Botvinick et al., 2001; Yeung et al., 2004), focuses on conflict detection rather than error detection, the ERN signals conflict. When a task requires selection among a set of responses, conflict emerges when overlapping preactivated (task representations) response sets exist. The daMCC reflects a signal of response conflict between correct and incorrect response processes. The response conflict signal (ERN) informs the Prefrontal Cortex of increasing cognitive control.

An additional, prominent model within the cognitive conceptualization of error processing is the predicted response- outcome model (Alexander & Brown, 2010). The PRO is a probabilistic model and focuses on the role of the Anterior Cingulate Cortex response–outcome based on models of reinforcement learning and their findings. The PRO model aims to explain various processes that include a more significant predicted activity for error, conflict, error likelihood, and unexpected outcomes in general (positive and negative results). Once an action is generated, the actual outcome is compared with the expected result, and any discrepancy leads to an update of the learned response–outcome predictions (Alexander & Brown, 2010).

Nonetheless, other exciting models suggest that the ERN reflects the motivational meaning or the motivational salience and aversiveness of errors (Hajcak, 2012; Hajcak & Foti, 2008; Jackson et al., 2015; Weinberg et al., 2012). The following section will account for its implications in recent literature regarding early error-detection processes.

ERN on affective states’ manipulations

When the monitoring processes evaluate error commissioning, those processes generate a signal, the ERN. Studies from the past decade have shown that this neural signal is influenced by affective and motivational factors (Hobson et al., 2014; Inzlicht & Al-Khindi, 2012; Jackson et al., 2015). In addition, errors are characterized as aversive events for the individual (Hajcak & Foti, 2008). Within this framework, the commission of errors is a distressing experience; it is perceived as threatening (Spunt et al., 2012) and engages a defensive motivational response (Hajcak, 2012; Weinberg et al., 2016). Thus, the ERN would indicate that the activation of the monitoring system is sensitive to the motivational significance and value of errors (Hajcak et al., 2005; Weinberg et al., 2016). This sensitiveness to affective and motivational context (de Bruijn et al., 2020; Riesel et al., 2012) may be a critical point in the path of understanding within and between subjects’ variations of the ERN (Levsen & Bartholow, 2018; Proudfit et al., 2013; Weinberg et al., 2012).

Two lines of research have emerged to unveil the relationships between affect and error-related negativity: manipulations with state affect and measurement of levels of trait affect. If the ERN is sensitive to state affect manipulations, its amplitudes will reflect responses to emotional context, as has been hypothesized (Inzlicht & Al-Khindi, 2012; Tullett et al., 2015; Wiswede et al., 2009; Wiswede et al., 2009). In contrast, measures of more stable levels of trait affect and individual differences would determine the ERN generation (Amodio et al., 2008; Hajcak, 2012; Riesel et al., 2019c; Weinberg & Hajcak, 2011).

Moreover, the literature extensively reported the ERN as stable and related to trait vulnerability. Trait affective measures of personality like perfectionism, neuroticism, and high in negative affect are significantly associated with increased error detection processes (Luu et al., 2000; Olvet & Hajcak, 2012; Weinberg et al., 2012; Weinberg & Hajcak, 2011). The considerable evidence of the state-independent characteristics of the ERN has even pointed it out to be considered as a reliable psychiatric endophenotype (Miller & Rockstroh, 2013; Riesel, 2019; Weinberg et al., 2012; Weinberg & Hajcak, 2011). Nonetheless, recent meta-analytic evidence (Saunders & Inzlicht, 2020) suggests that at least the relationship between trait anxiety and the ERN is smaller and more heterogenous than previously reported.

Furthermore, the connection between the ERN and affective state inductions has mixed and inconclusive data. There is growing evidence that stimuli with emotional significance can modulate the ERN (Boksem et al., 2011; Dreisbach & Fischer, 2012). Studies with experimental manipulations, such as evaluation of performance (Hajcak et al., 2005), disapproval in a social context (Boksem et al., 2011), the unpredictability of the stimuli (Jackson et al., 2015), and the administration of reward and punishment (Stürmer et al., 2011), have shown that experimental manipulations of affective states impact the ERN whether indicating the negative affect valence of errors (Aarts et al., 2013), an increasing threat value of errors (Meyer & Gawlowska, 2017; Weinberg et al., 2012), or changes in the reward prediction error (Bakic et al., 2014). In contrast, other authors have found an unaltered ERN by inducing diverse affective states (Elkins-Brown et al., 2018; Larson et al., 2013; Olvet & Hajcak, 2012).

State-trait interactions also have been regularly reported among studies. Sometimes, modulations of the ERN are not found exclusively by the experimental affective manipulations; however, when a trait is considered in the analysis, significant interactions emerge (Boksem et al., 2008; Olvet & Hajcak, 2012; Riesel et al., 2019c). Similarly, when experimental state affect inductions have significant modulations on the ERN, trait appear as a relevant moderator of the observed effect (Dikman & Allen, 2000; Maruo et al., 2016; Riesel et al., 2012). Nevertheless, additional factors, such as heterogeneity in task parameters, performance feedback, and task stimuli, seem to moderate the complex relationship between state-affect and trait-affect with the ERN (Gloe & Louis, 2021; Riesel et al., 2019a).

Insight into the role of state affect in cognitive conflict, and the ERN generation is critical to address theoretical questions and to define the nature of error processing. Therefore, this systematic review aims to synthesize the results in this area, investigating if the ERN is sensitive or can be influenced by state affect manipulations (i.e., state-dependent). We additionally explored the directions of the effects on the ERN and how the potential differences can be interpreted.

Method

This systematic review was done following the Prisma guidelines (Page et al., 2021). We conducted an automated search up to October 2021 using the PubMed, PsycINFO, and PsyArticles databases. The search string “ERN” OR “Error related negativity” OR “Ne” OR “performance monitoring” OR “error monitoring” OR “conflict monitoring” AND “affect” OR “emotion” OR “mood” OR “reward” OR “punishment” OR “affective state” OR “aversiveness” was entered in the standard search field in all the databases. Also, for these searches of electronic databases, manual searches were performed using the reference sections of published texts to find eligible articles. Studies were included if they matched inclusion criteria: 1) Experimental manipulations of a broad framework of the so-called affective states (Russell, 2003; Russell & Barrett, 1999): core Affect (valence and arousal), emotion, mood, motivation (reward and punishment), and emotion regulation; 2) measurement of the ERN; 3) Nonclinical samples; 4) Original research published in peer-review journals written in English. Two reviewers (X.N.E.) and (L.Z.) independently assessed if studies met the initial criteria. In case of discordances, a third judge was consulted, and the three judges discussed again until all three agreed. After removing duplicates, exclusion criteria were applied through seeking titles and abstracts. Reasons for exclusion of studies included: (a) theoretical and review papers, (b) studies with clinical population and substance manipulations, (c) observational studies, (d) studies with children and adolescents, and (e) studies without EEG techniques. After removing studies that met exclusion criteria, the remaining papers were fully read to identify the studies’ manipulations and the results. Finally, studies that only reported analysis by trait affect were excluded. Figure 1 describes the four phases of this review process (identifying, screening, eligibility, and inclusion). Data extraction included: author, year, design, sample characteristics, paradigm used, type of affective state manipulation, main findings (behavioral outcomes and EEG effects), and conclusions. Data were only extracted from the included studies. An analysis table was constructed for the organization of the results. The two judges filled out the table. The experimental tasks were analyzed according to the type of manipulation.

Fig. 1
figure 1

PRISMA flowchart 2020 diagram depicting article selection and screening process. Some articles met more than one exclusion criteria. *Some studies used the term ERN but were analyzing FRN

Assessment of risk of bias

Two reviewers (X.N.E) and (R.M.) independently assessed the methodological quality of eligible studies using the JBI critical appraisal checklist for quasi-experimental studies (Tufanaru et al., 2020). In case of disagreements between the two judges, a third judge was consulted for the final decision. The JBI methodological checklist consists of 9 items concerning a study’s validity and overall assessment where items are rated as: “yes,” “no,” “unclear,” “no applicable.” The overall appraisal determines if an article is included or excluded for the analysis (Tufanaru et al., 2020). Only studies with High quality and Acceptable quality were included in this review. Additionally, we decided to rate the overall methodological quality of the studies as follows: “High quality (+ +)”: Majority of criteria met, Little or no risk of bias; OR Acceptable ( +) Most criteria met. Some flaws in the study with an associated risk of bias.” Special attention was given to reporting of internal consistency of the ERN, power sample calculations, and appropriate statistical procedures.

Results

After performing searches of the databases and removing duplicates, 299 articles were obtained. Figure 1 shows the electronic search steps that led to 51 articles being included in the systematic review. The exclusions based on the title and abstract (n = 155) were usually studies with the clinical population; articles that did not use ERN; studies with fMRI; studies that did not analyze ERN and affective states, and theoretical articles or systematic reviews. The main results are presented below, according to the sample’s characteristics, paradigms and designs, behavioral effects, variables, and EEG effects. A description of tasks and designs employed in the selected studies can be found in Table 1. A summary of the key findings is displayed in Table 2.

Table 1 Description of paradigms and designs in studies included for review
Table 2 Summary of studies included in the systematic review sorted by affect state manipulation

Study and sample characteristics

The 51 investigations contained 54 experimental studies. They employed 2284 subjects, resulting in an average sample size of 42.2 participants. Sample sizes varied widely, from 16 to 121 participants. Thirty-nine studies (72.2%) had a sample with less than 50 subjects; 13 (24.1%) had a sample size between 50 and 100 participants, and only 2 (3.7%) employed more than 100 participants. Ten (19.6%) articles reported sample size calculations for their studies. Forty-one studies (75.9%) had their samples composed of undergraduate students, of which 14 (25.9% of the total) were explicitly performed with undergraduate students of Psychology courses. The other 13 studies (24.1%) were conducted with a nonacademic population or did not specify the participants’ nature.

Concerning sample age, ten studies did not report the mean age in the final sample. Among the 44 studies reported (n = 1,966), the participants’ age ranged from 18.6 to 35.8 years, resulting in an average of 21.6 years.

Regarding other characteristics, such as gender and racial group, one study did not report the proportion between men and women in its final sample. Among the 47 studies that reported (n = 2,038), 62.7% (1,222) of the participants were female, and 37.3% (n = 816) were male. Three studies had a final sample composed of 100% women, and only a few studies included ethnicity data.

Paradigms and Designs

We observed more than 10 paradigms used in the 51 papers. Moreover, the paradigms are widely used in ERN and cognitive control studies. The distribution of the number of papers by paradigms used was as follows:

Flanker task (24): Letter version, arrow version, social version, and modified version; Go/No go Task (9), Stroop Task (3): six choices, spatial; Simon task (3); Probabilistic learning Task (3), Weapons Identification Task (2), Emotional Stop Signal Task, A combination of flanker and Simon task (1), and studies used other inhibitory control tasks (8): MSIT (multisource interference task), Two-choice Speeded Task, Punished Inhibitory Control Task, Modified Learning task, Emotional Stop signal Task, AX-CPT task, Faces identification task, and Picture naming task. Twenty-one experiments used between-subjects designs, 24 used within-subjects, and 9 were between and within-subjects analyses. We found 27 experiments that used feedback to encourage better Reaction Times and were contingent on trials or performance. Ten of the studies did not use feedback in their paradigms, and 17 did not report. The main description of the paradigms, designs and use of feedback can be found in Table 1.

On the other hand, we pooled publications according to the used affect manipulations to compare methodologies and results’ studies. A summary of extracted data is displayed in Table 2. The following sections will review the results of the included studies.

Manipulation’s check

Of the 54 studies (in the 51 papers), 39 studies described at least one state manipulation check, and 37 studies reported the effectiveness of the manipulation used.

Regarding the affective induction, a variety of stimuli and procedures were used. Nine studies, sensory stimuli (tones/noises, respiratory occlusion, electrocutaneous stimulation) were applied to elicit affective states. Eight more studies used an emotional regulation strategy (emotional attendance, suppression, or Reappraisal), and six manipulated financial punishment or reward. Six studies presented variables related to mood induction procedures (achieved using music, movies, guided imagery, or text passages), and three studies used a placebo. Some others reported unsolvable math tasks, encouraging and derogatory feedback, being or not being evaluated during the task, the Trier Social Stress Test (TSST), and Socially Evaluated Cold-Pressor Test (SECPT).

The most frequent measures used to assess the participant’s affect were the State-Trait Anxiety Inventory for state anxiety (STAI) (7 studies); Subjective valence and arousal ratings (7 studies); Profile of Mood State Scale (POMS) (4 studies); Self-reported mood state (4 studies); Self-Assessment Manikin (SAM) (4 studies), the Positive and Negative Affect Schedule (PANAS) (2 studies); and Self-reporting scales for discomfort, anxiety (2). All the following measures were reported by only one study: Self-reported emotional involvement; Religious Zeal Scale; Situational Motivational Scale (SIMS); Anxiety Sensitivity Index (ASI); Intrinsic Motivation Inventory (IMI); Helplessness questionnaire; Self-reported wakefulness state; Salivary cortisol, blood pressure, and Heart rate.

Risk of bias

Of the 51 articles included, 9 was rated as high quality (+ +), with the remaining 42 rated as acceptable ( +) (Table 2.)

Behavioral effects

RTs and accuracy by trial type and behavioral outcome.

Twenty studies reported a congruency effect with slower RTs and less accuracy on incongruent or high conflict trials (Boksem et al., 2008; Cano Rodilla et al., 2016; Clayson & Larson, 2019; Clayson et al., 2012; de Bruijn et al., 2020; Elkins-Brown et al., 2016; Hajcak et al., 2005; Inzlicht & Gutsell, 2007; Larson et al., 2006, 2013; Maruo et al., 2016; Nigbur & Ullsperger, 2020; Pfabigan et al., 2013; Potts, 2011; Rodeback et al., 2020; Saunders et al., 2015; Tan et al., 2019; van Wouwe et al., 2011; Wang et al., 2014; Wiswede et al., 2009; Wiswede et al., 2009).

Regarding the behavioral outcome, 11 studies reported faster RTs on error trials (Hajcak et al., 2005; Hobson et al., 2014; Inzlicht & Al-Khindi, 2012; Meyer & Gawlowska, 2017; Olvet & Hajcak, 2012; Pfabigan et al., 2013; Riesel et al., 2012; Riesel et al., 2019b; Saunders et al., 2016; Wiswede et al., 2009; Wiswede et al., 2009), whereas 1 observed faster responses on correct trials (Compton et al., 2007). Finally, four studies in the Go/No go paradigm reported more commission than omission errors (Cano Rodilla et al., 2016; Elkins-Brown et al., 2018; Hobson et al., 2014; Inzlicht & Al-Khindi, 2012).

RTs and accuracy by conditions

Studies with groups and conditions differ along with findings by manipulations. Slower RTs and less accuracy were observed in five studies with punishment conditions (Leue et al., 2017; Meyer & Gawlowska, 2017; Potts, 2011; Riesel et al., 2012; Riesel et al., 2019b). Nevertheless, Saunders et al. (2015) observed higher accuracy in high conflict trials after punishment, whereas Maruo et al. (2016) reported higher accuracy for both reward and punishment than in control. Two studies found higher accuracy in the reward condition (Pailing & Segalowitz, 2004; Potts, 2011).

Regarding the negative affect manipulations, three studies found slower RTs in emotional faces condition (Sueyoshi et al., 2014), occlusion task (Tan et al., 2019), and with unpleasant pictures (Wiswede et al., 2009). However, two studies reported faster RTs to threatening visual stimuli (Senderecka, 2016, 2018). Three studies found better accuracy in the aversive conditions (Jackson et al., 2015; Senderecka, 2016, 2018).

After positive affect inductions, one study reported less accuracy in their smile faces condition (Wiswede et al., 2009), while the other found more accuracy in high conflict trials (van Wouwe et al., 2011). Studies that manipulated both negative and positive inductions reported better accuracy for neutral faces (Compton et al., 2007), slower RTs for emotionally arousing regardless of pleasant or unpleasant (Larson et al., 2006), and less accuracy following anxiety induction, especially on more difficult incongruent trials (Larson et al., 2013).

A considerable number of studies (n = 23) found no significant behavioral effects per condition or manipulation (Bakic et al., 2014; Boksem et al., 2008, 2011; Cano Rodilla et al., 2016; de Bruijn et al., 2020; Elkins-Brown et al., 2016, 2018; Ganushchak & Schiller, 2008; Glienke et al., 2015; Hajcak et al., 2005 (Experiments 1 and 2); Hobson et al., 2014; Inzlicht & Al-Khindi, 2012; Moser et al., 2005; Nigbur & Ullsperger, 2020; Olvet & Hajcak, 2012; Paul et al., 2017; Pfabigan et al., 2013; Rodeback et al., 2020; Tullett et al., 2015; Unger, et al., 2012; Valt et al., 2017; White et al., 2018).

Post-Error Slowing

Eighteen studies reported the presence of post-error slowing (PES) effect (Boksem et al., 2011; de Bruijn et al., 2020; Larson et al., 2013; Maruo et al., 2016; Meyer & Gawlowska, 2017; Nigbur & Ullsperger, 2020; Olvet & Hajcak, 2012; Paul et al., 2017; Pfabigan et al., 2013; Riesel et al., 2012; Riesel et al., 2019b; Rodeback et al., 2020; Stürmer et al., 2011; Sueyoshi et al., 2014; Wiswede et al., 2009; Wiswede et al., 2009; Wiswede et al., 2009). Specific PES effects were observed. Maruo et al. (2016) and Stürmer et al. (2011) reported that this effect was more pronounced in the reward condition or rewarded blocks. On the contrary, three studies found increased PES during punishment conditions (Meyer & Gawlowska, 2017; Riesel et al., 2012; Riesel et al., 2019b).

EEG Acquisition and ERPs Reduction

Methods and technologies to record EEG and calculate ERPs were diverse. Electrode numbers fluctuated between 6 to 129, with 128, 64, and 32 channel systems being the most frequent across studies (n = 23), distributed according to the 10–20 system. Most studies chose Biosemi Active Two system (Biosemi Inc.) and Electrical Geodesics System, including high-density arrays, to get ERPs components. For ocular movement artifacts correction, most of the studies (n = 21) used the procedure described by Gratton et al. (1983), Coles et al. (1988) and remotion with the independent component analysis (ICA) algorithm (18 studies). Components were quantified employing two measures: 39 studies selected mean amplitude (the most used brain activity measure); 9 used peak amplitude and latency; and 6 used peak-to-peak amplitude. Several authors across the studies cited the work of Luck (2005) and Clayson et al. (2013) as their primary criterion of choice for mean amplitude, as they identify this measure as a more reliable ERP measure than the peak amplitude. ERN amplitudes were mostly averaged across frontocentral electrode sites FCz, Fz, and Cz. Criteria to select EEG sites for analysis, electrodes, and latency ranges were due to the topographic properties of the dataset, previous works, and visual inspection of the grand-average waveform post response. A summary of the locations and time windows used in the reviewed studies is displayed in Table 2.

EGG effects

We reviewed 54 experimental studies of the 51 articles selected. Thirty-two studies reported significant effects of the state affect inductions on the ERN amplitudes, and 20 studies could not find significant effects or associations for the overall sample. However, 9 of the 20 studies that did not find significant effects for the overall sample by manipulation also assessed traits and reported moderation effects on the ERN. The following sections will review the results and the direction of the effects of the included studies, organized by experimental manipulation. Experimental manipulations with a negative valence of stimuli or aversive mood inductions were categorized as “negative affect.” Likewise, manipulations with a positive valence or mood inductions were labeled as “positive affect.” Some studies included both types of manipulations and were categorized as “negative and positive affect.”

An individualized category was created for manipulations with motivational factors, emotion regulation strategies, and state-traits interactions. Finally, some of the studies could appear in more than one category due to multiple variables manipulated (i.e., manipulation of the valence stimuli and motivational factors; studies containing two experiments with different manipulations). A summary of the extracted data is presented in Table 2.

Negative affect (18 studies)

Of the nine studies that reported significant effects, 8 studies identified enhanced ERN/ΔERN amplitudes manipulating negative affective states through mistakes that harmful others (de Bruijn et al., 2020), evaluation of performance (experiment 2) (Hajcak et al., 2005), unpredictable tones (Jackson et al., 2015), aversive respiratory occlusions (Jelinčić et al., 2020; Tan et al., 2019), feelings of helplessness (Pfabigan et al., 2013), randomness in readings (Tullett et al., 2015), and negative valence pictures (IAPS) (Wiswede et al., 2009). One study reported reduced amplitudes throughout a negative valence manipulation (angry faces) (Valt et al., 2017).

Conversely, nine studies did not find significant effects on the ERN for the overall sample (Glienke et al., 2015; Leue et al., 2017; Moser et al., 2005; Olvet & Hajcak, 2012; Rodeback et al., 2020; Senderecka, 2016, 2018; Sueyoshi et al., 2014; White et al., 2018). However, Sueyoshi et al. (2014) found positive correlations between the ERN and interoceptive sensitivity in disgusted face stimuli. Also, three of these studies reached significance when trait interactions were taken into account (Leue et al., 2017; Olvet & Hajcak, 2012; White et al., 2018), highlighting the relevance of looking for these changes in experimental manipulations of affect. (These results will be described in more detail in the state-trait interactions section.)

Positive Affect (5 studies)

Out of the five studies that manipulated positive affect, two studies reported significantly enhanced ERN through happy mood inductions (Bakic et al., 2014; Nigbur & Ullsperger, 2020), while two studies reported dampened ERN when manipulating movie clips (van Wouwe et al., 2011), and induced smile expressions (Wiswede et al., 2009). One study did not obtain significant main effects or interactions (Paul et al., 2017).

Negative and Positive Affect (6 studies)

Of the six studies with negative and positive affect manipulations, four observed enhanced ERN with different outcomes regarding the manipulation. Boksem et al. (2011) identified larger ERN for the disgusted faces condition, while Larson et al. (2006) for pleasant images. Clayson and Larson (2019) compared paradigms of previous studies: the overlaid flanker (Larson et al., 2006) and the interspersed flanker (Wiswede et al., 2009) and found enhanced amplitudes for high arousal images in an interspersed flanker task but dampened amplitudes in the overlaid task for more pleasant images (effect of valence). One study reported dampened ΔERN after less anxiolytic religious primes (Good et al., 2015), and two studies reported no significant effects on the ERN only due to mood (Larson et al., 2013) and stimulus emotionality manipulation (faces) (Compton et al., 2007). Nevertheless, Compton et al. (2007) observed an unexpected interaction effect between manipulation and state anxiety, with high anxious individuals showing smaller ERN for angry faces blocks and larger ERN after happy faces blocks.

Emotion regulation strategies (8 studies)

Six studies reported significant effects of emotion regulation manipulation with decreased overall ERN amplitudes (Hobson et al., 2014; Inzlicht & Al-Khindi, 2012; Inzlicht & Gutsell, 2007; Levsen & Bartholow, 2018; Saunders et al., 2016; Wang et al., 2014), suggesting that these regulatory strategies reduce affective reactivity and impact cognitive control through dampening the ERN. These studies used a down-regulation reappraisal condition (Hobson et al., 2014), suppression and reappraisal (Wang et al., 2014), meditation techniques (Saunders et al., 2016), and misattribution of the arousal paradigm (Inzlicht & Al-Khindi, 2012). The study by Levsen and Bartholow (2018) was composed of two experiments with informative moderators and interactions. In the first, a within-subjects analysis showed that only Reappraisal affected the ERN amplitudes. While in experiment 2, a between subjects’ analysis showed that both Reappraisal and Suppression diminished the ERN. The authors highlighted the importance of using between-subjects design when testing the effect of emotion regulation strategies to avoid cofounding effects. Another interesting finding from this study was the interaction effect observed between the used strategy and the black prime. In both experiments, larger ERNs were reported for Reappraisal and Suppression but only for black prime trials.

Finally, one study identified an enhanced ERN for the emotion-focused group (Saunders et al., 2016), and two studies found no significant effects on the ERN (Cano Rodilla et al., 2016; Elkins-Brown et al., 2018). Cano Rodilla et al. (2016) and Elkins-Brown et al. (2018) were replications of the misattribution approach by Inzlicht and Al-Khindi (2012), and both failed to replicate the reduced amplitude of the ERN due to misattribution of arousal procedure, pointing out the original study as a possible false positive. However, it is relevant to note that these studies reported failing to manipulate state affect, so the data cannot provide evidence for or against the affective properties of the ERN.

Motivational Factors: Punishment and reward (16 studies)

Of the 16 studies that manipulated motivational factors, 10 found significant effects of condition on the ERN (Ganushchak & Schiller, 2008; Hajcak et al., 2005, experiment 2; Leue et al., 2017; Maruo et al., 2016; Potts, 2011; Riesel et al., 2012; Saunders et al., 2015; Stürmer et al., 2011; Unger et al., 2012; Wiswede et al., 2009). Eight studies reported larger ERN amplitudes under punishment or aversive feedback (Ganushchak & Schiller, 2008; Hajcak et al., 2005, experiment 2; Leue et al., 2017; Potts, 2011; Riesel et al., 2012; Saunders et al., 2015; Unger et al., 2012; Wiswede et al., 2009). One of these studies used a failure induction in two experiments investigating the effects of failure feedback and threatening self-worth (intelligence abilities) in deterministic and probabilistic conditions (Unger et al., 2012). The authors reported an enhancement of the ERN amplitudes resulting from the failure manipulation in the two learning conditions and decreasing ERN for individuals in the no-failure induction consequently (Unger et al., 2012).

Larger ERN amplitudes were also found in reward and punishment conditions (Maruo et al., 2016) and the reward blocks (Stürmer et al., 2011). In contrast, six studies reported no significant effects of salience and motivational manipulations (punishment or aversive feedback, and reward) on the ERN or ΔERN amplitudes (Clayson et al., 2012; Elkins-Brown et al., 2016; Meyer & Gawlowska, 2017; Pailing & Segalowitz, 2004; Riesel et al., 2019b). However, these studies reported trait interactions effects that are accounted for in the state-trait interactions section. The study conducted by Clayson et al. (2012) intended to replicate a previous study (Wiswede et al., 2009) that reported significant effects on the ERN amplitude but had a small sample composed only of 12 females. As mentioned, Clayson et al. (2012) failed to replicate those findings.

Sate-trait interactions (10 studies)

Nine studies reported a moderator effect of trait by enhancing the ERN amplitudes. These effects were reported depending on the experimental manipulation and the assessed trait. For instance, four studies observed an interaction effect in a punishment condition for high anxious individuals (Meyer & Gawlowska, 2017; Riesel et al., 2012; Riesel et al., 2019b), and high sensitivity to punishment (BIS) (Boksem et al., 2008). Specifically, Riesel et al. (2012) found significant differences by punishment for the overall sample, but further analysis corroborated that anxiety trait was moderating this effect with larger ERN amplitudes for punishment only in high anxious individuals.

Likewise, other state-trait interactions were reported. For instance, Olvet and Hajcak (2012) found no significant associations just by the mood induction, but high neuroticism moderated the relationship between mood changes (i.e., increase sadness) and Δ ERN. Regarding manipulations of the valence of the stimuli, Leue et al. (2017) also reported an emerged interactions between trait (lower BIS) and manipulation of valence. The authors found larger amplitudes for lower BIS individuals in the fearful faces condition (Leue et al., 2017). Other studies observed enhanced ERN associated with high reward drive responsiveness (Clayson et al., 2012), on high neuroticism and low conscientiousness on motivated trials with high monetary reward (Pailing & Segalowitz, 2004), and in high worry individuals that were exposed under uncertain evaluative feedback (White et al., 2018).

Conversely, one study that assessed trait affect under an emotion regulation manipulation did not find significant moderation effects on the ERN (Elkins-Brown et al., 2018).

Discussion

Overall findings

This study aimed to systematically review the literature searching for effects of affective state manipulations on the ERN event-related potential component. Of the 54 experiments reviewed, 34 (~ 63%) studies reported significant effects on the ERN solely by state affect manipulation. From the 20 studies that reported no significant effects by manipulation in the overall sample, 9 studies observed that trait affect significantly moderated the interactions with state affect. Notably, the most mixed results found in our review were seen in our label categorized as “negative affect.” Half of the studies reported no significant effects only by state manipulation. Interestingly, as previously stated, some of these studies assessed traits emphasizing emerging effects due to those state-trait interactions (Olvet & Hajcak, 2012; White et al., 2018).

Nevertheless, within the studies reporting modulations of the ERN component, amplitudes tended to be increased in negative affect manipulations and punishment and reduced with positive affect, and interventions that lessen negative affect. Although we observed heterogeneity across the studies that hinder comparisons between findings, the results from different manipulations suggested the sensitivity of the ERN to state affect contexts. It seems that this event-related component does not purely reflect the mismatch of goal and actual responses; instead, it can be informative about transient affective and motivational factors.

However, relevant issues emerged from studies that did not obtain significant effects. For instance, two studies (Cano Rodilla et al., 2016; Elkins-Brown et al., 2018) were replications of the Inzlicht and Al-Khindi (2012) paradigm, which had found a significant effect of misattribution of arousal on the ERN. These null findings highlighted the following important aspects: (a) the replication attempts used higher statistical power (more representative samples), with nonreplication findings confirmed by Bayesian Statistics; and (b) the possibility of a false positive of the original article (Inzlicht & Al-Khindi, 2012). Similarly, by including a larger sample, Clayson et al. (2012) failed to replicate the results from Wiswede et al.,(2009), and Clayson and Larson (2019) observed opposite patterns from the replicated studies (Larson et al., 2006; Wiswede et al., 2009). These results represent an opportunity to show the relevance of the high-powered replications of the ERN paradigms, methodologies, and theories.

Caution should be taken regarding significant effects reported in EEG low-powered studies, especially when treated as solid evidence of the nature of psychophysiological phenomena, such as the ERN (Button et al., 2013; Clayson, 2020; Gelman & Carlin, 2014). In our review, 72.2% of the studies had a sample with less than 50 subjects, 2 studies used more than 100 participants (Good et al., 2015; Larson et al., 2013), and 10 (19,6%) articles reported sample size calculations for their studies. Furthermore, 75.9% of the reviewed studies were conducted with samples composed of undergraduate students, and only a few reported ethnicity data. This fact leads us to the WEIRD (Western, educated, industrialized, rich, and democratic) sample problem (Henrich et al., 2016). Although empirical research publications usually report the characteristics of their samples as a limitation, few studies identified their target population adequately, and the risk of sampling bias remains despite the data collection issues (Muthukrishna et al., 2020). Nonetheless, the notable percentage of studies using undergraduate samples also can be seen as a strength in the homogeneity that allows future replications.

Importantly, given that widely different paradigm tasks were used to seek state affect effects on the ERN as a dependent variable, some methodological issues need to be acknowledged when considering the conflicting findings across studies. For instance, moderator factors, such as paradigm task, performance feedback, and task stimuli, have been shown to influence the effects and magnitude of the ERN in nonclinical and clinical samples (Gehring et al., 1993; Gloe & Louis, 2021; Riesel et al., 2019a). Our review identified 15 different paradigms to elicit the ERN; whereas the Flanker Task was the most frequent, at least 5 different arrays and modifications were used between and within subjects’ design. The heterogeneity of task parameters and methods limits the generalizability of results (Clayson et al., 2021) and leaves undetermined variability of the ERN unknown (Gloe & Louis, 2021).

Moreover, we explored the state manipulation check and its effectiveness. In our review, most of the studies presented a manipulation check with the STAI for state anxiety and Subjective valence/arousal ratings to be the most applied. Thirty-seven studies had a successful state affect induction, and in two, the manipulation did not influence affect (Cano Rodilla et al., 2016; Elkins-Brown et al., 2018); as a result, their data could not test the emotional properties of the ERN. However, usually, measurements were collected only after manipulation. Checking the effectiveness of manipulation is crucial, as it is the necessity of measurement before (when applicable) and after the experimental induction in order to establish the ERN state-dependent (or independent) characteristics.

What do the directions of the ERN say about state affect manipulations?

Knowing that errors are aversive endogenous events and that error commissioning is typically accompanied by negative emotions (Hajcak et al., 2004, 2012; Spunt et al., 2012); the aversive nature of errors and the emotional costs to the individual gave rise to the hypothesis that affective and motivational dimensions can influence performance monitoring (Maier & Steinhauser, 2016; Saunders et al., 2017). Following this line of thought, if the ERN reflects the valence (aversive) of performance monitoring processes, then negative affect and punishment manipulations would enhance the ERN amplitudes (Aarts et al., 2013; Wiswede et al., 2009). Likewise, manipulations, including positive affect, reward, and emotion regulation strategies, to lessen negative distress would prevent this aversive effect by dampening the ERN (van Steenbergen et al., 2015; Wiswede et al., 2009).

Supporting this view, 18 studies from our review found larger ERN amplitudes following negative affect and punishment, and 9 found smaller amplitudes after positive affect and emotion regulation manipulations. These results are in line with evidence of consistent integration of negative affect and cognitive control in the midcingulate cortex (MCC) (Braem et al., 2017; Shackman et al., 2011), the structure where the ERN is thought to be generated (Iannaccone et al., 2015; Ridderinkhof et al., 2004; Shenhav et al., 2016). In addition, positive correlations of the ERN amplitudes and aversive bodily sensations/interoception support the aversiveness of error commission reflected by the ERN (Jelinčić et al., 2020; Sueyoshi et al., 2014; Tan et al., 2019). However, the interpretation of the directions of these effects must be made considering the paradigms, designs, and variables compared.

Although, in general, manipulations of negative affect and negative valence stimuli were related to larger ERN amplitudes, some studies reported reduced ERN (Compton et al., 2007; Valt et al., 2017) and enhanced ERN for positive valence images (Larson et al., 2006) and in positive mood manipulations (Bakic et al., 2014; Nigbur & Ullsperger, 2020). Reconciling these findings is a difficult task. However, these differences in outcomes may be interpreted considering the impact of valence and arousal in attentional processes (Clayson & Larson, 2019; Saunders et al., 2017; Weinberg et al., 2012), the differences in cognitive and physiological costs of emotion regulation (Saunders et al., 2016; Wang et al., 2014) the affective context of manipulations (if related or unrelated to the task: prior or superimposed) (Larson et al., 2013; Wiswede et al., 2009), the different nature of the task and its specific effects (Riesel et al., 2013; Suzuki et al., 2020), and the response mode (Gloe & Louis, 2021; Senderecka, 2018).

The directions of the ERN when manipulating the stimuli valence or affect induction may vary in function of the context manipulation and motivational significance of errors (Bakic et al., 2014; Clayson & Larson, 2019). Therefore, different directions should not be interpreted as isolated events on a unidimensional line from positive–negative affective states (Nigbur & Ullsperger, 2020). Studies in our review found larger ERN amplitudes for the rewarded blocks, for both motivational conditions (reward and punishment) (Maruo et al., 2016; Stürmer et al., 2011), and in emotional regulation strategies when errors underly racial bias (i.e., highly salient errors are more difficult to regulate) (Levsen & Bartholow, 2018). These results are consistent with the notion that affective/motivational influences are not linear and that transient affect can dynamically alter error monitoring (Larson et al., 2013).

Furthermore, the divergent findings may also be understood within the Reinforcement Learning Theory (Holroyd & Coles, 2002). In their theoretical framework, the authors state that the ERN is a reward prediction error signal; in other words, a signal that carries information about expectancy violation. Enhanced or reduced amplitudes would emerge depending on the affective context and the existing discrepancy between predicted and actual rewards. For instance, positive mood may increase reward prediction error when the probability of getting a reward is high (Bakic et al., 2014) or with task-irrelevant pleasant backgrounds during error commission (Larson et al., 2006).

In general, the reviewed studies that found affective and motivational influences on the ERN support the hypothesis that the ERN modulations account for the expectancy violation (Holroyd & Coles, 2002) and the specific value or motivational saliency of errors (Hajcak & Foti, 2008; Hajcak et al., 2005). However, findings with no influences of affective manipulations and non-replication attempts suggest that state affect accounts only for some variability in the ERN. Likewise, informative results from emerging interaction trait effects in studies that found no significant effects solely by state affect manipulation need to be addressed.

State-trait interactions

Nine studies found moderator effects of trait on the ERN throughout the different state manipulations. Interestingly, all studies reported enhanced ERN amplitudes for the assessed traits. High anxiety, worry, neuroticism, and BIS moderated the effects in negative affect and punishment manipulations, except in one study where the interaction effect was with lower BIS trait (Leue et al., 2017). These findings are part of a growing body of evidence that explains the ERN (generation and modulation) by stable levels of trait personality and individual differences (Hajcak, 2012; Riesel, 2019; Weinberg & Hajcak, 2011). Analysis by trait allowed recognition of subtle state affect differences that could not be detected in the overall sample (Pailing & Segalowitz, 2004) and explained the significant differences in state manipulation within the sample (Riesel et al., 2012). Thus, individual differences are relevant when examining the effects of state manipulation, as trait affect is a moderator variable in state or motivational changes (Dikman & Allen, 2000; Luu et al., 2000; Masaki et al., 2017).

In this sense, state manipulation effects would be conditional to the moderator effect of trait personality; therefore, studies would be dealing with unaccounted ERN variability if there is no trait measurement. Future studies should include trait measures in their analysis as variability in the ERN might depend on affective/motivational states and dispositional traits (Hajcak & Foti, 2008; Leue et al., 2017; Olvet & Hajcak, 2012; Park & Kitayama, 2014). Especially in light of recent meta-analytic evidence suggesting that trait findings may be more heterogenous and influenced by publication bias (Saunders & Inzlicht, 2020) and that other moderators may be affecting the sizes effects of the ERN (Clayson et al., 2021; Gloe & Louis, 2021).

Limitations of the present study

While the review findings suggest that the ERN may be sensitive to state-specific affect manipulations, heterogeneity of the paradigms and variables studied (i.e., tasks employed, performance feedback, designs) and a considerable number of studies with nonmodulations of the ERN made it challenging to draw more decisive conclusions. Likewise, most of the reviewed studies had a sample composed of less than 50 participants with no precise sample size calculations. We identified that some higher power studies could not replicate previous findings that reported significant effects with lower power samples. In addition, caution should be taken concerning our conclusions since they are based on qualitative analysis, and the effect sizes were not examined. Therefore, future studies should include similar variables and consider the possibility of conducting a meta-analytic review of the affect states’ manipulations and their effects on the ERN to unravel the nature of their associations.

Recommendations and future directions

This systematic review identified the need for further research in state affect-related changes and the error-related negativity. Our results showed that future studies should increase their statistical power due to nonreplication findings with higher sizes samples (Cano Rodilla et al., 2016; Elkins-Brown et al., 2018; Larson et al., 2013). To facilitate future high-power replication attempts in this area, it is relevant to explicitly describe the protocol: variables, scales, and methodologies, such as detail or explicit justification of the sample sizes, procedures, and statistical tools. Furthermore, studies should include evidence of complete preregistration (description of the content of the hypothesis, primary analysis, and exclusion criteria determined before data collection). We observed that it is not common to report statistical power estimations for sample size and effect sizes, the final sample’s complete characteristics (i.e., mean age, standard deviation, gender, ethnicity), and the internal consistency of ERN scores. Although inferring the reliability of the ERN from previous studies is a conventional practice, ERN scores’ internal consistency is dependent on studies’ contextual factors and should be reported based on each study (Clayson, 2020). As reproducible research is needed, adequate reporting behavior across studies is vital in human electrophysiological studies (Clayson et al., 2019; Larson & Carbine, 2017).

In addition, we identified that most studies tended to report marginally statistical effects, especially when reporting trends. This flexibility is problematic, because it could mislead the reader to an interpretation of effects that have low evidential value (false positives) (Pritschet et al., 2016), and not less important, they are associated with lower reproducibility (Olsson-Collentine et al., 2019). In consequence, researchers should avoid this practice. Likewise, state induction validation is essential in determining the evidence for/against the state affect-independent properties of the ERN (Cano Rodilla et al., 2016; Elkins-Brown et al., 2018). Future studies manipulating state affect should always register the effectiveness of the affect’s inductions throughout a manipulation check (Rodeback et al., 2020).

Regarding affective manipulations, we found conflicting results in the modulations of the ERN with positive valence stimuli and positive mood manipulations. In general, more studies with positive affect are needed since its specific relationship with the ERN remains unclear. Finally, future studies should include an assessment of trait in their samples. In line with previous and recent research, trait has been identified as a crucial moderator factor regarding the state affect relations with the ERN. Thus, this aspect should not be ruled out of the analysis if the objective is to account for the ERN variability in affective state manipulations.

Conclusions

We extracted 51 articles exploring the effects of state affective manipulations on the ERN. Findings with significant effects indicated a tendency towards the sensitivity of the ERN to state-related affect manipulations, and more specifically, results support the motivational value and aversiveness of errors. Likewise, manipulations of increasing negative affect or punishment usually led to an enhancement of the ERN, while manipulations that lessen negative affect (i.e., emotion regulation) or with positive affect tended to reduce the ERN. However, conflicting results with different effects, null findings in recent replications, and trait as crucial moderator factor informed the complex relationship between electrophysiological indices of error processing, individual differences, and state affective manipulations. Future research is needed with more statistical power and replicability on state-related manipulations.