The electrocortical response to rewarding and aversive feedback: The reward positivity does not reflect salience in simple gambling tasks

doi:10.1016/j.ijpsycho.2017.11.015

International Journal of Psychophysiology

Volume 132, Part B, October 2018, Pages 262-267

https://doi.org/10.1016/j.ijpsycho.2017.11.015 Get rights and content

Highlights

•
RewP amplitude was more positive to desirable outcomes than undesirable outcomes.
•
Safety from shock elicited a RewP comparable in magnitude to monetary gain.
•
PCA revealed the RewP peaked earlier in the aversive vs. rewarding task versions.
•
Findings support reward prediction error models of the RewP.

Abstract

The Reward Positivity (RewP) is an event-related potential (ERP) potentiated to monetary gains and reduced to monetary losses. Recently, competing data suggest that more salient outcomes elicit a positivity relative to less salient outcomes, regardless of valence. However, all previous work testing the impact of salience on the RewP have examined expected versus unexpected outcomes. In the current study, participants completed the same gambling task twice in which feedback was equally probable: in one condition, feedback indicated monetary gain or loss—and in the other condition, feedback indicated either safety or punishment from subsequent electric shock. Traditional ERP and principal component analysis (PCA)-derived measures confirmed that the RewP was more positive to feedback signaling monetary gain and safety from shock compared to feedback signaling monetary loss and punishment with shock. These results align with models in which the RewP indexes reward-related processes, including reward prediction error models. Potential explanations for salience-based effects on the RewP are discussed.

Introduction

For the past 20 years, ERP researchers have increasingly focused on the differentiation between positive and negative feedback to understand reward processing and learning (Miltner et al., 1997, Krigolson, n.d). Across time estimation (Miltner et al., 1997, Becker et al., 2014), reinforcement learning (Baker and Holroyd, 2008, Holroyd et al., 2011), and simple gambling tasks (Gehring and Willoughby, 2002, Holroyd et al., 2004, Holroyd et al., 2006, Proudfit, 2015), studies have consistently observed a relative negativity that peaks approximately 300 ms following feedback indicating bad compared to good outcomes. This relative negativity has been referred to as the feedback error-related negativity (Miltner et al., 1997, Holroyd and Coles, 2002, Holroyd et al., 2006, Nieuwenhuis et al., 2004), feedback negativity (Yeung and Sanfey, 2004), feedback related negativity (Cohen et al., 2007, Hajcak et al., 2006, Liu et al., 2014), and the medial frontal negativity (Gehring and Willoughby, 2002). More recent accounts suggest that this negativity may be a N200 to unexpected events that require increased need for cognitive control (Holroyd, 2004, Holroyd et al., 2008), and that this N200 is suppressed by a reward-sensitive positivity on reward trials (Holroyd et al., 2008). When conceptualized as a relative positivity following reward, several authors have suggested naming the ERP accordingly, either as the feedback correct-related positivity or the reward positivity (RewP; Holroyd et al., 2008, Proudfit, 2015).

Several lines of evidence suggest rewards drive the ERP difference between positive and negative feedback, including experimental manipulations (Holroyd et al., 2006, Holroyd et al., 2008, Kujawa et al., 2013), principal components analysis (PCA) of the ERP waveform (Foti et al., 2011, Weinberg et al., 2014, Liu et al., 2014, Carlson et al., 2011), and correspondence of the RewP to both reward-related behavioral (Bress and Hajcak, 2013) and neural measures derived from fMRI (Carlson et al., 2011, Becker et al., 2014, Foti et al., 2014). Collectively, these data suggest a positive potentiation in the ERP following rewards that is reduced or absent on non-reward trials.

Functionally, the RewP is thought to reflect a reward prediction error signal, which codes whether outcomes are better or worse than expected (Holroyd and Coles, 2002, Holroyd et al., 2008, Walsh and Anderson, 2012, Sambrook and Goslin, 2015). Consistent with this view, the RewP is larger when rewards are unexpected (Holroyd et al., 2011) and larger in magnitude (Sambrook and Goslin, 2015). While there is much evidence to suggest that the RewP is a reward-related modulation of the ERP, recent studies have provided evidence for the possibility that the RewP instead reflects a salience prediction error (SPE) signal. That is, the RewP may instead differentiate high- from low-salience events, regardless of valence. In this view, rewards might elicit a RewP because reward is more salient than non-reward.

In particular, two studies have found more positive ERP responses to feedback indicating aversive outcomes relative to feedback signaling the omission of aversive outcomes (Soder and Potts, n.d, Talmi et al., 2013). In terms of their experimental design, both studies presented participants with an initial cue that induced expectations regarding the likelihood of the outcome on each trial; following this cue (S1), participants were presented with feedback (i.e., the S2) that indicated expected or unexpected reward, or with feedback that indicated an expected or unexpected punishment (i.e., electric shocks in Talmi et al., 2013; noise blasts in Soder & Potts, this issue). Both Talmi and colleagues, as well as Soder and Potts, found that the S2 indicating unexpected reward elicited a positivity in the waveform relative to unexpected non-reward; however, both studies also found that the S2 signaling unexpected punishment also elicited a positivity relative to unexpected punishment omission (Talmi et al., 2013, Soder and Potts, n.d). The notion that unexpected punishment would elicit a RewP is inconsistent with reward-related accounts and suggests instead that a RewP may be elicited by salient outcomes.

Heydari and Holroyd (2016) have reported competing findings from a study in which participants navigated a virtual T maze and received feedback in rewarding and aversive conditions. Feedback indicated absence or presence of monetary reward in the rewarding condition, and absence or presence of small shock in the aversive condition. They found the RewP to be more positive to feedback indicating receipt of monetary reward as compared to its omission, and to feedback indicating omission of shock relative to impending shock. Thus, this study utilized a similar paradigm to those from Talmi et al. (2013) and Soder & Potts (current issue) by employing rewarding and aversive conditions, however, their results demonstrated the RewP tracked feedback valence rather than salience.

In the studies from Talmi et al. (2013) and Soder and Potts (current issue), the S1–S2 design was used to induce expectations regarding outcomes. However, participants never made choices—there was no response requirement in either the Talmi et al., or Soder and Potts experiments. This is particularly relevant given the fact that experimental results suggest that the RewP is maximized by feedback that follows volitional choice (Walsh and Anderson, 2012, Yeung and Sanfey, 2004). Moreover, many studies that have examined the RewP do so in the context of simple guessing tasks in which reward and loss are equiprobable on each trial (Gehring and Willoughby, 2002, Holroyd et al., 2004, Holroyd et al., 2006, Proudfit, 2015).

The current study employed a simple guessing task and within-subject design to examine whether feedback that signaled impending shock or safety would elicit a RewP. Subjects were administered two identical versions of a guessing task: a monetary version in which choices led to either monetary gain or loss—and an aversive version in which choices led to either safety from shock or punishment with shock. In this way, we employed identical features as Talmi and colleagues and Soder and Potts, however, feedback followed participant choices and were equiprobable on each trial. Traditional and principal component analysis (PCA)-derived factors were analyzed to assess the impact of outcome on ERPs. If the more rewarding outcomes (i.e., monetary gain and safety from shock) elicit a relative positivity compared to non-rewarding outcomes (i.e., monetary loss and punishment), the data would support the role of the RewP in reward-related process. If more salient outcomes (i.e., monetary gain and punishment) elicit a positivity relative to less salient outcomes (i.e., monetary loss and safety from shock), the data would support the SPE model and sensitivity of the RewP to salient outcomes.

Section snippets

Participants

Forty-one undergraduates from the introduction to psychology subject pool at Stony Brook University participated for course credit. The sample was college-aged (M = 20 years, SD = 3.70), 65.8% female, and ethnically diverse, including 38.1% Caucasian, 33.3% Asian, 14.3% Black, and 4.8% Latino. Demographic information was obtained through an initial screening e-mail. Informed consent was obtained prior to participation and the research protocol was approved by the Institutional Review Board at Stony

RewP

A 2 (outcome: best outcome [gain/safety], worst outcome [loss/punishment]) × 2 (task: money, shock) repeated measures ANOVA on mean activity from 250 to 350 ms following feedback at Cz confirmed that the ERP was more positive following desirable outcomes (i.e., gain and safety feedback; M = 17.71, SD = 10.11) than undesirable outcomes (i.e., loss and punishment feedback; M = 13.19, SD = 11.13; F(1, 40) = 19.12, p < 0.001, η_p² = 0.33). The main effect of outcome is depicted in the ERP waveforms in Fig. 1. There

Discussion

The current study examined traditional ERP and PCA-based scores in the time window of the RewP to feedback indicating monetary gains and losses, as well as to feedback indicating safety and punishment, to determine whether more rewarding or more salient outcomes elicit the RewP. Consistent with previous studies on the RewP, when examining both the traditional ERP- and PCA-based scores, monetary gains compared to losses were associated with a relative positivity that peaked around 300 ms at

References (39)

J.M. Carlson et al.
Ventral striatal and medial prefrontal BOLD activation is correlated with reward-related electrocortical activity: a combined ERP and fMRI study
NeuroImage
(2011)
M.X. Cohen et al.
Reward expectation modulates feedback-related negativity and EEG spectra
NeuroImage
(2007)
J. Dien
The ERP PCA toolkit: an open source program for advanced statistical analysis of event-related potential data
J. Neurosci. Methods
(2010)
J. Dien et al.
Optimizing principal components analysis of event-related potentials: matrix type, factor loading weighting, extraction, and rotations
Clin. Neurophysiol.
(2005)
D. Foti et al.
Reward dysfunction in major depression: multimodal neuroimaging evidence for refining the melancholic phenotype
NeuroImage
(2014)
G. Gratton et al.
A new method for off-line removal of ocular artifact
Electroencephalogr. Clin. Neurophysiol.
(1983)
G. Hajcak et al.
The feedback-related negativity reflects the binary evaluation of good versus bad outcomes
Biol. Psychol.
(2006)
C.B. Holroyd et al.
The good, the bad and the neutral: electrophysiological responses to feedback stimuli
Brain Res.
(2006)
O.E. Krigolson et al.
Cognitive load impacts error evaluation within medial-frontal cortex
Brain Res.
(2012)
S. Nieuwenhuis et al.
Reinforcement-related brain potentials from medial frontal cortex: origins and functional significance
Neurosci. Biobehav. Rev.
(2004)

M.M. Walsh et al.

Learning from experience: event-related potential correlates of reward processing, neural adaptation, and behavioral choice

Neurosci. Biobehav. Rev.

(2012)

A. Weinberg et al.

Show me the money: the impact of actual rewards and losses on the feedback negativity

Brain Cogn.

(2014)

T.E. Baker et al.

Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-maze

Cereb. Cortex

(2008)

M.P. Becker et al.

A single-trial estimation of the feedback-related negativity and its relation to BOLD responses in a time-estimation task

J. Neurosci.

(2014)

J.N. Bress et al.

Self-report and behavioral measures of reward sensitivity predict the feedback negativity

Psychophysiology

(2013)

J. Dien

Evaluating two-step PCA of ERP data with geomin, infomax, oblimin, promax, and varimax rotations

Psychophysiology

(2010)

D. Foti et al.

Differentiating neural responses to emotional pictures: evidence from temporal-spatial PCA

Psychophysiology

(2009)

D. Foti et al.

Event-related potential activity in the basal ganglia differentiates rewards from nonrewards: temporospatial principal components analysis and source localization of the feedback negativity

Hum. Brain Mapp.

(2011)

W.J. Gehring et al.

The medial frontal cortex and the rapid processing of monetary gains and losses

Science

(2002)

Cited by (31)

Distinct influence of inter- versus intra-trial feedback on the brain response to subsequent feedback: Evidence from event-related potentials
2023, Biological Psychology
Substantial evidence indicates that feedback processing not only varies with the valence of feedback, but is also highly dependent on contextual factors. Even so, the influence of prior outcome history on current outcome evaluation is far from clear. To investigate this issue, we conducted two event-related potential (ERP) experiments using a modified gambling task whereby each trial was associated with two consequences. In experiment 1, two instances of feedback indicated participant performance on two dimensions of a single decision, within a trial. In experiment 2, participants made two decisions in each trial, and then received two instances of feedback. We examined the feedback-related negativity (FRN) as an index of feedback processing. When both instances of feedback were relevant to the same trial (intra-trial), the FRN to the second was affected by the valence of the immediately previous feedback: The FRN was amplified to losses following wins. This was observed in both experiment 1 and experiment 2. When two instances of feedback were relevant to two different trials (inter-trial), the effect of immediately previous feedback on the FRN was inconsistent. In experiment 1 there was no effect of feedback from the previous trial on the FRN. However, in Experiment 2 there was an effect of inter-trial feedback on the FRN that was opposite to the effect of intra-trial feedback: The FRN was amplified when losses followed losses. Taken together, the findings suggest that the neural systems involved in reward processing dynamically and continuously integrate preceding feedback for the evaluation of present feedback.
Do food images as action outcomes evoke a reward positivity?
2021, Brain and Cognition
Favourable compared to unfavourable action outcomes typically evoke a positive-going amplitude shift at frontomedial electrodes in the scalp-recorded electroencephalogram. Since prior studies on this Reward Positivity (RewP) have heavily relied on monetary outcomes, it is still debated whether the RewP is also elicited by other kinds of reward. We addressed this issue by focussing on food as another major category of daily reward. Twenty-eight healthy participants completed a decision task, in which they received images of personally liked, neutral or disliked food as outcome stimuli. Importantly, single trial outcomes were of relevance for a prolonged task goal (i.e., obtaining the liked foods and avoiding the disliked foods). The observed amplitude pattern did not correspond to the typical RewP effect observed for monetary outcomes. In particular, disliked foods evoked a similar positive-going amplitude shift as liked foods when compared to neutral foods. Exploratory analyses indicated that this pattern may result from a spatiotemporal overlap between a potential RewP response and other, emotion-related ERP components (i.e., the Early Posterior Negativity and the Late Positive Potential). We discuss our findings with regard to theoretical and methodological implications for the usage of the RewP in the study of reward processing.
Dissociating the effect of reward uncertainty and timing uncertainty on neural indices of reward prediction errors: A reward positivity (RewP) event-related potential (ERP) study
2021, Biological Psychology
Citation Excerpt :
However, due to recent evidence showing that the underlying ERP component is driven primarily by rewards and an absence of reward that appears as a negativity (Proudfit, 2015), the terminology RewP will be used throughout this paper. RewP amplitude is thought to be partially driven by the activity of cortical dopamine neurons (Meadows, Gable, Lohse, & Miller, 2016), and, as such, serves as a neurobiological index of reward prediction errors (Mulligan & Hajcak, 2018; Sambrook & Goslin, 2015). Along with indexing reward prediction errors, amplitude of the RewP varies based on multiple characteristics of feedback.
Accurate reward predictions include forecasting both what a reward will be and when a reward will occur. We tested how variations in the certainty of reward outcome and certainty in timing of feedback presentation modulate neural indices of reward prediction errors using the reward positivity (RewP) component of the scalp-recorded brain event-related potential (ERP). In a within-subjects design, seventy-three healthy individuals completed two versions of a cued doors task; one cued the probability of a reward outcome while the other cued the probability of a delay before feedback. Replicating previous results, RewP amplitude was larger for uncertain feedback compared to certain feedback. Additionally, RewP amplitude was differentially associated with uncertainty of presence/absence of reward, but not uncertainty of feedback timing. Findings suggest a dissociation in that RewP amplitude is modulated by reward prediction certainty but is less affected by certainty surrounding timing of feedback.
The aversion positivity: Mediofrontal cortical potentials reflect parametric aversive prediction errors and drive behavioral modification following negative reinforcement
2021, Cortex
Citation Excerpt :
Prior experiments examining positive and negative reinforcement are also confounded by failure to match outcome modality. Mulligan and Hajcak (2018) compared reward/loss to punishment/omission, and Heydari and Holroyd (2016) and Talmi et al. (2013) compared reward/omission with punishment/omission. However, in these tasks, conditions differed in outcome modality (money vs shock) and reinforcement timing (money is paid at the end of the study, but shocks are delivered immediately), making it impossible to draw direct comparisons between positive and negative reinforcement.
Reinforcement learning capitalizes on prediction errors (PEs), representing the deviation of received outcomes from expected outcomes. Mediofrontal event-related potentials (ERPs), in particular the feedback-related negativity (FRN)/reward positivity (RewP), are related to PE signaling, but there is disagreement as to whether the FRN/RewP encode signed or unsigned PEs. PE encoding can potentially be dissected by time-frequency analysis, as frontal theta [4–8 Hz] might represent poor outcomes, while central delta [1–3 Hz] might instead represent rewarding outcomes. However, cortical PE signaling in negative reinforcement is still poorly understood, and the role of cortical PE representations in behavioral reinforcement learning following negative reinforcement is relatively unexplored. We recorded EEG while participants completed a task with matched positive and negative reinforcement outcome modalities, with parametrically manipulated single-trial outcomes producing positive and negative PEs. We first demonstrated that PEs systematically influence future behavior in both positive and negative reinforcement conditions. In negative reinforcement conditions, mediofrontal ERPs positively signaled unsigned PEs in a time window encompassing the P2 potential, and negatively signaled signed PEs for a time window encompassing the FRN/RewP and frontal P3 (an “aversion positivity”). Central delta power increased parametrically with increasingly aversive outcomes, contributing to the “aversion positivity”. Finally, negative reinforcement ERPs correlated with RTs on the following trial, suggesting cortical PEs guide behavioral adaptations. Positive reinforcement PEs did not influence ERP or time-frequency activity, despite significant behavioral effects. These results demonstrate that mediofrontal PE signals are a mechanism underlying negative reinforcement learning, and that delta power increases for aversive outcomes might contribute to the “aversion positivity.”
Reward Processing Abnormalities and Promising New Directions for Understanding Suicide Vulnerability
2021, Biological Psychiatry: Cognitive Neuroscience and Neuroimaging
Reward-Related Neural Predictors and Mechanisms of Symptom Change in Cognitive Behavioral Therapy for Depressed Adolescent Girls
2021, Biological Psychiatry: Cognitive Neuroscience and Neuroimaging
Citation Excerpt :
Specifically, the RewP has been linked to activity within the mesocorticolimbic reward circuit (e.g., ventral striatum and medial prefrontal cortex) (12,13) and dorsal anterior cingulate cortex (17); conversely, the LPP has been associated with a more distributed set of cortical and subcortical regions linked with visual, attentional, and emotion processing, including occipital, parietal, inferotemporal, and lateral prefrontal regions, as well as the amygdala and insula (47–51). In addition, in contrast to the RewP, which reflects initial reactivity to the receipt of rewards [but see studies linking the RewP/FRN to unexpected outcomes or feedback indicating safety, e.g., (52)], the LPP reflects more sustained attention toward and engagement with emotional or motivationally salient content (and not specific to only rewards). Although this is speculative, depressed adolescents exhibiting more sustained neural engagement to rewarding or motivationally salient feedback may be relatively more likely to successfully engage in and benefit from cognitive and behavioral activities prescribed in CBT.
Approximately half of depressed adolescents fail to respond to cognitive behavioral therapy (CBT). Given the variability in response, it is important to identify pretreatment characteristics that predict prognosis. Knowledge of which depressed adolescents are likely to exhibit a positive versus poor outcome to CBT may have important clinical implications (e.g., informing treatment recommendations). Emerging evidence suggests that neural reward responsiveness represents one promising predictor.
Adolescents with major depressive disorder (n = 36) received CBT and completed a reward task at 3 time points (pretreatment, midtreatment and posttreatment) while 128-channel electroencephalographic data were acquired. Healthy control participants (n = 29) completed the same task at 3 corresponding time points. Analyses focused on event-related potentials linked to 2 stages of neural processing: initial response to rewards (reward positivity) and later, elaborative processing (late positive potential). Moreover, time-frequency analyses decomposed the reward positivity into 2 constituent components: reward-related delta and loss-related theta activity.
Multilevel modeling revealed that greater pretreatment reward responsiveness, as measured by the late positive potential to rewards, predicted greater depressive symptom change. In addition, a group × condition × time interaction emerged for theta activity to losses, reflecting normalization of theta power in the group with major depressive disorder from baseline to posttreatment.
An event-related potential measure of sustained (late positive potential)—but not initial (reward positivity)—reward responsiveness predicted symptom improvement, which may help inform which depressed adolescents are most likely to benefit from CBT. In addition to alleviating depression, successful CBT may attenuate underlying neural (theta) hypersensitivity to negative outcomes in depressed youths.

View all citing articles on Scopus

View full text

The electrocortical response to rewarding and aversive feedback: The reward positivity does not reflect salience in simple gambling tasks

Highlights

Abstract

Introduction

Section snippets

Participants

RewP

Discussion

NeuroImage

NeuroImage

J. Neurosci. Methods

Clin. Neurophysiol.

NeuroImage

Electroencephalogr. Clin. Neurophysiol.

Biol. Psychol.

Brain Res.

Brain Res.

Neurosci. Biobehav. Rev.

Neurosci. Biobehav. Rev.

Brain Cogn.

Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-maze

Cereb. Cortex

A single-trial estimation of the feedback-related negativity and its relation to BOLD responses in a time-estimation task

J. Neurosci.

Self-report and behavioral measures of reward sensitivity predict the feedback negativity

Psychophysiology

Evaluating two-step PCA of ERP data with geomin, infomax, oblimin, promax, and varimax rotations

Psychophysiology

Differentiating neural responses to emotional pictures: evidence from temporal-spatial PCA

Psychophysiology

Event-related potential activity in the basal ganglia differentiates rewards from nonrewards: temporospatial principal components analysis and source localization of the feedback negativity

Hum. Brain Mapp.

The medial frontal cortex and the rapid processing of monetary gains and losses

Science