Motivational signals disrupt metacognitive signals in the human ventromedial prefrontal cortex

Hoven, Monja; Brunner, Gina; de Boer, Nina S.; Goudriaan, Anna E.; Denys, Damiaan; van Holst, Ruth J.; Luigjes, Judy; Lebreton, Maël

doi:10.1038/s42003-022-03197-z

Download PDF

Article
Open access
Published: 18 March 2022

Motivational signals disrupt metacognitive signals in the human ventromedial prefrontal cortex

Monja Hoven ORCID: orcid.org/0000-0002-0900-8565¹,
Gina Brunner^1,2,
Nina S. de Boer^1,3,
Anna E. Goudriaan^1,4,
Damiaan Denys^1,5,
Ruth J. van Holst¹^na1,
Judy Luigjes¹^na1 &
…
Maël Lebreton^6,7^na1

Communications Biology volume 5, Article number: 244 (2022) Cite this article

3039 Accesses
4 Citations
16 Altmetric
Metrics details

Subjects

Abstract

A growing body of evidence suggests that, during decision-making, BOLD signal in the ventromedial prefrontal cortex (VMPFC) correlates both with motivational variables – such as incentives and expected values – and metacognitive variables – such as confidence judgments – which reflect the subjective probability of being correct. At the behavioral level, we recently demonstrated that the value of monetary stakes bias confidence judgments, with gain (respectively loss) prospects increasing (respectively decreasing) confidence judgments, even for similar levels of difficulty and performance. If and how this value-confidence interaction is reflected in the VMPFC remains unknown. Here, we used an incentivized perceptual decision-making fMRI task that dissociates key decision-making variables, thereby allowing to test several hypotheses about the role of the VMPFC in the value-confidence interaction. While our initial analyses seemingly indicate that the VMPFC combines incentives and confidence to form an expected value signal, we falsified this conclusion with a meticulous dissection of qualitative activation patterns. Rather, our results show that strong VMPFC confidence signals observed in trials with gain prospects are disrupted in trials with no – or negative (loss) – monetary prospects. Deciphering how decision variables are represented and interact at finer scales seems necessary to better understand biased (meta)cognition.

Neural and computational underpinnings of biased confidence in human reinforcement learning

Article Open access 28 October 2023

Prefrontal signals precede striatal signals for biased credit assignment in motivational learning biases

Article Open access 02 January 2024

Task state representations in vmPFC mediate relevant and irrelevant value signals and their behavioral influence

Article Open access 31 May 2023

Introduction

Over the past decades, a growing number of neurophysiological studies in human and non-human primates have established that the neural signals recorded during learning and decision-making tasks in the orbito-medial parts of the prefrontal cortex (OMPFC)—the medial orbitofrontal cortex (OFC) and the ventromedial prefrontal cortex (VMPFC)—correlate with key concepts from theories of motivation and decision-making^1,2,3. For instance, in Pavlovian conditioning tasks, the activity of neurons in the non-human primate OFC correlates with the anticipatory value of upcoming rewards, with neural activity predicting the monkeys’ subjective preferences⁴. In economic decision-making tasks, neuronal activity in the same region of the OFC correlates with the subjective value of available options⁵. In humans, similar results have been derived from functional neuroimaging studies. Blood-oxygen level-dependent (BOLD) signal in the VMPFC scales with the anticipation of upcoming rewards^6,7, the subjective pleasantness and desirability attributed to different stimuli⁸, the willingness to pay for different types of goods^9,10,11, and the expected value (EV) of prizes, performance incentives, and economic bundles such as lotteries^12,13,14,15. Overall, together with the midbrain and the ventral striatum (VS), the VMPFC seems to form a “brain valuation system”^16,17,18, whose activity automatically indexes the value of available options so as to guide value-based decision-making^8,10 and motivate motor and cognitive performance¹⁹.

Recently, a set of human neurophysiological studies have suggested that activity in the VMPFC is also related to metacognitive processes^20,21. In particular, both single neuron and BOLD activity in the VMPFC correlate with participants’ confidence in their own judgments and choices^22,23,24,25. Confidence is a metacognitive variable that can be defined as one’s subjective estimate of the probability of a given choice being correct^26,27. Just like values, confidence judgments seem to be automatically represented in the VMPFC, for different types of judgments and choices^24,28,29. Confidence signals could be useful for the flexible adjustment of behavior—such as monitoring and reevaluating previous decisions³⁰, tracking changes in the environment^31,32, adapting future decisions^30,33, or arbitrating between different strategies^34,35.

Interestingly, at the behavioral level, values and confidence seem to interact. For instance, a handful of studies in psychology and economics have documented that positive incentive values, operationalized as prospects of monetary bonuses, increase subjective estimates of confidence³⁶. Similar confidence boosts have been reported with higher state values, operationalized as positive incidental psychological states such as elevated mood³⁷, absence of worry³⁸, and emotional arousal^39,40,41. Recently, we designed an incentivized perceptual decision-making task to demonstrate that monetary incentives bias confidence judgments, with gain (respective loss) prospects increasing (respectively decreasing) confidence judgments, even for similar levels of difficulty and performance⁴². This result was also replicated in a reinforcement-learning context^43,44. We explicitly hypothesized that this interaction would stem from the concurrent neural representation of—hence putative interaction between—incentive values and confidence in the VMPFC⁴².

Here, we used a functional neuroimaging adaptation of our original perceptual decision-making paradigm that allows for investigation of the overlap in neural correlates between incentive value and confidence⁴². Our first set of analyses did not show the hypothesized overlap of incentive value and confidence signals in the VMPFC at the expected statistical threshold (p < 0.05 whole-brain corrected family-wise error (FWE) at the cluster level), nor in other regions of interest (ROI) that have been linked with value, motivation, and confidence in the past—such as the VS and the anterior cingulate cortex (ACC). Therefore, we formulated an alternative hypothesis, positing that VMPFC integrates confidence and incentive signals into a probabilistic EV signal. We ran several quantitative and qualitative analyses that thoroughly compared the relative merits of these different hypotheses for the neural basis of the value-confidence interaction. Our results ultimately depict a complex picture, suggesting that motivational signals (notably prospects of loss) can disrupt metacognitive signals in the VMPFC.

Results

To investigate the neurobiological basis of the interactions between incentives and confidence, we modified the task used in Lebreton et al.⁴² to make it suitable for functional magnetic resonance imaging (fMRI) (Fig. 1a). Basically, this task is a simple perceptual task (contrast discrimination), featuring a two-alternative forced-choice followed by a confidence judgment. Then, we experimentally manipulated the available monetary outcomes, defining several incentive conditions: at each trial, participants could win (gain context) or lose (loss context) points—or not gain or lose anything (neutral context)—depending on the correctness of their choice. Incentives were presented in an interleaved fashion, in order to avoid contextualization of outcomes (rather than in a blocked design, where the absence of gain could be reframed as relative loss in a gain block, or vice versa). Importantly, this incentivization was implemented after the moment of choice and before the confidence rating. Consequently, by design, there should not be any incentivization effects on either accuracy or reaction times (RTs) as they develop during the choice. Note that this design corresponds to the simplest implementation of the task—corresponding to Experiment 2 in Lebreton et al.⁴²— which otherwise conditioned monetary outcomes to confidence rating precision rather than choice accuracy (for details see ref. ⁴²). Yet, our previous results suggested that this task still reveals an effect of incentives on confidence, while keeping instructions simpler—a desirable feature, especially for clinical and fMRI studies.

Behavioral results

To start, we verified that our task generated the incentive-confidence interaction at the behavioral level. First, using an approach similar to Lebreton et al.⁴², we used linear mixed-effects models to evaluate the effects of our experimental manipulation of incentives (i.e., the incentive condition) on behavioral variables (see Methods). More specifically, we defined and tested the incentives’ biasing effects (i.e., the net incentive value, or in other words, the linear effect of incentives coded as −1, 0, and +1) and incentives’ motivational effects (i.e., the absolute incentive value, or in other words, the mere presence of incentives, indicating whether something is at stake coded as 0 and +1). Replicating our previous results, we found a significant positive effect of incentive net value on confidence (β = 0.78 ± 0.32, t₄₃₁₇ = 2.43, p = 0.015; Fig. 1b, c) and no effect of incentive absolute value (β = −0.32 ± 0.55, t₄₃₁₇ = −0.58, p = 0.565; Fig. 1c). This result alone validates the presence of an incentive-confidence interaction at the behavioral level. Importantly, this effect was not driven by any net incentive value effects on accuracy or RT (accuracy: β = 0.38 ± 0.93, t₄₃₁₇ = 0.41, p = 0.685; RT: β = 13.75 ± 19.22, t₄₃₁₇ = 0.72, p = 0.474). Moreover, we did not find evidence for an effect of absolute incentive value on both accuracy and RT (accuracy: β = 1.86 ± 1.45, t₄₃₁₇ = 1.28, P = 0.199; RT: β = −25.24 ± 29.17, t₄₃₁₇ = −0.87, p = 0.387). Next, to confirm the robustness of our main effect of net incentive value on confidence, we ran several full linear mixed-effects models, which included additional control variables that could influence confidence as well (evidence, accuracy, RTs, et cetera, see Supplementary Note 1). Overall, the incentive-confidence interaction remained significant after accounting for those other potential sources of biases and confounds.

At last, we tested for an incentive effect on metacognitive sensitivity, a metric that measures the efficacy with which subjects discriminate between correct and incorrect answers using their confidence ratings (see Methods for details on its’ computation). Replicating earlier findings⁴², we found that incentive condition did not have a significant effect on metacognitive sensitivity (F(2,62) = 0.25, p = 0.783. Loss: 5.5973 ± 1.2106, neutral: 4.8572 ±1.0515, gain: 5.2797 ± 0.8692).

fMRI results

Having established the presence of a robust confidence-incentive interaction at the behavioral level, we next turned to the analysis of the functional neuroimaging data. Critically, our task allowed us to temporally distinguish the moment of stimulus presentation and choice—where the decision value and an implicit estimation of (un)certainty are expected to build up—from the incentive presentation and confidence rating moment—where the explicit, metacognitive confidence signal is expected to interact with the incentive (Fig. 2a, b).

**Fig. 2: Overview of general linear models for fMRI analyses.**

BOLD signal in the VMPFC correlates significantly with early certainty and incentives but weakly with confidence

Our original hypothesis proposes that incentives bias confidence because those two variables are both correlated to activity in the same brain area—presumably the VMPFC^22,23. To test this hypothesis, we built a first fMRI GLM (GLM1) which modeled (1) early certainty during stimulus and choice, and (2) both incentives and confidence ratings during incentive/rating (Fig. 2c). Early certainty was defined and computed as the precursor of confidence (i.e., an incentive bias-free signal of confidence), that builds up before the commitment to a choice (see Methods for details). During choice, early certainty positively correlated with activation in the VMPFC and the posterior cingulate cortex (PCC) (Fig. 3a). This replicates several studies that have reported an early and automatic (i.e., without explicit instructions) encoding of confidence in the VMPFC^23,25,45. Negative correlations of early certainty were observed in a widespread network including the bilateral dorsolateral prefrontal cortex (DLPFC) and rostro-lateral prefrontal cortex (RLPFC), bilateral anterior insula, right putamen, right inferior frontal gyrus, supplementary motor area, mid- and ACC, and bilateral inferior parietal lobe. This large network has already been implicated in uncertainty and metacognition²¹.

During the incentive/rating moment, we found positive correlations between incentive value and activity in the VMPFC, extending to clusters in the dorsomedial prefrontal cortex (DMPFC) (Fig. 3b). This is in line with our hypothesis and with a large body of neuro-economics literature¹⁶. A small cluster was detected in the occipital lobe, which negatively correlated with incentives.

Finally, regarding subjective confidence, we found significant positive effects in a large, lateralized visuo-motor network including the left primary motor cortex, left putamen, and left para-hippocampal gyrus, as well as the right cerebellum and right visual cortex (Fig. 3c). All those activations were mirrored in the negative correlation with confidence (although with lower and sometimes subthreshold significance), suggesting these brain regions are part of the visuo-motor network that processes the movement of the cursor on the rating scale (remember that movements of the cursor were operationalized with the left (respective right) index finger to move the cursor toward the left (respective right).

Outside those visuo-motor areas, activity in a large cluster in the dorsal anterior cingulate cortex (dACC) and the mid-cingulate cortex (MCC) was found to positively correlate with confidence. Interestingly, an adjacent region of the dACC negatively correlated with early certainty in the choice period (Fig. 3a).

To our surprise, and in contradiction with our hypothesis, no whole-brain significant cluster was found in the VMPFC at our a priori defined statistical threshold. There were, however signs of subthreshold activations (Fig. 3c).

As observed with confidence activations, motor-related activity can be an important confound. To ensure that our activity patterns of interest (i.e., early certainty, incentive, and confidence) were not related to motor processes, we replicated our analyses using an exclusive motor-related mask, generated from large-scale automated meta-analyses (see Methods for more details). Importantly, those control analyses revealed that most activations—with the exception of the visuo-motor activations identified in the confidence activation maps—remain significantly associated with our variables of interest (for whole-brain activation tables when using this exclusive mask, see Supplementary Data 2).

Accounting for incentive bias in confidence does not restore VMPFC confidence activations

Next, we attempted to understand the absence of strong correlations with confidence in the VMPFC, despite the same region robustly encoding early certainty and incentives (i.e., precursors of confidence). We reasoned that because confidence is biased by incentive, the shared variance between those two variables could have decreased our chances to reveal clear confidence signals during confidence ratings. We, therefore, built two control GLMs, which differed in how the incentive/rating period was modeled (Fig. 2c): GLM2a only included confidence as a parametric modulator, while GLM2b included incentive and early certainty (i.e., the precursor of confidence devoid of incentive shared variance). We defined an anatomical VMPFC ROI (see Methods and Fig. 4a), and extracted individual standardized regression coefficients (t values) corresponding to the confidence variable in those three GLMs (GLM1, GLM2a, GLM2b) (see Methods). We then tested whether the difference in the GLM specifications had an impact on these activations at the rating period (GLM1 and 2a: confidence; GLM2b: certainty) using repeated-measure analysis of variances (ANOVAs). Results showed that activations for GLM2a-confidence and GLM2b-early certainty during incentive/rating period were indistinguishable from GLM1-confidence (ANOVA, the main effect of GLMs: F(2,29) = 0.68; p = 0.509), falsifying the hypothesis that the weak confidence activations in VMPFC observed with GLM1 were due to an ill-specified GLM.

**Fig. 4: Activation in ventromedial prefrontal cortex across models.**

BOLD signal in the VMPFC strongly correlates with the EV

Having established that BOLD activity in the VMPFC only weakly correlates with confidence after the incentive display, we proposed an alternative hypothesis—namely that the VMPFC encodes a signal commensurate to an EV. The rationale of this hypothesis is twofold. First, because confidence represents a subjective probability of being correct, it may be combined with information about the prospective monetary bonus to generate a representation of EV, once this reward information is revealed. Second, activity in the VMPFC has been repeatedly shown to correlate with EV in different contexts (lotteries, et cetera)^12,13,14,15. To test this hypothesis, we built another fMRI GLM similar to the previous ones, but that instead modeled EV at the time of incentive/rating (GLM3; see Fig. 2c).

Whole-brain results showed massive positive correlations between EV and signal in the VMPFC stretching into the anterior medial prefrontal cortex, as well as the ventral and dorsal part of the ACC and the mid-cingulate cortex (Fig. 3d, Supplementary Data 1). There were no activation clusters negatively related to EV.

BOLD signal in the VMPFC correlates better with EV than with other variables

Although these results seem to validate our second hypothesis, our observation of more activations (wider cluster, lower p values) at the whole-brain level for EV than for confidence does not constitute a formal statistical test that VMPFC signals might rather correlate with EV than with confidence. These results may be owing to incentives and EV being highly correlated—in other words—, VMPFC activations to EV could simply be a result of VMPFC activations to incentives. To rule out these hypotheses, we built an additional GLM (GLM4), which only included incentive at the incentive/rating period (Fig. 2c). Again, we extracted VMPFC individual standardized regression coefficients (t values) corresponding to the early certainty, incentive, and confidence-related activations in all available GLMs. We tested whether the different specifications had an impact on those activations using repeated-measure ANOVAs, and post hoc t tests (Fig. 4, Table 1). Although activations for early certainty during choice moment were similar for all GLMs (ANOVA, main effect of GLM; F(4,29) = 0.24, p = 0.916; Fig. 4b), GLM specification had an impact on both the incentive activations (ANOVA, main effect of GLM; F(3,29) = 10.67, p = 4.837 × 10⁻⁶; Fig. 4c) and the confidence activations (ANOVA, main effect of GLM; F(3,29) = 3.22, p = 0.027; Fig. 4d) during incentive/rating moment. In both cases, post hoc t tests showed that t values extracted from the GLM3 that related to the EV regressor were significantly higher than from other GLMs with a different coding of incentives (GLM1 vs GLM3: t₂₉ = 3.90, p = 5.306 × 10⁻⁴; GLM2b vs GLM3: t₂₉ = 3.38, p = 0.002, GLM4 vs GLM3: t₂₉ = 2.97, p = 0.006), and marginally higher from other GLMs with a different coding of confidence (GLM1 vs. GLM3: t₂₉ = 1.92, p = 0.064; GLM2a vs. GLM3: t₂₉ = 1.72, p = 0.096; GLM2b vs. GLM3: t₂₉ = 2.36, p = 0.025). Overall these analyses suggest that the VMPFC combines incentive and confidence signals in the form of an EV signal.

Table 1 Comparison of ventromedial prefrontal cortex (VMPFC) parametric activity (t values) as a function of model specification (GLMs).

Full size table

Qualitative falsification of the EV model of VMPFC activity

At last, in order to confirm the conclusions drawn from our quantitative comparison of VMPFC activations, we ran a qualitative falsification exercise⁴⁶. Leveraging the factorial design of our experiment, we could draw qualitative patterns of activations that would be expected under different hypotheses underlying VMPFC activation (Fig. 5a).

**Fig. 5: Activation in ventromedial prefrontal cortex across incentives and time points.**

To this end, we designed a final GLM (GLM5) that divided the task into two timepoints (stimulus/choice and incentive/rating), and three incentive conditions, and that incorporated a baseline and a regression slope with confidence judgment for all these events. We then extracted the VMPFC activations for all these regressors using our ROI, and compared them with the theorized qualitative patterns we would expect if the VMPFC encoded one of these variables (Fig. 5b, c and Table 2, Table 3). As expected, at the moment of the stimulus/choice, there was no effect of incentive conditions on VMPFC baseline activity, nor on its correlation with confidence—“slope” (ANOVA baseline: F(2,29) = 0.36, p = 0.701; ANOVA correlation with confidence: F(2,29) = 0.56, p = 0.574). Basically, the slopes were significantly positive in all three incentive conditions (Loss: t₂₉ = 2.10, p = 0.045; Neutral: t₂₉ = 2.43, p = 0.021; Gain: t₂₉ = 3.04, p = 0.005), confirming that the VMPFC encodes an early certainty signal.

Table 2 Comparison of ventromedial prefrontal cortex (VMPFC) activity at the choice moment (t values), as a function of incentive condition.

Full size table

Table 3 Comparison of ventromedial prefrontal cortex (VMPFC) activity at rating moment (t values), as a function of incentive condition.

Full size table

At rating moment, incentive conditions had an effect on both VMPFC baseline activity, and on the correlation of VMPFC activity with confidence (ANOVA baseline: F(2,29) = 8.56, p = 5.543 × 10⁻⁴; ANOVA correlation with confidence: F(2,29) = 5.26, p = 0.008). Post hoc testing revealed that VMPFC baseline activity was significantly larger in gain versus loss (t₂₉ = 3.47, p = 0.002) and in gain versus neutral conditions (t₂₉ = 3.17, p = 0.004), but not in neutral versus loss condition (t₂₉ = 0.43, p = 0.673) (see Table 3). This constitutes a deviation from a standard linear model of incentives, and suggest that different regions might process incentives in gains and loss contexts⁴⁷.

Moreover, we found that the correlation of VMPFC activity with confidence is significantly positive in the gain condition only (t₂₉ = 3.29, p = 0.003), and not in the loss (t₂₉ = −0.75, p = 0.457) nor neutral (t₂₉ = 0.70, p = 0.491) conditions. The correlation with confidence was therefore significantly higher in gain versus loss (t₂₉ = 3.13, p = 0.004) and in gain versus neutral conditions (t₂₉ = 2.02, p = 0.053), but not in neutral versus loss condition (t₂₉ = 1.03, p = 0.313). Although the absence of correlation in the neutral condition would be expected if the VMPFC encodes EV, the lack of correlation in the loss condition was not predicted by any of our models (Fig. 5a). Because VMPFC confidence activations were robustly observed in the gain domain, as well as VMPFC early certainty activations in all three conditions, we suggest that the lack of VMPFC confidence activations in the neutral and loss conditions is a feature of the VMPFC signal, rather than a failure of our design to elicit those activations (e.g., due to limited statistical power or excessive statistical noise).

To evaluate whether the lack of robust confidence activation in the neutral and loss condition could be caused by the rough averaging of the VMPFC signal over the anatomical ROI, we also performed a finer-grained analysis. We extracted confidence activations in the three conditions and two timepoints at the voxel-level in a large anatomical area covering most of the medial prefrontal cortex, averaged those activations over two dimensions (respectively X and Z, and X and Y), and assessed how activations unfold over the last dimension—respectively Y and Z (Fig. 6). This last analysis confirmed three main facts: first, the early certainty activations are robustly observed in the same portion of the VMPFC, and—as expected—with similar effect sizes in the three conditions; second, the confidence activations in the gain condition are observed at similar levels as the early certainty activations, confirming that our experimental design elicits robust activations at the incentive/confidence rating time-point; third, no confidence activations can be detected at this finer-grained level in the neutral or loss condition, in the VMPFC. If anything, it seems that the confidence activations in the loss condition trend toward a negative correlation between VMPFC BOLD signal and confidence.

**Fig. 6: Activation in ventromedial prefrontal cortex across Y and Z dimensions.**

Overall, these results initially explain why EV appears a better model of VMPFC activation than confidence and/or incentive (correct pattern in gains and neutral conditions), but ultimately falsify this account by demonstrating the absence of positive correlation between VMPFC activation and confidence in the loss condition.

Discussion

In this study, we set out to investigate the neural signature of incentive bias on confidence estimations, using an fMRI-optimized version of an incentivized perceptual decision-making task⁴². First, at the behavioral level, we replicated the biasing effect of incentives on confidence estimation, in the form of higher confidence in gain contexts and lower confidence in loss context, despite equal difficulty and performance. This result is the fourth independent replication of this bias, initially revealed in perceptual decision making and later generalized in a reinforcement-learning task^43,44. Note, however, that the bias’ effect size remains small—a few average confidence percentage points at the population level— which a priori limits our ability to dissect its precise neurophysiological basis with current (correlational) functional neuroimaging techniques.

Our initial goal and hypothesis were therefore quite simple and modest. In the literature, it is now well established that the BOLD signal in the VMPFC correlates with confidence and/or values in a variety of tasks^{22,23,24,25,29,45}. We reasoned that if we could provide evidence for the presence of both incentive and confidence signals in the VMPFC during our task, this would reinforce the intuition that the VMPFC has a role in the observed behavioral phenomenon, i.e., the incentive bias on confidence. Our neuroimaging predictions were that (1) the VMPFC should correlate with early certainty before and during choice, regardless of the context, and (2) the VMPFC should integrate confidence and incentive after the choice and the revealing of the incentive condition. Our broader, speculative neural hypothesis was that during this last confidence judgment step, a third-party metacognitive region or network would sample signal in the VMPFC^48,49, and incidentally end up with a biased confidence estimate incorporating incentive signal. Our limited sample size combined with some known limits of brain-behavior analyses⁵⁰ restricted a priori any ambition to validate a neurobiological model of the observed confidence bias by running inter-individual correlations between VMPFC activations and the confidence bias estimated at the behavioral level.

Our fMRI investigation of the neural correlates of early certainty confirms our first prediction: BOLD activity in the VMPFC positively correlates with early certainty in all conditions. This result replicates and extends previous studies demonstrating this area to be associated to the initial and automatic processing of confidence during choice^22,23,25. In parallel with this positive correlation in the VMPFC, we also observed widespread negative correlations in the DLPFC, DMPFC, and insula, a network robustly associated with both metacognition and uncertainty^21,29,51. Contrary to our second prediction, we only found weak evidence (i.e., at a lower statistical threshold than the one we defined a priori) for confidence encoding in the VMPFC. Robust activations were nonetheless observed in the dACC, a region known to be recruited in metacognitive judgments^20,52.

Given that the lack of robust confidence signal in the VMPFC is somewhat in contradiction with what we expected from our previous work, as well as numerous other reports in the literature^{22,23,24,25,29,45}, we formulated an alternative hypothesis: we proposed that VMPFC could encode a signal commensurate to an expected reward (or EV), i.e., incorporating the subjective probability of being correct with the potential incentive bonus when revealed. Whole-brain activations and ROI quantitative analyses clearly showed that this second hypothesis seems to give a better account of VMPFC BOLD activations. EV signals are frequently reported in the VMPFC, but mostly in reinforcement-learning contexts, where they are critical to both choices between available options and learning—i.e., value updating, through the computation of prediction errors⁵³. In the present perceptual task, there is no learning, therefore no explicit need to encode EV.

Because quantitative comparisons of hypotheses are notoriously hard to interpret, we decided to leverage the factorial aspect of our design to proceed to a qualitative hypothesis falsification, to validate—or falsify—the EV account of VMPFC activity⁴⁶. In short, different hypotheses about what should be contained in VMPFC signal (EV, confidence, and/or incentives) predict different patterns of activations (baseline and correlation with confidence) in our different incentive conditions. From activity extracted from an anatomical VMPFC ROI, it is clear that VMPFC activity correlates with confidence only in the gain context, once the incentive has been revealed. This finding explains why the EV hypothesis obtained stronger quantitative support than the confidence and/or incentives hypotheses (as the VMPFC activity pattern is similar to the EV predictions in the gain and neutral context). However, it also ultimately falsifies this EV hypothesis as well, as VMPFC activity does not seem to correlate with confidence in the loss context. Interestingly, VMPFC does correlate with early certainty—a precursor of confidence—in all conditions before the incentives are revealed. Therefore, it does not seem that the VMPC fails to activate in the neutral and loss conditions, but rather that the signal is actively suppressed once those contexts are explicit. Moreover, the fact that we do not observe confidence activations in neutral or loss conditions is also not due to the fact that participants are less focused on evaluating confidence in those conditions compared to the gain condition, as we showed that the confidence sensitivity is identical in all incentive conditions. In summary, we believe that our results show a complex picture of disruptions of confidence signals within the VMPFC in response to motivational signals.

The absence of VMPFC confidence signal in the neutral condition might seem at odds with other studies that report such signal in non-incentivized tasks such as pleasantness or desirability ratings²³. One possible explanation is that VMPFC confidence signals, like attentional modulation of evidence integration⁵⁴, are primarily observed for behavior or conditions that are relevant to participants’ goals: in non-incentivized tasks such as pleasantness or desirability ratings, participants still have a goal, which is to provide ratings that are as accurate as possible. In our task, if the goal of participants is to maximize their score, the neutral condition might not be goal-relevant, which could result in a disrupted VMPFC confidence signal. Note that because our design features interleaved (rather than blocked) conditions, the valence manipulation is somewhat exacerbated, as the succession of the different conditions limit the contextualization of outcomes (whereby the absence of loss could be reframed as a relative gain in a loss-block). Also, because trials featuring gains, losses, and neutral incentives follow each-others in a pseudorandomized order, the interleaved design also prevent any systematic bias or confound for the valence effects (at the behavioral or neurobiological levels) that could be due to the processing of the feedbacks (gains, losses, or nothing).

The notion that there are different brain networks that execute symmetric computations in gains versus loss contexts is increasingly popular^47,55. Because the positive, gain context network also typically includes the VS (see e.g.,^12,16 we replicated all analyses using an anatomical VS ROI (see Supplementary Note 2). These analyses qualitatively rendered very similar results to what we observed in the VMPFC. In the present data set though, we did not find any region correlating either positively or negatively with confidence in the loss context, even when exploring the whole-brain level with very lenient statistical thresholds. The dACC is a promising area, since it has repeatedly been associated with loss anticipation and correlated positively with subjective confidence in our data. However, when we performed a similar falsification exercise within the dACC as we used within the VMPFC (see Supplementary Note 3), the results were similar to the VMPFC activation patterns: dACC activity only correlated with confidence within the gain contexts. In summary, it remains an open question what the neurobiological correlates of confidence judgments in loss contexts are.

Our results constitute a stepping stone and have important implications for studying clinical populations where these (meta)cognitive processes go awry. It shows that motivational processes can influence confidence, and when there are discrepancies between one’s behavior and confidence in that behavior, this could give rise to pathological decision making. Indeed, several psychiatric disorders such as addiction, obsessive-compulsive disorder, and schizophrenia have been associated with disrupted incentive processing^{56,57,58,59,60} and studies have additionally demonstrated distorted confidence estimations in these groups⁶¹. Our study indicates that the VMPFC is a key region involved in the interaction between motivation and metacognition, and VMPFC function is also often affected in many psychiatric disorders⁶². The current study provides a means of studying neurobiological explanations for confidence abnormalities and their interaction with incentive motivation in the clinical population which can potentially impact clinical practice, as it could help treat psychopathology⁶². Therefore, the relationship between motivational processes and confidence estimation and their role in psychopathology warrants future investigation.

In conclusion, we show that although the VMPFC seems to encode both value and metacognitive signals, these metacognitive signals are only present during the prospect of gain and are disrupted in a context with loss or no monetary prospects. Studies targeting this problem within a finer spatial^24,63,64 and/or temporal scale⁶⁵ could help with resolving and better comprehending biased confidence judgments and metacognition overall.

Methods

Participants

We included 33 right-handed healthy participants with normal or corrected to normal vision. Exclusion criteria were an IQ below 80, insufficient command of the Dutch language, or MRI contraindications. All experimental procedures were approved by the Medical Ethics Committee of the Academic Medical Center, University of Amsterdam (METC 2015_319), and participants gave written informed consent. Participants were compensated with a base amount of €40 and additional gains based on task performance. Session-level behavioral and fMRI data were excluded when task accuracy was below 60% or when subjects did not show sufficient variation in their confidence reports (standard deviation of confidence judgments < 5 confidence points), and session-level fMRI data when participants showed head movements > 3.5 mm. This led to the inclusion of 32 participants (18/14 females/males, 18–58 years old (sd: 9.76)) for the behavioral analyses and 30 for the fMRI analyses, of which four participants contributed only one of two task sessions.

Decision-making and confidence judgment task

We adapted the task from Lebreton et al.⁴² for use in an fMRI environment with fMRI suitable timing intervals. For an overview and details, see Fig. 1a. All tasks used in this study were implemented using MATLAB® (MathWorks Inc., Sherborn, MA, USA) and the COGENT toolbox (www.vislab.ucl.ac.uk/cogent.php).

Study procedure

On the day of testing, subjects were first assessed for clinical and demographic data, after which they performed one practice session (10 trials) outside of the scanner and another one inside the scanner to become acquainted with the task. Subjects were instructed that they would only be rewarded based on their performance (i.e., they should be as accurate as possible to maximize their earnings), and that it was important to give accurate confidence judgments. They were notified that 50% confidence would signal that they made a guess, whereas 100% confidence would signal that they were absolutely certain that they made the correct choice. Thus, performance but not confidence was incentivized. According to our previous findings⁴², this design elicits incentive bias on confidence while keeping confidence sensitivity identical across conditions—an important consideration when interpreting differences in confidence activations between those conditions. All subjects initially performed a 144-trial calibration session inside the scanner to tailor the difficulty levels of the task to each individual and to keep performance constant across subjects. This was done using a staircase procedure, in which data were used to estimate a full psychometric function, whose parameters were used to generate stimuli for the main task, spanning three difficulty levels (i.e., 65%, 75%, and 85% accuracy, on average) (for details, see ref. ⁴²).

Two sessions of the main task were performed in the fMRI scanner, each consisting of 72 trials with 24 trials per incentive condition, presented in random order. The practice task, calibration, and main sessions were projected onto an Iiyama monitor in the fMRI environment, which subjects could see through a 45-degree angle mirror fixed to the head coil. After completing the fMRI task, six random trials were drawn (i.e., two of each incentive condition) on which the payment was based. If subjects made an accurate choice, they would either gain or avoid losing points, whereas they would miss out on gaining or losing points when making an error. In the neutral trials, nothing was at stake. Finally, the total amount of points were converted to money.

Behavioral measures

We extracted various trial-by-trial experimental factors (evidence, incentive, and difficulty level) and behavioral measures (accuracy, subjective confidence ratings, RTs). Control analyses were performed to confirm the properties of confidence ratings (Supplementary Note 4). Three additional variables were computed as combinations of those experimental factors and behavioral measures: early certainty, EV, and metacognitive sensitivity.

Early certainty

We built an “early certainty” variable that represents a confidence signal prior to the biasing effects of incentives. We assume that such an early certainty signal should be encoded automatically at the moment of choice, in turn allowing us to investigate confidence signals with and without incentive bias²³. Importantly, such a signal should be highly correlated with the later, biased confidence judgment obtained from the subjects, while exhibiting no statistically significant relationship with incentives. Therefore, we used a leave-one-trial-out approach to obtain trial-by-trial estimations of early certainty⁵². We fitted a generalized linear regression model to each subject’s subjective confidence ratings using choice and stimulus features as predictors (i.e., log-transformed RTs, evidence, accuracy, and the interaction between accuracy, and evidence), using the whole individual dataset but trial X. We then applied this model’s estimates to generate predictions about the early certainty in trial X, using the choice and stimulus features of trial X. This process was repeated for every trial, resulting in a trial-by-trial prediction of early certainty based on stimulus features at choice moment. The resulting early certainty signal featured high correlation with confidence, and no statistical relationship with incentives (see Supplementary Note 5 for more details). Importantly, since the early certainty signal follows the main properties of confidence judgments (Supplementary Fig. 6), but does not show any incentive bias, this critically enables us to differentiate between non-biased confidence signals during decision-making and biased confidence signals after incentivization.

EV

We computed a value-based measure of EV. In our task paradigm, EV was computed as an integrative signal of early certainty (i.e., the non-biased probability of being correct) and the incentive value (i.e., the value-context of the current trial). Early certainty ratings represent the subjects’ probability of being correct, and thus the probability of gaining (or avoid losing) the incentive at stake. Thus, EV corresponds to 0 in the neutral condition (no value is expected to be gained or lost), is equal to early certainty in the gain condition (e.g., being 100% certain results in a maximal EV in a positive incentive environment), and is equal to early certainty—100 (e.g., being 100% certain in a loss trial results in an EV of 0, as you avoid losing).

Metacognitive sensitivity

Metacognitive sensitivity is a metric that indicates how well an observer’s confidence judgments discriminate between their correct and incorrect answers and can be represented using several indexes. For example, discrimination is a metric calculated as the difference between the average confidence for correct answers and the average confidence for incorrect answers, whereas meta-d’ is a metric based on the Signal Detection Theory framework⁶⁶. Notably, meta-d’ computations are known to be imprecise in designs with a low number of trials per condition⁶⁷. This, together with results from our earlier work⁴² showing high correlations between discrimination and meta-d’, as well as identical conclusions with respect to the effects of incentives on these measures, lead to us using the discrimination metric as our measure of metacognitive sensitivity.

fMRI acquisition and preprocessing

fMRI data were acquired by using a 3.0 Tesla Intera MRI scanner (Philips Medical Systems, Best, The Netherlands). Following the acquisition of a T1-weighted structural anatomical image, 37 axial T2*-weighted EPI functional slices sensitive to BOLD contrast were acquired. A multi-echo (three echoes) combine interleaved scan sequence was applied, designed to optimize functional sensitivity in all parts of the brain⁶⁸. The following imaging parameters were used: repetition time (TR), 2.375 seconds; echo times (TEs), 9.0 ms, 24.0 ms, and 43.8 ms, (total echo train length: 75 ms); 3 mm (isometric) voxel size; 37 transverse slices; 3 mm slice thickness; 0.3 mm slice-gap. Two experimental sessions were carried out, each consisting of 570 volumes. All further analyses were performed using MATLAB® with SPM12 software (Wellcome Department of Cognitive Neurology, London, UK).

Raw multi-echo functional scans were weighed and combined into 570 volumes per scan session. During the combining process, realignment was performed on the functional data by using linear interpolation to the first volume. The first 30 dummy scans were discarded. The remaining functional images were co-registered with the T1-weighted structural image, segmented for normalization to Montreal Neurological Institute (MNI) space, and smoothed using a Gaussian kernel of 6 mm at full-width at half-maximum.

Owing to sudden motion, in combination with the interleaved scanning method, a number of subjects showed artifacts in some functional volumes. In order to reduce those artifacts, the Art-Repair toolbox⁶⁹ was used to detect large volume-to-volume movement and repair outlier volumes. The toolbox identifies outliers by using a threshold for the variation of the mean intensity of the BOLD signal and a volume-to-volume motion threshold. A threshold of 1.5% variation from the mean intensity was used to detect and repair volume outliers by interpolating from the adjacent volumes (n = 12).

Statistics and reproducibility: behavioral analyses

All behavioral analyses were performed using MATLAB® and the R environment (RStudio Team (2015). RStudio: Integrated Development for R. RStudio, Inc., Boston, MA). For the statistical analyses reported in the main text, we used linear mixed-effects models (estimated with the fitglme function in MATLAB®) to model accuracy, RTs, and confidence. In order to analyze the effect of the incentive condition (i.e., of our experimental manipulation of incentives), for all three trial-by-trial dependent variables we used the absolute incentive value (i.e., the absolute value of the monetary incentive, |V|, coded as 0 and +1) and the net incentive value (i.e., the linear value of the monetary incentive, V, coded as −1, 0, and +1) as predictor variables. All mixed models included random intercepts and random slopes (N = 32). Additional control analyses are reported in Supplementary Note 4. For the analysis of metacognitive sensitivity, we performed a repeated-measures ANOVA, with net incentive value as within-subject factor.

Statistics and reproducibility: fMRI analyses

All fMRI analyses were conducted using SPM12. All general linear models (GLMs) were estimated on subject-level (N = 30) with two moments of interest: the moment of choice (i.e., presentation of the Gabor patches) and the moment of incentive presentation/confidence rating (Fig. 2). The rating moment follows the presentation of the incentive after 900 ms, hence the decision to analyze them as a single moment of interest. Moreover, the GLMs also included a regressor for the feedback moment, which was not of interest for analysis, but was intended to explain variance in neural responses related to value and accuracy feedback, but unrelated to the decision-making process.

When using parametric modulators in our GLMs, those were not orthogonalized and competed to explain variance. Nuisance regressors consisting of six motion parameters were included in all GLMs. Regressors were modeled separately for each scan session and constants were included to account for between-session differences in mean activation. All events were modeled by convolving a series of delta functions with the canonical hemodynamic response function at the onset of each event and were linearly regressed onto the functional BOLD-response signal. Low-frequency noise was filtered with a high pass filter with a cutoff of 128 seconds. All contrasts were computed at subject-level and taken to a group-level mixed-effect analysis using one-sample t tests.

We controlled for the number of sessions while making the first-level contrasts. We assessed group-level main effects by applying one-sample t tests against 0 to these contrast images. All whole-brain activation maps were thresholded using FWE for multiple corrections at cluster level (p_{FWE_clu} < 0.05), with a voxel cluster-defining threshold of p < 0.001 uncorrected.

GLM1: neural signatures of certainty, incentive, and confidence

GLM1 consisted of three regressors for the three moments of interest: “choice”, “incentive/rating”, and “feedback”, to which one or more parametric modulators (pmod) were added (Fig. 2). The regressors were specified as stick function time-locked to the onset of the events. The choice regressor was modulated by two pmods: early certainty (z scored before entering the GLM) and button press (left/right choice) in order to control for activity related to motor preparation. The incentive/rating regressor was modulated by two pmods: incentive value and subjective confidence level (z scored). At last, the feedback regressor was modulated by a pmod of accuracy.

Importantly, to ensure that our brain activations of interest (i.e., related to early certainty, incentive, and confidence) were not confounded by motor-related activations, we performed control analyses that implemented exclusive masking for motor activations. To do so, we generated the exclusive mask from “Neurosynth” (a platform for large-scale, automated synthesis of fMRI data⁷⁰), using the term ‘motor’ (https://neurosynth.org/analyses/terms/motor/). This mask represents key regions related to motor processes as identified by an automated meta-analysis of 2565 studies.

GLM2a: control for incentive bias 1

GLM2a consisted of the same regressors as GLM1, except that the rating moment was only modulated by confidence judgments (i.e., we deleted the incentive modulator).

GLM2b: control for incentive bias 2

GLM2b consisted of the same regressors as GLM1, except that the pmod of confidence judgments at the rating moment was replaced by a pmod for early certainty.

GLM3: neural signatures of EV

GLM3 consisted of the same regressors as GLM1, except that rating moment was modulated by a single pmod of EV.

GLM4: control for incentive

GLM4 consisted of the same regressors as GLM1, except that the rating moment was only modulated by incentives (i.e., we deleted the confidence judgment modulator).

GLM5: qualitative patterns of activations

GLM5 included a regressor for all three incentives at two time points of interest: choice and rating moment, as well as a regressor at feedback moment. All regressors at the choice moment were modulated by a pmod of early certainty and button press (L/R). All regressors at the rating moment were modulated by a pmod of confidence judgment. The feedback regressor was modulated by accuracy. This GLM allowed us to investigate activity related to both baseline and the regression slope with early certainty or confidence judgment for these events.

Regions of interest

To avoid circular inference, we took an independent anatomical ROI of the VMPFC from the Brainnetome Atlas⁷¹. We included three areas along the ventral medial axis for the VMPFC ROI. Using this ROI, we extracted individual t-statistics (i.e., normalized beta estimates⁵⁰) from contrasts of interest, and statistically compared them using paired t tests or repeated-measure ANOVAs.

Moreover, in order to perform a finer-grained analysis into early certainty and confidence activations, we took a larger anatomical ROI, covering most of the medial prefrontal cortex from the Brainnetome Atlas⁷¹ With this ROI, we extracted individual t-statistics from our contrasts of interest in GLM5 and averaged those activations over two dimensions (respectively, X and Z, and X and Y), so that we could assess the spread of activations over the last dimension, respectively, Y (anterior–posterior axis) and Z (ventral–dorsal axis).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All source data needed to evaluate or reproduce the figures and analyses described in the paper and supplementary materials are available online at ‘https://doi.org/10.6084/m9.figshare.19228977’. Second-level neuroimaging maps can be found at ‘https://neurovault.org/collections/12221/’⁷².

Code availability

All code needed to evaluate or reproduce the figures and analyses described in the paper and supplementary materials are available online at ‘https://doi.org/10.6084/m9.figshare.19228977’.

References

Rangel, A. & Hare, T. Neural computations associated with goal-directed choice. Curr. Opin. Neurobiol. 20, 262–270 (2010).
Article CAS PubMed Google Scholar
Kable, J. W. & Glimcher, P. W. The neurobiology of decision: consensus and controversy. Neuron 63, 733 (2009).
Article CAS PubMed PubMed Central Google Scholar
Padoa-Schioppa, C. Orbitofrontal cortex and the computation of economic value. Ann. N. Y. Acad. Sci. 1121, 232–253 (2007).
Article PubMed Google Scholar
Tremblay, L. & Schultz, W. Relative reward preference in primate orbitofrontal cortex. Nature 398, 704–708 (1999).
Article CAS PubMed Google Scholar
Padoa-Schioppa, C. & Assad, J. A. Neurons in the orbitofrontal cortex encode economic value. Nature 441, 223–226 (2006).
Article CAS PubMed PubMed Central Google Scholar
Kahnt, T., Heinzle, J., Park, S. Q. & Haynes, J. D. Decoding different roles for vmPFC and dlPFC in multi-attribute decision making. Neuroimage 56, 709–715 (2011).
Article PubMed Google Scholar
Knutson, B., Fong, G. W., Bennett, S. M., Adams, C. M. & Hommer, D. A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: characterization with rapid event-related fMRI. Neuroimage 18, 263–272 (2003).
Article PubMed Google Scholar
Lebreton, M., Jorge, S., Michel, V., Thirion, B. & Pessiglione, M. An automatic valuation system in the human brain: evidence from functional neuroimaging. Neuron 64, 431–439 (2009).
Article CAS PubMed Google Scholar
Chib, V. S., Rangel, A., Shimojo, S. & O’Doherty, J. P. Evidence for a common representation of decision values for dissimilar goods in human ventromedial prefrontal cortex. J. Neurosci. 29, 12315–12320 (2009).
Article CAS PubMed PubMed Central Google Scholar
Levy, D. J. & Glimcher, P. W. Comparing apples and oranges: using reward-specific and reward-general subjective value representation in the brain. J. Neurosci. 31, 14693–14707 (2011).
Article CAS PubMed PubMed Central Google Scholar
Plassmann, H., O’Doherty, J. & Rangel, A. Orbitofrontal cortex encodes willingness to pay in everyday economic transactions. J. Neurosci. 27, 9984–9988 (2007).
Article CAS PubMed PubMed Central Google Scholar
Knutson, B., Taylor, J., Kaufman, M., Peterson, R. & Glover, G. Distributed neural representation of expected value. J. Neurosci. 25, 4806–4812 (2005).
Article CAS PubMed PubMed Central Google Scholar
McNamee, D., Rangel, A. & O’Doherty, J. P. Category-dependent and category-independent goal-value codes in human ventromedial prefrontal cortex. Nat. Neurosci. 16, 479–485 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hare, T. A., O’Doherty, J., Camerer, C. F., Schultz, W. & Rangel, A. Dissociating the role of the orbitofrontal cortex and the striatum in the computation of goal values and prediction errors. J. Neurosci. 28, 5623–5630 (2008).
Article CAS PubMed PubMed Central Google Scholar
Gläscher, J., Hampton, A. N. & O’Doherty, J. P. Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making. Cereb. Cortex 19, 483–495 (2009).
Article PubMed Google Scholar
Bartra, O., McGuire, J. T. & Kable, J. W. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage 76, 412–427 (2013).
Article PubMed Google Scholar
Haber, S. N. & Behrens, T. E. J. The neural network underlying incentive-based learning: implications for interpreting circuit disruptions in psychiatric disorders. Neuron 83, 1019–1039 (2014).
Article CAS PubMed PubMed Central Google Scholar
Haber, S. N. & Knutson, B. The reward circuit: linking primate anatomy and human imaging. Neuropsychopharmacology 35, 4–26 (2009).
Article PubMed Central Google Scholar
Pessiglione, M. & Lebreton, M. From the reward circuit to the valuation system: how the brain motivates behavior. 1–421 (2015) https://doi.org/10.1007/978-1-4939-1236-0.
Fleming, S. M., Huijgen, J. & Dolan, R. J. Prefrontal contributions to metacognition in perceptual decision making. (2012) https://doi.org/10.1523/JNEUROSCI.6489-11.2012.
Vaccaro, A. G. & Fleming, S. M. Thinking about thinking: a coordinate-based meta-analysis of neuroimaging studies of metacognitive judgements. Brain Neurosci. Adv. 2, 239821281881059 (2018).
Article Google Scholar
De Martino, B., Fleming, S. M., Garrett, N. & Dolan, R. J. Confidence in value-based choice. Nat. Neurosci. 16, 105–110 (2013).
Article PubMed Google Scholar
Lebreton, M., Abitbol, R., Daunizeau, J. & Pessiglione, M. Automatic integration of confidence in the brain valuation signal. Nat. Neurosci. 18, 1159–1167 (2015).
Article CAS PubMed Google Scholar
Lopez-Persem, A. et al. Four core properties of the human brain valuation system demonstrated in intracranial signals. Nat. Neurosci. 23, 664–675 (2020).
Article CAS PubMed Google Scholar
Shapiro, A. D. & Grafton, S. T. Subjective value then confidence in human ventromedial prefrontal cortex. PLoS One 15, e0225617 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fleming, S. M. & Daw, N. D. Self-evaluation of decision-making: a general bayesian framework for metacognitive computation. Psychol. Rev. 124, 91–114 (2017).
Article PubMed PubMed Central Google Scholar
Pouget, A., Drugowitsch, J. & Kepecs, A. Confidence and certainty: distinct probabilistic quantities for different goals. Nat. Neurosci. 19, 366–374 (2016).
Article CAS PubMed PubMed Central Google Scholar
Abitbol, R. et al. Neural mechanisms underlying contextual dependency of subjective values: converging evidence from monkeys and humans. J. Neurosci. 35, 2308–2320 (2015).
Article CAS PubMed PubMed Central Google Scholar
Morales, J., Lau, H. & Fleming, S. M. Domain-general and domain-specific patterns of activity supporting metacognition in human prefrontal cortex. J. Neurosci. 38, 2360–17 (2018).
Article Google Scholar
Folke, T., Jacobsen, C., Fleming, S. M. & De Martino, B. Explicit representation of confidence informs future value-based decisions. Nat. Hum. Behav. 1, 0002 (2017).
Heilbron, M. & Meyniel, F. Confidence resets reveal hierarchical adaptive learning in humans. PLOS Comput. Biol. 15, e1006972 (2019).
Article CAS PubMed PubMed Central Google Scholar
Vinckier, F. et al. Confidence and psychosis: a neuro-computational account of contingency learning disruption by NMDA blockade. Mol. Psychiatry 21, 946–955 (2016).
Article CAS PubMed Google Scholar
Boldt, A., Blundell, C. & De Martino, B. Confidence modulates exploration and exploitation in value-based learning. Neurosci. Conscious. 2019, niz004 (2019).
Daw, N. D., Niv, Y. & Dayan, P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704–1711 (2005).
Article CAS PubMed Google Scholar
Donoso, M., Collins, A. G. E. & Koechlin, E. Human cognition. Foundations of human reasoning in the prefrontal cortex. Science 344, 1481–1486 (2014).
Article CAS PubMed Google Scholar
Giardini, F., Coricelli, G., Joffily, M. & Sirigu, A. Overconfidence in predictions as an effect of desirability bias. Adv. Decis. Mak. Under Risk Uncertain. 163–180 (2008).
Koellinger, P. & Treffers, T. Joy leads to overconfidence, and a simple countermeasure. PLoS One 10, 1–22 (2015).
Article Google Scholar
Massoni, S. Emotion as a boost to metacognition: how worry enhances the quality of confidence. Conscious Cogn. 29, 189–198 (2014).
Article PubMed Google Scholar
Allen, M. et al. Unexpected arousal modulates the influence of sensory noise on confidence. Elife 5, 1–17 (2016).
Article Google Scholar
Jönsson, F. U., Olsson, H. & Olsson, M. J. Odor emotionality affects the confidence in odor naming. Chem. Senses 30, 29–35 (2005).
Article PubMed Google Scholar
Kuhnen, C. M. & Knutson, B. The influence of affect on beliefs, preferences, and financial decisions. J. Financ. Quant. Anal. 46, 605–626 (2011).
Article Google Scholar
Lebreton, M. et al. Two sides of the same coin: Monetary incentives concurrently improve and bias confidence judgments. Sci. Adv. 4, eaaq0668 (2018).
Article PubMed PubMed Central Google Scholar
Lebreton, M., Bacily, K., Palminteri, S. & Engelmann, J. B. Contextual influence on confidence judgments in human reinforcement learning. PLoS Comput. Biol. 15, e1006973 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ting, C. C., Palminteri, S., Engelmann, J. B. & Lebreton, M. Robust valence-induced biases on motor response and confidence in human reinforcement learning. Cogn. Affect. Behav. Neurosci. 20, 1184–1199 (2020).
Article PubMed PubMed Central Google Scholar
De Martino, B., Bobadilla-Suarez, S., Nouguchi, T., Sharot, T. & Love, B. C. Social information is integrated into value and confidence judgments according to its reliability. J. Neurosci. 37, 6066–6074 (2017).
Article PubMed PubMed Central Google Scholar
Palminteri, S., Wyart, V. & Koechlin, E. The importance of falsification in computational cognitive modeling. Trends Cogn. Sci. 21, 425–433 (2017).
Article PubMed Google Scholar
Palminteri, S. & Pessiglione, M. Opponent brain systems for reward and punishment learning: causal evidence from drug and lesion studies in humans. Decis. Neurosci. An Integr. Perspect. 291–303 (2017).
Meyniel, F., Sigman, M. & Mainen, Z. F. Perspective confidence as bayesian probability: from neural origins to behavior. Neuron 88, 78–92 (2015).
Article CAS PubMed Google Scholar
Shekhar, M. & Rahnev, D. Distinguishing the roles of dorsolateral and anterior PFC in visual metacognition. J. Neurosci. 38, 5078–5087 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lebreton, M., Bavard, S., Daunizeau, J. & Palminteri, S. Assessing inter-individual differences with task-related functional neuroimaging. Nat. Hum. Behav. 3, 897–905 (2019).
Article PubMed Google Scholar
Molenberghs, P., Trautwein, F.-M., Böckler, A., Singer, T. & Kanske, P. Neural correlates of metacognitive ability and of feeling confident: a large-scale fMRI study. Soc. Cogn. Affect. Neurosci. 11, 1942–1951 (2016).
Article PubMed PubMed Central Google Scholar
Bang, D. & Fleming, S. M. Distinct encoding of decision confidence in human medial prefrontal cortex. Proc. Natl Acad. Sci. USA115, 6082–6087 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chase, H. W., Kumar, P., Eickhoff, S. B. & Dombrovski, A. Y. Reinforcement learning models and their neural correlates: an activation likelihood estimation meta-analysis. Cogn. Affect. Behav. Neurosci. 15, 435–459 (2015).
Article PubMed PubMed Central Google Scholar
Sepulveda, P. et al. Visual attention modulates the integration of goal-relevant evidence and not value. Elife 9, 1–58 (2020).
Article Google Scholar
Seymour, B., Maruyama, M. & De Martino, B. When is a loss a loss? Excitatory and inhibitory processes in loss-related decision-making. Curr. Opin. Behav. Sci. 5, 122–127 (2015).
Article Google Scholar
Admon, R. et al. Functional and structural neural indices of risk aversion in obsessive-compulsive disorder (OCD). Psychiatry Res. 203, 207–213 (2012).
Article PubMed Google Scholar
Choi, J., Shin, Y., Jung, W. H., Jang, J. H. & Kang, D. Altered brain activity during reward anticipation in pathological gambling and obsessive-compulsive disorder. PLoS One 7, 3–10 (2012).
Article Google Scholar
Clark, L., Boileau, I. & Zack, M. Neuroimaging of reward mechanisms in Gambling disorder: an integrative review. Mol. Psychiatry 24, 674–693 (2019).
Article PubMed Google Scholar
Koob, G. F. & Volkow, N. D. Neurobiology of addiction: a neurocircuitry analysis. Lancet Psychiatry 3, 760–773 (2016).
Article PubMed PubMed Central Google Scholar
Strauss, G. P., Waltz, J. A. & Gold, J. M. A review of reward processing and motivational impairment in schizophrenia. Schizophr. Bull. 40, S107–S116 (2014).
Article PubMed Google Scholar
Hoven, M. et al. Abnormalities of confidence in psychiatry: an overview and future perspectives. Transl. Psychiatry 9, 1–18 (2019).
Article Google Scholar
Hiser, J. & Koenigs, M. The multifaceted role of the ventromedial prefrontal cortex in emotion, decision making, social cognition, and psychopathology. Biol. Psychiatry 83, 638–647 (2018).
Article PubMed Google Scholar
Kepecs, A. & Mainen, Z. F. A computational framework for the study of confidence in humans and animals. Philos. Trans. R. Soc. B: Biol. Sci. 367, 1322–1337 (2012).
Article Google Scholar
Middlebrooks, P. G., Abzug, Z. M. & Sommer, M. A. Studying metacognitive processes at the single neuron level. in The Cognitive Neuroscience of Metacognition 225–244 (Springer-Verlag Berlin Heidelberg, 2013). https://doi.org/10.1007/978-3-642-45190-4_10.
Desender, K., Van Opstal, F., Hughes, G. & Van den Bussche, E. The temporal dynamics of metacognition: dissociating task-related activity from later metacognitive processes. Neuropsychologia 82, 54–64 (2016).
Article PubMed Google Scholar
Maniscalco, B. & Lau, H. A signal detection theoretic approach for estimating metacognitive sensitivity from confidence ratings. Conscious Cogn. 21, 422–430 (2012).
Article PubMed Google Scholar
Rouault, M., McWilliams, A., Allen, M. G. & Fleming, S. M. Human metacognition across domains: insights from individual differences and neuroimaging. Personal. Neurosci. 1, 1–13 (2018).
Article Google Scholar
Poser, B. A., Versluis, M. J., Hoogduin, J. M. & Norris, D. G. BOLD contrast sensitivity enhancement and artifact reduction with multiecho EPI: Parallel-acquired inhomogeneity-desensitized fMRI. Magn. Reson. Med. 55, 1227–1235 (2006).
Article PubMed Google Scholar
Mazaika, Whitfield-Gabrieli & Reiss. A. Artifact repair for fMRI data from high motion clinical subjects. NeuroImage 36:S142 (2007).
Yarkoni, T., Poldrack, R. A., Nichols, T. E., Van Essen, D. C. & Wager, T. D. Large-scale automated synthesis of human functional neuroimaging data. Nat. Methods 8, 665 (2011).
Article CAS PubMed PubMed Central Google Scholar
Fan, L. et al. The human brainnetome atlas: a new brain atlas based on connectional architecture. Cereb. Cortex 26, 3508–3526 (2016).
Article PubMed PubMed Central Google Scholar
Hoven, M. Data and codes for Hoven et al (2022). Commun. Biol. https://doi.org/10.6084/m9.figshare.19114406.v1 (2022).

Download references

Acknowledgements

Data collection for this work was funded by two independent personal Amsterdam Brain and Cognition (ABC) Talent grants to J.L. and R.v.H., and an NWO Veni Fellowship (grant 451-15-015) granted to M.L. M.L. is supported by a Swiss National Fund Ambizione Grant (PZ00P3_174127), J.L. is supported by an NWO VENI Fellowship grant (916-18-119).

Author information

These authors contributed equally: Ruth van Holst, Judy Luigjes, Maël Lebreton.

Authors and Affiliations

Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
Monja Hoven, Gina Brunner, Nina S. de Boer, Anna E. Goudriaan, Damiaan Denys, Ruth J. van Holst & Judy Luigjes
Institute of Neuroscience and Psychology, University of Glasgow, Glasgow, UK
Gina Brunner
Department of Philosophy, Radboud University, Nijmegen, The Netherlands
Nina S. de Boer
Arkin and Jellinek, Mental Health Care, Amsterdam, The Netherlands
Anna E. Goudriaan
Netherlands Institute for Neuroscience, an Institute of the Royal Netherlands Academy of Arts and Sciences, Amsterdam, The Netherlands
Damiaan Denys
Swiss Center for Affective Science, University of Geneva, Geneva, Switzerland
Maël Lebreton
Laboratory for Behavioral Neurology and Imaging of Cognition, Department of Fundamental Neurosciences, University of Geneva, Geneva, Switzerland
Maël Lebreton

Authors

Monja Hoven
View author publications
You can also search for this author in PubMed Google Scholar
Gina Brunner
View author publications
You can also search for this author in PubMed Google Scholar
Nina S. de Boer
View author publications
You can also search for this author in PubMed Google Scholar
Anna E. Goudriaan
View author publications
You can also search for this author in PubMed Google Scholar
Damiaan Denys
View author publications
You can also search for this author in PubMed Google Scholar
Ruth J. van Holst
View author publications
You can also search for this author in PubMed Google Scholar
Judy Luigjes
View author publications
You can also search for this author in PubMed Google Scholar
Maël Lebreton
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: R.J.v.H., J.L., M.L.; methodology: M.H., R.J.v.H., J.L., M.L.; data collection: M.H., N.s.d.B., G.B.; analyses: M.H., M.L.; writing original draft: M.H.; writing review & editing: M.H., G.B., N.s.d.B., A.G., D.D., R.vJ.H., J.L, M.L.; visualization: M.H., M.L.; Supervision: R.J.v.H., J.L., M.L.

Corresponding author

Correspondence to Monja Hoven.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Jacqueline Gottlieb, Karli Montague-Cardoso and George Inglis. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplemental Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hoven, M., Brunner, G., de Boer, N.S. et al. Motivational signals disrupt metacognitive signals in the human ventromedial prefrontal cortex. Commun Biol 5, 244 (2022). https://doi.org/10.1038/s42003-022-03197-z

Download citation

Received: 21 June 2021
Accepted: 24 February 2022
Published: 18 March 2022
DOI: https://doi.org/10.1038/s42003-022-03197-z

This article is cited by

Neural and computational underpinnings of biased confidence in human reinforcement learning
- Chih-Chung Ting
- Nahuel Salem-Garcia
- Maël Lebreton
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Behavioral results

fMRI results

BOLD signal in the VMPFC correlates significantly with early certainty and incentives but weakly with confidence

Accounting for incentive bias in confidence does not restore VMPFC confidence activations

BOLD signal in the VMPFC strongly correlates with the EV

BOLD signal in the VMPFC correlates better with EV than with other variables

Qualitative falsification of the EV model of VMPFC activity

Discussion

Methods

Participants

Decision-making and confidence judgment task

Study procedure

Behavioral measures

Early certainty

EV

Metacognitive sensitivity

fMRI acquisition and preprocessing

Statistics and reproducibility: behavioral analyses

Statistics and reproducibility: fMRI analyses

GLM1: neural signatures of certainty, incentive, and confidence

GLM2a: control for incentive bias 1

GLM2b: control for incentive bias 2

GLM3: neural signatures of EV

GLM4: control for incentive

GLM5: qualitative patterns of activations

Regions of interest

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links