Computational theory-driven studies of reinforcement learning and decision-making in addiction: what have we learned?
Introduction
Reinforcement learning and decision-making — collectively, ‘value-based decision-making’ [1] — are integral to adaptive behavior in everyday life. Value-based decision-making comprises a feedback loop whereby the values of candidate actions are learned and updated through experience, and used to guide behavior that maximizes utility (and minimizes disutility). Disruption in value-based decision-making is considered a key factor in the development and maintenance of addiction [2, 3, 4], across people with substance use disorders (SUD) [5] and laboratory animals exposed to drugs of abuse [6,7], but the specific contributing mechanisms remain unknown. Decision-making biases in addiction may be due to disruption in distinct components of learning, such as error encoding or value updating, or subjective preferences that are not readily observable in coarse behavioral performance measures. The nascent field of computational psychiatry applies formal models to understand the precise mechanisms (or ‘failure modes’) that give rise to pathological behavior in psychiatric conditions [8,9,10••]. While there is no consensus on what qualifies as computational psychiatry, here we take this term to mean a mathematically rigorous understanding of the latent drivers of behavior. Findings from theory-driven computational psychiatry [11] suggest models that focus on algorithmic processes of value-based decision-making (Box 1) are well-suited to identify the specific components of reinforcement learning and decision-making that characterize SUD. This is exciting as such mechanistic research can bridge the behavioral manifestations of SUD with underlying neurobiology, providing fertile ground for cross-species translation [12, 13, 14, 15, 16]. Computational theoretical models thus hold promise as tools to provide additional mechanistic insight into SUD diagnosis and prognosis, and to help guide personalized treatments based on the latent variables governing individual behavior.
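The feedback loop described above (values learned from experience through error encoding and value updating, then used to guide choice) can be made concrete with a minimal sketch. It assumes a simple delta-rule learner with softmax action selection on a two-option task. This is a common textbook formulation and is not necessarily the model used in any study reviewed here. The names `alpha` (learning rate), `beta` (inverse temperature), `reward_probs`, and all function names are illustrative assumptions.

```python
import math
import random


def softmax(values, beta):
    """Turn action values into choice probabilities.

    beta is an assumed inverse-temperature parameter: higher values make
    choices more deterministic with respect to the learned values.
    """
    peak = max(values)  # subtract the max for numerical stability
    weights = [math.exp(beta * (v - peak)) for v in values]
    total = sum(weights)
    return [w / total for w in weights]


def update_value(value, reward, alpha):
    """Delta rule: shift the value toward the reward by a fraction alpha.

    Returns the updated value and the prediction error that drove the update.
    """
    prediction_error = reward - value          # error encoding
    return value + alpha * prediction_error, prediction_error  # value updating


def simulate(reward_probs, alpha=0.3, beta=5.0, n_trials=200, seed=0):
    """Simulate one hypothetical agent on a task with binary rewards."""
    rng = random.Random(seed)
    values = [0.0] * len(reward_probs)
    choices = []
    for _ in range(n_trials):
        probs = softmax(values, beta)
        action = rng.choices(range(len(values)), weights=probs)[0]
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        values[action], _ = update_value(values[action], reward, alpha)
        choices.append(action)
    return values, choices


if __name__ == "__main__":
    final_values, choices = simulate([0.2, 0.8])
    print("learned values:", [round(v, 2) for v in final_values])
    print("share of choices for option 1:", sum(choices) / len(choices))
```

In a sketch of this kind, `alpha` and `beta` are the sort of latent variables that coarse performance measures do not reveal: two agents with similar overall accuracy could differ in how strongly prediction errors update values or in how consistently values drive choice.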
Here, we review recent theory-driven computational psychiatry studies of SUD primarily conducted with human subjects, highlighting the ways in which these studies have extended and refined our understanding of value-based decision-making processes in addiction. We focus on two key objectives of this work: to identify deviations from health (via case-control comparisons), and to map specific SUD symptoms and clinically relevant states onto specific model variables — the latter aimed at moving closer to understanding the most defining yet most elusive aspect of the disorder: its dynamic, cyclical course. We conclude by outlining two directions for future research. We propose that a holistic approach that expands the typical parameter space examined within the same individual, and the duration of observation, may better serve these critical objectives and significantly enhance the clinical impact of computational psychiatry for addiction applications.
Deviation from health as indication of psychopathology: diagnostic differences between addicted and healthy individuals
SUD is a chronic, relapsing disorder characterized by repeated periods of drug craving, intoxication, bingeing, and withdrawal [17]. Drug use is maintained despite harmful consequences. The reinforcing and addictive effects of drugs center on the brain’s reward (or ‘valuation’ [18]) circuit. At the core of this circuit lie the dopaminergic pathways originating from the midbrain (ventral tegmental area and substantia nigra) and projecting onto the striatum and prefrontal cortex (orbitofrontal
Capturing addiction dynamics: using computational models to understand within-person variability, symptom expression, prognosis, and treatment
Addiction is not static, and indeed, it can be said that understanding addiction’s longitudinal course is to understand addiction itself. The ‘addiction cycle’ has been described as having three stages: preoccupation-anticipation, bingeing-intoxication, and withdrawal-negative affect [22,62, 63, 64]. These stages are likely associated with distinct value-based processes. Although no research to date has identified the algorithmic mechanisms that underlie the transition between each stage,
Conclusion and future directions
Computational psychiatry has garnered considerable attention in recent years but enthusiasm for its presumed clinical utility is rightly tempered [82]. Here, we review the promise of this approach for addiction applications. While computationally informed studies have produced novel explanatory insights about value-based processes in addiction that help to refine long-held theoretical accounts, we also identified two directions for future research that could significantly enhance the clinical
Conflict of interest statement
Nothing declared.
References and recommended reading
Papers of particular interest, published within the period of review, have been highlighted as:
• of special interest
•• of outstanding interest
Acknowledgements
The authors acknowledge funding from the Brain and Behavior Research Foundation (BBRF NARSAD Grant #25387), Busch Biomedical Research Program, and NIH/NIDA (DA043676). Special thanks to the Addiction and Decision Neuroscience Laboratory members, Silvia Lopez-Guzman, and Paul W. Glimcher for helpful discussions.
References (82)
- et al. The role of learning-related dopamine signals in addiction vulnerability. Prog Brain Res (2014)
- et al. 21st century neurobehavioral theories of decision making in addiction: review and evaluation. Pharmacol Biochem Behav (2018)
- et al. Orbitofrontal cortex, decision-making and drug addiction. Trends Neurosci (2006)
- Individual decision-making in the causal pathway to addiction: contributions and limitations of rodent models. Pharmacol Biochem Behav (2018)
- et al. Computational psychiatry. Trends Cogn Sci (2012)
- et al. Computational psychiatry. Neuron (2014)
- et al. Translation of computational psychiatry in the context of addiction. JAMA Psychiatry (2020)
- et al. Beyond simple tests of value: measuring addiction as a heterogeneous disease of computation-specific valuation processes. Learn Mem (2018)
- et al. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron (2005)
- et al. A neural substrate of prediction and reward. Science (1997)
- Dysfunction of the prefrontal cortex in addiction: neuroimaging findings and clinical implications. Nat Rev Neurosci
- Chronic alcohol intake abolishes the relationship between dopamine synthesis capacity and learning signals in the ventral striatum. Eur J Neurosci
- Temporal difference error prediction signal dysregulation in cocaine dependence. Neuropsychopharmacology
- Steep delay discounting and addictive behavior: a meta-analysis of continuous associations. Addiction
- Drug addiction and its underlying neurobiological basis: neuroimaging evidence for the involvement of the frontal cortex. Am J Psychiatry
- Smoking decisions: altered reinforcement learning signals induced by nicotine state. Nicotine Tob Res
- A Bayesian observer model of drug craving. JAMA Psychiatry
- Dysregulation of decision making related to metabotropic glutamate 5, but not midbrain D3, receptor availability following cocaine self-administration in rats. Biol Psychiatry
- Realizing the clinical potential of computational psychiatry: report from the Banbury Center Meeting, February 2019. Biol Psychiatry
- A framework for studying the neurobiology of value-based decision making. Nat Rev Neurosci
- A unified framework for addiction: vulnerabilities in the decision process. Behav Brain Sci
- Addiction as a computational process gone awry. Science
- Computational psychiatry as a bridge from neuroscience to clinical applications. Nat Neurosci
- Theory-based computational psychiatry. Biol Psychiatry
- How preclinical models evolved to resemble the diagnostic criteria of drug addiction. Biol Psychiatry
- The neurobiology of impulsive decision-making and reinforcement learning in nonhuman animals. Curr Top Behav Neurosci
- Investigating the computational underpinnings of addiction. Neuropsychopharmacology
- Diagnostic and Statistical Manual of Mental Disorders
- The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage
- Blunted dopamine release as a biomarker for vulnerability for substance use disorders. Biol Psychiatry
- Neurobiologic advances from the brain disease model of addiction. N Engl J Med
- Disruption of reward processing in addiction: an image-based meta-analysis of functional magnetic resonance imaging studies. JAMA Psychiatry
- Role of the value circuit in addiction and addiction treatment
- Smokers’ brains compute, but ignore, a fictive error signal in a sequential investment task. Nat Neurosci
- Prefrontal cortex fails to learn from reward prediction errors in alcohol dependence. J Neurosci
- Reduced neural tracking of prediction error in substance-dependent individuals. Am J Psychiatry
- Abnormal brain activity during a reward and loss task in opiate-dependent patients receiving methadone maintenance therapy. Neuropsychopharmacology
- Probabilistic reward- and punishment-based learning in opioid addiction: experimental and computational data. Behav Brain Res
- Computational modelling reveals contrasting effects on reinforcement learning and cognitive flexibility in stimulant use disorder and obsessive-compulsive disorder: remediating effects of dopaminergic D2/3 receptor agents. Psychopharmacology (Berl)
- Chronic exposure to nicotine is associated with reduced reward-related activity in the striatum but not the midbrain. Biol Psychiatry
- Biases in the explore-exploit tradeoff in addictions: the role of avoidance of uncertainty. Neuropsychopharmacology
Cited by (31)
- The utility of a latent-cause framework for understanding addiction phenomena. Addiction Neuroscience (2024)
- The role of reinforcement learning in shaping the decision policy in methamphetamine use disorders. Journal of Choice Modelling (2024)
- Recent Opioid Use Impedes Range Adaptation in Reinforcement Learning in Human Addiction. Biological Psychiatry (2024)
- Adaptive Design Optimization as a Promising Tool for Reliable and Efficient Computational Fingerprinting. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging (2023)