Computational theory-driven studies of reinforcement learning and decision-making in addiction: what have we learned?

https://doi.org/10.1016/j.cobeha.2020.08.007Get rights and content

Highlights

  • Computational psychiatry holds promise for mechanistic discovery in addiction.

  • This approach captures latent factors driving behavioral differences from health.

  • Emerging support also for capturing variation defining addiction cycles and states.

  • Research needs to better account for the heterogeneous, dynamic nature of addiction.

  • Expanding the parameter space examined and duration of observation will be key.

Computational psychiatry provides a powerful new approach for linking the behavioral manifestations of addiction to their precise cognitive and neurobiological substrates. However, this emerging area of research is still limited in important ways. While research has identified features of reinforcement learning and decision-making in substance users that differ from health, less emphasis has been placed on capturing addiction cycles/states dynamically, within-person. In addition, the focus on few behavioral variables at a time has precluded more detailed consideration of related processes and heterogeneous clinical profiles. We propose that a longitudinal and multidimensional examination of value-based processes, a type of dynamic ‘computational fingerprint’, will provide a more complete understanding of addiction as well as aid in developing better tailored and timed interventions.

Introduction

Reinforcement learning and decision-making — collectively, ‘value-based decision-making’ [1] — are integral to adaptive behavior in everyday life. Value-based decision-making comprises a feedback loop whereby the values of candidate actions are learned and updated through experience, and used to guide behavior that maximizes utility (and minimizes disutility). Disruption in value-based decision-making is considered a key factor in the development and maintenance of addiction [2, 3, 4], across people with substance use disorders (SUD) [5] and laboratory animals exposed to drugs of abuse [6,7], but the specific contributing mechanisms remain unknown. Decision-making biases in addiction may be due to disruption in distinct components of learning, such as error encoding or value updating, or subjective preferences that are not readily observable in coarse behavioral performance measures. The nascent field of computational psychiatry applies formal models to understand the precise mechanisms (or ‘failure modes’) that give rise to pathological behavior in psychiatric conditions [8,9,10••]. While there is no consensus on what qualifies as computational psychiatry, here we take this term to mean a mathematically rigorous understanding of the latent drivers of behavior. Findings from theory-driven computational psychiatry [11] suggest models that focus on algorithmic processes of value-based decision-making (Box 1) are well-suited to identify the specific components of reinforcement learning and decision-making that characterize SUD. This is exciting as such mechanistic research can bridge the behavioral manifestations of SUD with underlying neurobiology, providing fertile ground for cross-species translation [12, 13, 14, 15, 16]. Computational theoretical models thus hold promise as tools to provide additional mechanistic insight into SUD diagnosis and prognosis, and to help guide personalized treatments based on the latent variables governing individual behavior.

Here, we review recent theory-driven computational psychiatry studies of SUD primarily conducted with human subjects, highlighting the ways in which these studies have extended and refined our understanding of value-based decision-making processes in addiction. We focus on two key objectives of this work: to identify deviations from health (via case-control comparisons), and to map specific SUD symptoms and clinically relevant states onto specific model variables — the latter aimed at moving closer to understanding the most defining yet most elusive aspect of the disorder: its dynamic, cyclical course. We conclude by outlining two directions for future research. We propose that a holistic approach that expands the typical parameter space examined within the same individual, and the duration of observation, may better serve these critical objectives and significantly enhance the clinical impact of computational psychiatry for addiction applications.

Section snippets

Deviation from health as indication of psychopathology: diagnostic differences between addicted and healthy individuals

SUD is a chronic, relapsing disorder characterized by repeated periods of drug craving, intoxication, bingeing, and withdrawal [17]. Drug use is maintained despite harmful consequences. The reinforcing and addictive effects of drugs center on the brain’s reward (or ‘valuation’ [18]) circuit. At the core of this circuit lie the dopaminergic pathways originating from the midbrain (ventral tegmental area and substantia nigra) and projecting onto the striatum and prefrontal cortex (orbitofrontal

Capturing addiction dynamics: using computational models to understand within-person variability, symptom expression, prognosis, and treatment

Addiction is not static, and indeed, it can be said that understanding addiction’s longitudinal course is to understand addiction itself. The ‘addiction cycle’ has been described as having three stages: preoccupation-anticipation, bingeing-intoxication, and withdrawal-negative affect [22,62, 63, 64]. These stages are likely associated with distinct value-based processes. Although no research to date has identified the algorithmic mechanisms that underlie the transition between each stage,

Conclusion and future directions

Computational psychiatry has garnered considerable attention in recent years but enthusiasm for its presumed clinical utility is rightly tempered [82]. Here, we review the promise of this approach for addiction applications. While computationally informed studies have produced novel explanatory insights about value-based processes in addiction that help to refine long-held theoretical accounts, we also identified two directions for future research that could significantly enhance the clinical

Conflict of interest statement

Nothing declared.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

The authors acknowledge funding from the Brain and Behavior Research Foundation (BBRF NARSAD Grant #25387), Busch Biomedical Research Program, and NIH/NIDA (DA043676). Special thanks to the Addiction and Decision Neuroscience Laboratory members, Silvia Lopez-Guzman, and Paul W. Glimcher for helpful discussions.

References (82)

  • R.Z. Goldstein et al.

    Dysfunction of the prefrontal cortex in addiction: neuroimaging findings and clinical implications

    Nat Rev Neurosci

    (2011)
  • L. Deserno et al.

    Chronic alcohol intake abolishes the relationship between dopamine synthesis capacity and learning signals in the ventral striatum

    Eur J Neurosci

    (2015)
  • E.J. Rose et al.

    Temporal difference error prediction signal dysregulation in cocaine dependence

    Neuropsychopharmacology

    (2014)
  • M. Amlung et al.

    Steep delay discounting and addictive behavior: a meta-analysis of continuous associations

    Addiction

    (2017)
  • R.Z. Goldstein et al.

    Drug addiction and its underlying neurobiological basis: neuroimaging evidence for the involvement of the frontal cortex

    Am J Psychiatry

    (2002)
  • T.E. Baker et al.

    Smoking decisions: altered reinforcement learning signals induced by nicotine state

    Nicotine Tob Res

    (2020)
  • X. Gu et al.

    A Bayesian observer model of drug craving

    JAMA Psychiatry

    (2017)
  • S.M. Groman et al.

    Dysregulation of decision making related to metabotropic glutamate 5, but not midbrain D3, receptor availability following cocaine self-administration in rats

    Biol Psychiatry

    (2020)
  • M. Browning et al.

    Realizing the clinical potential of computational psychiatry: report from the Banbury Center Meeting, February 2019

    Biol Psychiatry

    (2020)
  • A. Rangel et al.

    A framework for studying the neurobiology of value-based decision making

    Nat Rev Neurosci

    (2008)
  • A.D. Redish et al.

    A unified framework for addiction: vulnerabilities in the decision process

    Behav Brain Sci

    (2008)
  • A.D. Redish

    Addiction as a computational process gone awry

    Science

    (2004)
  • Q.J. Huys et al.

    Computational psychiatry as a bridge from neuroscience to clinical applications

    Nat Neurosci

    (2016)
  • T.V. Maia et al.

    Theory-based computational psychiatry

    Biol Psychiatry

    (2017)
  • A. Belin-Rauscent et al.

    How preclinical models evolved to resemble the diagnostic criteria of drug addiction

    Biol Psychiatry

    (2016)
  • S.M. Groman

    The neurobiology of impulsive decision-making and reinforcement learning in nonhuman animals

    Curr Top Behav Neurosci

    (2020)
  • S.M. Groman

    Investigating the computational underpinnings of addiction

    Neuropsychopharmacology

    (2019)
  • DSM5

    Diagnostic and Statistical Manual of Mental Disorders

    (2013)
  • O. Bartra et al.

    The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value

    Neuroimage

    (2013)
  • P. Trifilieff et al.

    Blunted dopamine release as a biomarker for vulnerability for substance use disorders

    Biol Psychiatry

    (2014)
  • N.D. Volkow et al.

    Neurobiologic advances from the brain disease model of addiction

    N Engl J Med

    (2016)
  • M. Luijten et al.

    Disruption of reward processing in addiction: an image-based meta-analysis of functional magnetic resonance imaging studies

    JAMA Psychiatry

    (2017)
  • A.B. Konova et al.

    Role of the value circuit in addiction and addiction treatment

  • P.H. Chiu et al.

    Smokers’ brains compute, but ignore, a fictive error signal in a sequential investment task

    Nat Neurosci

    (2008)
  • S.Q. Park et al.

    Prefrontal cortex fails to learn from reward prediction errors in alcohol dependence

    J Neurosci

    (2010)
  • J. Tanabe et al.

    Reduced neural tracking of prediction error in substance-dependent individuals

    Am J Psychiatry

    (2013)
  • V.B. Gradin et al.

    Abnormal brain activity during a reward and loss task in opiate-dependent patients receiving methadone maintenance therapy

    Neuropsychopharmacology

    (2014)
  • C.E. Myers et al.

    Probabilistic reward- and punishment-based learning in opioid addiction: experimental and computational data

    Behav Brain Res

    (2016)
  • J.W. Kanen et al.

    Computational modelling reveals contrasting effects on reinforcement learning and cognitive flexibility in stimulant use disorder and obsessive-compulsive disorder: remediating effects of dopaminergic D2/3 receptor agents

    Psychopharmacology (Berl)

    (2019)
  • E.J. Rose et al.

    Chronic exposure to nicotine is associated with reduced reward-related activity in the striatum but not the midbrain

    Biol Psychiatry

    (2012)
  • L.S. Morris et al.

    Biases in the explore-exploit tradeoff in addictions: the role of avoidance of uncertainty

    Neuropsychopharmacology

    (2016)
  • Cited by (31)

    View all citing articles on Scopus
    View full text