Chronic alcohol exposure disrupts top-down control over basal ganglia action selection to produce habits

Renteria, Rafael; Baltz, Emily T.; Gremel, Christina M.

doi:10.1038/s41467-017-02615-9

Article
Open access
Published: 15 January 2018

Nature Communications volume 9, Article number: 211 (2018)

Abstract

Addiction involves a predominance of habitual control mediated through action selection processes in dorsal striatum. Research has largely focused on neural mechanisms mediating a proposed progression from ventral to dorsal lateral striatal control in addiction. However, over-reliance on striatal habit processes may also arise from reduced cortical input to striatum, thereby disrupting executive control over action selection. Here, we identify novel mechanisms through which chronic intermittent ethanol exposure and withdrawal (CIE) disrupts top-down control over goal-directed action selection processes to produce habits. We find that CIE results in decreased excitability of orbital frontal cortex (OFC) excitatory circuits supporting goal-directed control, and, strikingly, selectively reduces OFC output to the direct output pathway in dorsal medial striatum. Increasing the activity of OFC circuits restores goal-directed control in CIE-exposed mice. Our findings show that habitual control in alcohol dependence can arise through disrupted communication from top-down, goal-directed processes onto basal ganglia pathways controlling action selection.

Introduction

A prominent hypothesis in the drug-abuse field is that addiction involves a transition from goal-directed to habitual control over drug-seeking and drug-taking behaviors^1,2,3,4. How this shift in behavioral control emerges and how it contributes to addiction is not clear. Research has largely focused on a shift in the underlying neural circuits controlling long-term drug-seeking and drug-taking behaviors^{4,5,6,7,8,9,10}. These studies have demonstrated that habitual drug-seeking depends on dorsal lateral striatum (DLS)^{6,7,8,10,11,12}. Investigations into the development of habitual control have emphasized a corresponding progression from ventral striatal to dorsal striatal control over drug-related behaviors^5,13,14. However, a wealth of research on the neurobiology of action control suggests action selection arises through competition between dorsal striatal subregions^{15,16,17,18,19,20}. In particular, both dorsal medial striatum (DMS) and DLS are concurrently capable of controlling the same action, but compete for goal-directed or habitual control over that action, respectively^17,19,20,21. Given this, it may be that an over-reliance on habits in drug dependence originates from a strengthening of DLS habitual processes, a disruption to DMS control over goal-directed processes, or both.

Recent reports on addicts have highlighted a hypothesis that habitual control comes to dominate decision-making as a consequence of an impaired goal-directed system^22,23. Goal-directed processes underlie decision-making²⁴, and drug dependence induces long-lasting deficits in goal-directed decision-making processes^22,23,25,26. For example, previous findings have reported that alcoholics show persistent disruptions in decision-making processes^27,28 and these disruptions likely contribute to relapse²⁹. Dysfunctional decision-making is most likely the result of dependence-induced changes in the structure and function of corresponding corticostriatal circuits^30,31. However, there is currently no mechanistic understanding as to whether dependence-induced disruption to cortical goal-directed processes directly results in habitual control. Consequently, despite broad interest in the mechanisms through which habits emerge in drug dependence, there is limited information on the contribution of drug dependence-induced changes to cortical function in habit formation.

To directly investigate whether drug dependence itself disrupts goal-directed control to result in a reliance on habitual decision-making, we used a well-validated and commonly used mouse model, chronic intermittent ethanol exposure and withdrawal (CIE), to produce ethanol dependence^{32,33,34,35,36,37}. In combination with a recently developed instrumental task for food reward, we find that prior CIE produces a long-lasting disruption to goal-directed processes and leaves mice reliant on habitual control. We then examined whether CIE induces long-lasting alterations in the function of one corticostriatal circuit known to control goal-directed actions^{19,21,38,39,40,41}, namely, the orbital frontal cortex (OFC) and its projections into the medial portion of the dorsal striatum (OFC-DMS). We find that prior CIE exposure decreases activity and output of corticostriatal circuits in a projection and cell-type specific manner, with selective reduction in glutamatergic transmission from OFC-DMS projections onto the direct, but not indirect, output pathway of the basal ganglia. Further, we show that increasing the activity of orbital circuits is sufficient to overcome the reliance on habits and restores goal-directed control in CIE-exposed mice. Together, our findings suggest that dependence-induced reliance on habitual control arises in part through disruption of goal-directed processes including top-down cortical communication onto a basal ganglia pathway controlling action selection.

Results

Induction of ethanol dependence disrupts decision-making

To investigate whether prior drug dependence results in a long-lasting disruption to decision-making processes, we utilized a well-validated CIE model to induce ethanol dependence in mice^{32,33,34,35,36,37}. Mice were exposed to periods of CIE or air (Air) vapor and subsequent withdrawal over a period of four weeks (Fig. 1a, three vapor cohorts, Air n = 15, CIE n = 19). Mice were placed in inhalation chambers and exposed to ethanol or air vapor for 16 h per day, 4 days per week. We did not give a loading dose of ethanol or a pretreatment of pyrazole³³ to avoid confounding effects of stress that can bias reliance on habitual control⁴², as well as to avoid pyrazole’s broad effects on neural activity including actions at the N-methyl-d-aspartate (NMDA) receptor⁴³. Even without pretreatment, our procedure produced mean blood ethanol concentrations of 34.7 ± 2.0 mM, similar to what has previously been reported^34,44. Seventy-two hours after the last CIE exposure, mice were food restricted to achieve 90% of their baseline weight for 2 days prior to instrumental lever press training for food pellets or sucrose.

Decision-making recruits parallel action strategies: goal-directed actions and habitual actions²⁴. If ethanol dependence does induce long-lasting changes to decision-making processes, it may be apparent in the disrupted use of goal-directed actions or a bias towards reliance on habits. We utilized an instrumental task we recently developed, where on the same day, the same mouse will shift between goal-directed and habitual control over food responding^19,21. In brief, mice were trained in two distinct contexts to press a lever in the same location for the same food outcome (food pellet or 20% sucrose). To predispose the use of habitual vs. goal-directed action control, mice were trained to lever press under random interval (RI) and random ratio (RR) schedules of reinforcement, respectively (Fig. 1b)^45,46,47. Trained under these schedules, Air mice and CIE mice acquired lever press behavior for food (Fig. 1c, d; Supplementary Fig. 1). Although visually, it appeared that CIE exposure increased response rate, a three-way repeated measures ANOVA (context × CIE exposure × training day) on response rate during training showed a main effect of training day (F_(8,112) = 30.61, p < 0.001), but no main effect of CIE exposure or interaction (Fs’<1.31). This suggests that while CIE exposure may have led to slightly increased response rates, all mice increased lever pressing in a similar manner across training.
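The difference between the two schedules can be illustrated with a toy simulation. The following is a minimal sketch, not the authors' task code: the function name `simulate_session`, the one-second time step, and the schedule values (a ratio of 20 presses, an interval of 60 s) are assumptions chosen for illustration only. It shows why a random ratio schedule ties reward rate closely to press rate, while a random interval schedule largely decouples the two.

```python
import random

def simulate_session(schedule, press_prob, param, seconds=36000, seed=0):
    """Count rewards earned in a simulated session (hypothetical model).

    schedule   -- "RR" (random ratio) or "RI" (random interval)
    press_prob -- probability that the animal presses in any given second
    param      -- mean presses per reward (RR) or mean seconds per reward (RI)
    """
    rng = random.Random(seed)
    rewards = 0
    armed = False  # RI only: a reward is waiting to be collected
    for _ in range(seconds):
        # RI: a reward becomes available with probability 1/param each second.
        if schedule == "RI" and not armed and rng.random() < 1.0 / param:
            armed = True
        if rng.random() < press_prob:  # the animal presses this second
            if schedule == "RR":
                # RR: every press pays off with probability 1/param.
                if rng.random() < 1.0 / param:
                    rewards += 1
            elif armed:
                # RI: the first press after arming collects the reward.
                rewards += 1
                armed = False
    return rewards

# Doubling the press rate roughly doubles rewards under RR but barely helps under RI.
rr_low, rr_high = simulate_session("RR", 0.2, 20), simulate_session("RR", 0.4, 20)
ri_low, ri_high = simulate_session("RI", 0.2, 60), simulate_session("RI", 0.4, 60)
print(rr_low, rr_high, ri_low, ri_high)
```

In this sketch the weak link between pressing and reward under the interval schedule is the property usually taken to favor habitual control, whereas the tight link under the ratio schedule favors goal-directed control.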

To assess whether an action is goal-directed or habitual, we examined the sensitivity of lever pressing to changes in expected outcome value using outcome devaluation procedures. Fifteen to 21 days after the last vapor exposure, and following lever press training, we subjected Air and CIE mice to sensory-specific satiation of the food outcome (food pellet or sucrose) previously produced by lever pressing (devalued state), or a control outcome (the remaining outcome) mice had previously experienced in their home-cage (valued state) (Fig. 1e). Each prefeeding period was followed by a brief 5-min test in each of the trained contexts, where we measured the number of non-reinforced lever presses made. A significant reduction in lever pressing in the devalued state compared to the valued state is indicative of goal-directed control, while similar pressing between states reflects habitual control⁴⁸.

Air mice readily shifted between using a goal-directed strategy in the previously RR trained context and control by more habitual processes in the previously RI trained context, while CIE mice showed a notable lack of goal-directed control in both RI and RR training contexts (Fig. 1f-g; Supplementary Fig. 1e). Since differences in response rates during acquisition and testing were observed within a group, as well as between Air and CIE exposure groups (Supplementary Figs. 1 and 3), lever presses were normalized to total presses made in each context during testing. This allows us to examine CIE effects on decision-making in the absence of differences in response rates. A three-way ANOVA on normalized lever pressing showed a significant three-way interaction (devaluation state × context × CIE exposure: F_(1,32) = 10.83, p = 0.002), a significant two-way interaction of devaluation state × context (F_(1,32) = 10.57, p = 0.003) and a main effect of devaluation state (F_(1,32) = 5.20, p = 0.03), but no other two-way interactions or main effects (Fs < 2.00). This suggests that CIE mice and Air mice show different sensitivity to outcome devaluation testing across RI and RR training contexts.
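The normalization described above can be sketched as follows. This is an illustrative reading of the text, assuming that "total presses made in each context during testing" means the sum of presses on the valued and devalued test days for that context; the function name `normalize_presses` and the press counts are hypothetical.

```python
def normalize_presses(valued, devalued):
    """Express valued- and devalued-day presses as fractions of their sum.

    Assumes the normalization total is valued + devalued presses in one
    context; equal pressing on both days gives 0.5 for each fraction.
    """
    total = valued + devalued
    if total == 0:
        raise ValueError("no lever presses recorded in this context")
    return valued / total, devalued / total

# Hypothetical mouse: 30 presses on the valued day, 10 on the devalued day.
valued_frac, devalued_frac = normalize_presses(30, 10)
print(valued_frac, devalued_frac)
```

Because both fractions sum to 1, a mouse that presses quickly and a mouse that presses slowly are placed on the same scale, which is the stated purpose of the normalization.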

To examine whether Air and CIE mice differentially distributed lever pressing across valued and devalued days, we performed one-sample t tests against 0.5 (equal lever pressing on valued and devalued days) on normalized lever press data. Air mice differentially distributed lever presses between valued and devalued days in the RR training context (t₁₄ = 5.95, p < 0.0001) but not in the RI context (t₁₄ = 0.86, p = 0.40) (Fig. 1e). In striking contrast to Air mice, CIE mice did not differentially distribute lever pressing between valuation states in either RI or RR trained contexts (ts < 0.60) (Fig. 1f). These findings confirm that Air mice reduced responding following outcome devaluation in the RR context but not in the RI context, and that CIE exposure resulted in lever pressing insensitive to outcome devaluation in either training context.
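As a worked illustration of the test used here, the t statistic for a one-sample test against 0.5 can be computed with the standard library alone. The values below are made up for the example and are not data from the study; `one_sample_t` is a hypothetical helper, and obtaining a p value would additionally require the t distribution (for instance from a statistics package).

```python
import math
import statistics

def one_sample_t(values, mu):
    """Return (t, degrees of freedom) for a one-sample t test against mu."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1)
    t = (mean - mu) / (sd / math.sqrt(n))
    return t, n - 1

# Hypothetical normalized valued-day fractions for three mice.
t_stat, df = one_sample_t([0.6, 0.7, 0.8], mu=0.5)
print(round(t_stat, 3), df)
```

A group of 15 mice gives 14 degrees of freedom, which matches the subscript reported for the Air group.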

We then used a devaluation index to assess whether an individual mouse shifted the degree to which lever pressing was goal-directed between contexts (Methods). We found that CIE exposure disrupts the within-subject shift in goal-directed control. We performed repeated measures ANOVA (context × CIE exposure) and found a significant interaction (F_(1,32) = 10.83, p = 0.002) and a main effect of context (F_(1,32) = 10.57, p = 0.003), but no effect of CIE exposure (F < 1.95). Although Air mice showed an increase in goal-directedness in the RR context compared to the RI context (Bonferroni-corrected p < 0.001), CIE mice showed similar levels of goal-directedness between contexts (p > 0.1) (Fig. 1g). One-sample t tests performed against a hypothetical devaluation index of 0 (equal pressing between valued and devalued states) confirmed significant goal-directed control in Air mice only in the RR context (t₁₄ = 5.95, p < 0.001), but not in the RI context (RI t = 0.86) or in CIE mice in either RI or RR contexts (ts < 0.60).
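The exact definition of the devaluation index is given in the Methods, which are not reproduced in this section. The sketch below assumes one common form, (valued − devalued) / (valued + devalued), which matches the description here that 0 means equal pressing and values nearer 1 mean stronger goal-directed control. Under this assumed form the index is a linear rescaling of the normalized valued fraction (index = 2 × fraction − 1), which would be consistent with the identical t statistics reported for the two analyses. The function name and press counts are hypothetical.

```python
def devaluation_index(valued, devalued):
    """Assumed index: (valued - devalued) / (valued + devalued).

    0 means equal pressing in both states (habitual); values approaching 1
    mean pressing is concentrated in the valued state (goal-directed).
    """
    total = valued + devalued
    if total == 0:
        raise ValueError("no lever presses recorded in this context")
    return (valued - devalued) / total

# Hypothetical mouse: goal-directed in one context, habitual in the other.
rr_index = devaluation_index(30, 10)   # pressing drops after devaluation
ri_index = devaluation_index(20, 20)   # pressing unchanged after devaluation
print(rr_index, ri_index)
```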

The lack of goal-directedness in CIE mice cannot be attributed to dependence-induced changes in outcome palatability or sensitivity to devaluation. A subset of Air mice and CIE mice underwent a post-test free feeding assay immediately following outcome devaluation testing. Air and CIE mice consumed similar amounts in prefeeding devaluation procedures as well as in post-test free feeding procedures (Supplementary Fig. 1d, g). Further, correlational analyses performed between the response rate during training and responding during testing suggest that the increased response rate observed in CIE mice did not contribute to the differences in the magnitude of subsequent goal-directed control (Supplementary Fig. 1h). Instead, the present findings suggest that prior CIE exposure results in a long-lasting deficit in decision-making processes, as reflected in the disruption of goal-directed control examined ~3 weeks after the last exposure to ethanol.

CIE exposure selectively alters orbitostriatal circuits

CIE exposure resulted in long-lasting changes to decision-making processes, suggesting that ethanol dependence induced changes in neural circuits controlling goal-directed and/or habitual actions. For example, CIE exposure may disrupt neural circuits supporting goal-directed control, or enhance control of neural circuits modulating habits. Previous research has shown long-term changes in the cortical activation of abstinent alcoholics³⁰. In particular, hypoactivation of OFC circuits correlated with impaired reward choice behavior in abstinence. Abstinent alcoholics were found to have an immediate reward bias, with BOLD signal in lateral OFC correlated to delayed reward choice²⁷. More recently, OFC activity has been shown to support goal-directed action control across species^{19,21,38,39,40,49}, and is an important regulator of the shift between goal-directed and habitual control. We recently showed that increases in excitatory transmission at OFC terminals in dorsal striatum (OFC-DMS) drive goal-directed control, with habitual control emerging from the attenuation of OFC-DMS transmission²¹.

Given the importance of OFC-DMS function for goal-directed control over actions, we hypothesized that ethanol dependence alters OFC function through changes in synaptic transmission onto DMS. Mice were exposed to CIE procedures and ex vivo whole-cell electrophysiological recordings were conducted 3–21 days after the last vapor exposure, corresponding to the time frame of acquisition and devaluation testing (Fig. 2a). First, we examined whether dependence alters intrinsic properties of OFC projection neurons. We observed a decrease in excitability of OFC projection neurons following CIE procedures (repeated measures ANOVA: interaction (CIE exposure × current) = F_{(10, 170)} = 5.23, p < 0.0001; main effect of current = F_{(10, 170)} = 27.82, p < 0.0001; main effect of CIE exposure = F _{(1, 17)} = 5.81, p = 0.03) (Fig. 2b, c, Supplementary Table 1; 3 vapor cohorts, Air n = 8, CIE n = 11) that was present across the 3–21-day range of testing (Supplementary Fig. 2b). In addition, we found that resting membrane potentials were hyperpolarized in CIE-exposed mice compared to Air controls (Supplementary Table 1). This suggests that ethanol dependence induces a long-lasting reduction in the excitability of OFC projection neurons that is present even after a significant period of abstinence.

We next examined whether ethanol dependence would result in changes to OFC-DMS transmission. OFC projection neurons synapse onto spiny projection neurons (SPNs) of both major basal ganglia output pathways in the DMS in similar proportions⁵⁰: SPNs of the direct pathway that express the dopamine-type 1 receptor (D1 SPNs) and SPNs of the indirect pathway that express the dopamine-type 2 receptor (D2 SPNs)⁵¹. Direct and indirect basal ganglia pathways are thought to coordinate activity to support action selection and performance⁵². We hypothesized that OFC-DMS transmission onto direct and indirect pathways may be altered by ethanol dependence. To investigate OFC-DMS circuits in a projection and cell-type-specific manner, we utilized a viral approach in transgenic mice to selectively examine OFC transmission onto D1 or D2 SPNs. To target direct and indirect pathway SPNs, we utilized multiple transgenic lines to ensure the reproducibility of our findings. B6.FVB(Cg)-Tg(Drd1-cre)EY266Gsat/Mmucd (D1-Cre) and B6.Cg-Tg(Drd1a-tdTomato)6Calak/J (D1-tdTomato) transgenic mice were used to target the direct pathway (D1 SPNs), while B6.FVB(Cg)-Tg(Adora2a-cre)KG139Gsat/Mmucd (A2A-Cre) and non-labeled SPNs from D1-tdTomato transgenic lines were used to label the indirect pathway (D2 SPNs). No differences were observed between transgenic lines so results were combined. All mice were injected with AAV5-CamKIIa-GFP-Cre and a Cre-dependent channelrhodopsin (AAV5-Ef1a-DIO-ChR2-eYFP) (UNC viral vector core) in the OFC to limit channelrhodopsin expression to CamKIIa-expressing neurons (Fig. 2d, e)^19,21. D1-Cre and A2A-Cre mice were also injected with AAV5-hSyn-DIO-mCherry targeted to the DMS to label SPN populations. One to 3 weeks after surgery, mice underwent CIE procedures (Fig. 2a). Following acute withdrawal after the last vapor exposure (3–21 days), whole-cell patch-clamp recordings of identified D1 or D2 SPNs were made and transmission in response to light activation of OFC terminals was examined.

We first used paired pulse ratio (PPR) to examine whether CIE procedures altered the probability of neurotransmitter release from OFC terminals onto D1 SPNs. In Air mice, we found a high probability of neurotransmitter release at the OFC input onto D1 SPNs, as indicated by a paired pulse depression (PPD) (Fig. 2f; Supplementary Fig. 2c). In stark contrast, recordings made from CIE mice showed significant paired pulse facilitation (PPF), revealing a decrease in probability of neurotransmitter release from OFC terminals onto D1 SPNs. A direct comparison between Air and CIE mice using a two-way ANOVA (CIE exposure × interstimulus interval (ISI)) showed a significant interaction (F_{(4, 68)} = 3.53, p = 0.01), and main effect of CIE exposure (F_{(1, 17)} = 16.96, p = 0.0007) (Bonferroni-corrected, ****p < 0.0001 vs. Air; **p < 0.0001 vs. Air; *p < 0.05 vs. Air) (Fig. 2f, three vapor cohorts, Air n = 7, CIE n = 15). To further investigate the observed decrease in neurotransmitter release at OFC-DMS terminals, we replaced Ca²⁺ with strontium (Sr²⁺) in the recording solution and optically stimulated the OFC input. The use of Sr²⁺ in the recording solution has previously been used to examine asynchronous release in an input specific manner, with a decrease in release reflecting a decrease in release probability^53,54,55. In CIE mice, we observed a significant decrease in frequency of asynchronous release onto D1 SPNs compared to that observed in Air mice (Student’s t test, t₁₈ = 3.13, p < 0.01) with no change in amplitude (Student’s t test, t₁₈ = 0.20, p = 0.84) (Fig. 2g–i, Air, n = 8, CIE, n = 12). This selective decrease in asynchronous release onto the direct pathway was also apparent across the full range of testing (Supplementary Fig. 2d), suggesting a long-lasting decrease in OFC transmission selectively onto the direct pathway.
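For readers less familiar with the measure, the paired pulse ratio is conventionally the amplitude of the second evoked response divided by the amplitude of the first. A ratio below 1 (depression) is typically read as high initial release probability and a ratio above 1 (facilitation) as low initial release probability. The sketch below is a generic illustration of that convention, not the authors' analysis code; the function names and the amplitudes (in pA) are hypothetical.

```python
def paired_pulse_ratio(first_amp, second_amp):
    """PPR = second response amplitude / first response amplitude."""
    if first_amp == 0:
        raise ValueError("first response amplitude must be non-zero")
    return second_amp / first_amp

def classify_ppr(ppr):
    """Label a ratio as depression (< 1), facilitation (> 1), or no change."""
    if ppr < 1:
        return "depression"
    if ppr > 1:
        return "facilitation"
    return "no change"

# Hypothetical EPSC amplitudes (pA) at a single interstimulus interval.
air_like = paired_pulse_ratio(200.0, 150.0)   # second response is smaller
cie_like = paired_pulse_ratio(100.0, 130.0)   # second response is larger
print(air_like, classify_ppr(air_like))
print(cie_like, classify_ppr(cie_like))
```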

We next examined whether the induction of ethanol dependence would alter OFC input onto the indirect pathway. Similar to OFC transmission onto D1 SPNs in Air mice, we observed PPD of OFC transmission onto D2 SPNs in Air control mice. However, the PPD of OFC transmission onto D2 SPNs was still present in CIE mice (two-way ANOVA of CIE exposure × ISI: interaction and main effects Fs’ < 1.0) (Fig. 2j; three vapor cohorts, Air n = 7, CIE n = 9). When we examined asynchronous release at OFC terminals onto D2 SPNs, we found no differences between Air and CIE mice in the presence of Sr²⁺ (frequency: Student’s t test, p = 0.96; amplitude: Student’s t test, p = 0.26) (Fig. 2k–m, Air, n = 7, CIE, n = 7). Together these results show that prior ethanol dependence induced long-lasting decreases in OFC neurotransmitter release into DMS in a cell-type-specific manner, selectively affecting transmission onto the direct but not indirect output pathway of the basal ganglia.

In addition, the decrease in neurotransmitter release was at least partially selective to the OFC input. When we used electrical stimulation to examine all excitatory input onto either D1 or D2 SPNs (Fig. 3a, b), we found no differences in PPR (no interactions (ps > 0.05), D1 SPNs main effect of ISI: F_{(4, 40)} = 37.69, p < 0.0001; D2 SPNs main effect of ISI: F_{(4, 40)} = 16.84, p < 0.0001) (Fig. 3c, d). Furthermore, we found no differences between Air and CIE mice in spontaneous EPSC (sEPSC) frequency (ps > 0.05) or amplitude in either D1 SPNs (Fig. 3e–g) or D2 SPNs (Fig. 3h–j) (ps > 0.05), again suggesting a disruption at least partially selective to OFC-DMS input. Together, our findings suggest that chronic ethanol dependence induces long-lasting decreases in the excitability and output of a pathway known to control goal-directed actions^19,21, onto a pathway known to support action selection and performance^16,52. Intriguingly, these changes are mediated through selective changes in OFC transmission onto the direct pathway.

OFC activation restores goal-directed control following CIE

While our ex vivo results suggest that the deficits in goal-directed behavior may be in part due to reduced OFC excitability and synaptic transmission into DMS, we do not know whether the observed changes directly biased decision-making. To examine this, we took a chemogenetic approach⁵⁶ to selectively increase OFC projection neuron activity in CIE mice during outcome devaluation testing (Fig. 4a, b). We injected an activating DREADD (AAV5-hSyn-DIO-hM3Dq-mCherry) into the OFC of B6.129S2-Emx1tm1(cre)Krj/J (Emx1-Cre) mice, thereby restricting expression to OFC projection neurons. A subset of Air and CIE mice were injected with AAV5-hSyn-DIO-mCherry (DIO-mCherry) to control for any effects of surgery, AAV infection, and CNO administration. Post-recovery, mice were subjected to CIE exposure procedures, followed by instrumental training and outcome devaluation testing (Fig. 4a, 3 vapor cohorts, groups: Air n = 16, CIE control n = 14, CIE H3 n = 19). To confirm the function of our manipulation, we conducted whole-cell current clamp recordings in identified OFC projection neurons expressing mCherry from infusions of AAV5-hSyn-DIO-hM3Dq-mCherry (Fig. 4b). Bath application of CNO (10 µM) resulted in a significant increase in excitability (two-way repeated measures ANOVA (current × CNO), interaction: F_{(14, 70)} = 7.52, p < 0.0001; main effect of CNO: F_{(1, 5)} = 13.95, p = 0.01) (Fig. 4c, n = 6).

Following CIE procedures, all mice underwent instrumental lever press training for food (Supplementary Fig. 3). Prior to outcome devaluation testing, all mice were given pretreatments of saline or CNO (1 mg/kg, 10 ml/kg). We used virus and drug treatment controls in each Air and CIE control group and did not see differences between controls injected with saline or CNO; therefore, we collapsed across controls for ease of presentation. While Air mice showed more goal-directedness in the RR vs. RI context (albeit to a lesser degree), and CIE mice showed little sensitivity to outcome devaluation and were habitual in both contexts, CIE H3 mice showed goal-directed control in both RI and RR contexts. This was supported by a three-way repeated measures ANOVA performed on normalized lever presses (devaluation state × context × group) that did not show a significant three-way interaction (F_(2,46) = 1.29, p = 0.28), but did show a significant two-way interaction between context and group (F_(2,46) > 15.0, p < 0.001), suggesting that Air, CIE, and CIE H3 groups showed different patterns of lever pressing in RI and RR training contexts (Fig. 4d, Supplementary Fig. 3f). Main effects of context (F_(1,46) > 15, p < 0.001) and devaluation state (F_(1,46) = 14.24, p < 0.001) were also observed, showing that on average, lever pressing differed between RI and RR training contexts as well as between valued and devalued states.

The finding of different patterns of lever pressing between groups was further supported by one-sample t tests against 0.5 conducted on normalized lever pressing. While Air mice differentially distributed lever pressing only in the RR context (albeit slightly) (t₁₅ = 2.24, p < 0.05) and not in the RI context (t₁₅ = 1.12, p = 0.28), CIE mice did not differentially distribute lever pressing between valuation states in either context (one-sample t test, RI: t₁₃ = 0.96; RR: t₁₃ = 0.84). CNO administration to CIE mice expressing the activating DREADD in OFC projection neurons (CIE H3) restored goal-directed control in the RR context (one-sample t test, t₁₈ = 3.90, p < 0.01) and resulted in goal-directed control in the RI context (one-sample t test, t₁₈ = 4.85, p < 0.001) (Fig. 4d).

This pattern of H3 activation in CIE producing goal-directed control in otherwise habitual mice was again observed when we examined the within-subject shift in goal-directedness using the devaluation index (Fig. 4e). CIE H3 mice displayed greater goal-directed control with index values closer to 1 in both contexts, while Air mice showed slight goal-directed control in the RR, but not RI context. CIE mice showed little goal-directed control in either context (values closer to 0). These findings were supported by a two-way repeated measures ANOVA that examined whether mice differentially shifted action control between contexts. While there was not a significant interaction (context × group; F < 0.6) or main effect of context (F < 0.66), it did reveal a main effect of group (F_(2,44) = 3.21, p = 0.04), confirming that in general, the groups showed different magnitudes of goal-directedness. To examine if mice showed significant goal-directed control (closer to 1) vs. habitual control (closer to 0), we performed one-sample t tests against a devaluation index of 0. While Air mice tended to show less goal-directed control overall, there was a trend toward greater goal-directedness in the RR context (t₁₃ = 1.82, p = 0.09) but not in the RI context (t₁₃ = 0.39, p = 0.70). CIE mice did not show significant goal-directedness in either training context (ts < 0.96, ps > 0.3). In contrast, H3 activation in CIE H3 mice led to significant goal-directedness in the RI (t₁₈ = 4.85, p < 0.001) and RR (t₁₈ = 3.91, p < 0.01) contexts. Our results show that increasing OFC projection neuron activity alone is sufficient to restore goal-directed control in ethanol-dependent mice. This suggests that changes in OFC-DMS activity do contribute to the disruption in goal-directed decision-making. Importantly, increasing OFC activity was sufficient to overcome any other neural circuit change that may be predisposing habitual control following the induction of dependence.

Discussion

The data presented in this study uncover neural mechanisms through which chronic ethanol exposure and withdrawal disrupts decision-making and results in a predominance of habitual control. Our results show that long-lasting dependence-induced changes in the function of goal-directed circuits contribute strongly to the reliance on habitual control. By targeting our investigation in a cell-type and projection-specific manner, we identified dependence-induced changes in OFC excitability and OFC transmission onto the direct, but not indirect, output pathway of the basal ganglia known to contribute to goal-directed control.

To avoid the confound of extended training on the emergence of habits present in long-term drug self-administration and oral consumption experiments, we employed the widely used and well-validated CIE procedure to model ethanol dependence. Therefore, we were able to examine the direct effect of chronic ethanol exposure and withdrawal on the subsequent ability to use decision-making circuits. We followed CIE procedures with a recently developed instrumental training paradigm where we can examine the shift between goal-directed and habitual action control in the same mouse, on the same day^19,21. In multiple experiments, each with replicating cohorts, we found that CIE exposure disrupts the ability to shift and use goal-directed action strategies indexed by lever pressing behaviors (Figs. 1 and 4). This is in line with previous observations that drug-exposure itself may bias habitual control^57,58,59. Thus it appears that the direct effects of CIE exposure and repeated withdrawal are sufficient to disrupt decision-making processes, in the absence of extended drug-self-administration training.

By examining changes ex vivo in corticostriatal circuits post-dependence during the time course that corresponds to action learning and performance, we identified long-lasting dependence-induced changes in OFC excitability and OFC-DMS transmission (Fig. 2). Our data suggest these changes do contribute to the loss of goal-directed control, since increasing OFC activity via activation of G_q-coupled hM3D receptors in OFC projection neurons in CIE mice specifically during outcome devaluation testing restored goal-directed control (Fig. 4). However, we did not dissociate whether the effects of OFC activation arose from changes in excitability or from changes in transmission. Further, our data do not address which aspect of goal-directed control is disrupted, such as updating value vs. using value, or effects on contingency control. It is highly unlikely that OFC and OFC-DMS circuits are the sole decision-making circuit altered following dependence³. Further investigation is needed into potential alterations in corticostriatal circuits and their contribution to the reliance on habits.

The induction of ethanol dependence resulted in fairly long-lasting changes in OFC circuit function. We observed a decrease in OFC excitability and synaptic transmission that persisted for up to 21 days after the last vapor exposure while the devaluation test was conducted 15–21 days in withdrawal (Fig. 2; Supplementary Fig. 2). This differs from a previous report that CIE exposure results in a shorter term hyperexcitability of OFC neurons observed 3–10 days after exposure⁶⁰. Here, we used the same CIE procedure and obtained similar BECs but did not use pyrazole, suggesting that the discrepancy in OFC excitability may be due to off-target actions of pyrazole⁴³. However, our findings are in line with a recent study on long-term, heavy drinking macaques that found a similar decrease in OFC neuron excitability⁶¹. In the present study, the reduced OFC excitability following dependence was accompanied by a decrease in OFC synaptic transmission selectively onto the direct, but not indirect, output pathway of the basal ganglia. The observed changes in OFC function and transmission likely contributed to both the development and reliance on habitual control over actions, as these long-term changes were still present at a time point that corresponded to devaluation testing.

Limited work has been done examining effects of chronic ethanol consumption or exposure on glutamatergic activity in dorsal striatum³. Here we found a selective reduction in the probability of OFC transmission onto D1 SPNs of the direct pathway following the induction of ethanol dependence in the medial portion of the dorsal striatum (Fig. 2). When we probed transmission onto D1 or D2 SPNs from all excitatory inputs via intra-striatal electrical stimulation, we did not find an effect of dependence, suggesting that dependence selectively changes OFC-mediated inputs to D1 SPNs (Fig. 3). Our finding of intact transmission at OFC-D2 SPNs suggests that indirect pathway function alone is insufficient for DMS-dependent goal-directed behavior. Previous reports have implicated functional changes in D1 SPNs of the upper DMS following chronic alcohol consumption⁶², with a recent report of selective potentiation of NMDA-mediated synaptic activity in these D1 SPNs during acute withdrawal from chronic alcohol consumption⁶³. In contrast, recordings made from SPNs in the DLS (putamen) of very long-term ethanol drinking non-human primates 28 days into abstinence did find increases in the frequency, but not amplitude, of glutamatergic mEPSCs⁶⁴. Given these findings, the consistency of chronic alcohol consumption and exposure effects on glutamatergic activity across striatal subregions and SPN subtypes deserves further investigation.

Our data do suggest at least a partial selectivity of dependence effects on transmission in the medial striatum, where we observed altered OFC output. Given the similar proportion of OFC inputs onto D1 and D2 SPNs⁵⁰, the lack of changes in OFC transmission onto the indirect pathway is intriguing since the same OFC neuron may send collaterals to both D1 and D2 SPNs. This raises an additional hypothesis that the cell-type specificity of the ethanol-induced decrease in OFC transmission is post-synaptically mediated in a retrograde manner; one potential target being post-synaptic endocannabinoid release and retrograde activation of cannabinoid type-1 receptors located on OFC terminals²¹.

Furthermore, it is unclear how the selective reduction in OFC-D1 SPN transmission would alter direct pathway function and output. Given the convergence of numerous associative cortical inputs onto the same SPN within this DMS region⁶⁵, the relative importance of a decrease in excitatory drive from a portion of inputs for overall D1 SPN function and output during action selection is unknown. From previous work, we do know that goal-directed learning differentially alters plasticity of D1 and D2 SPNs in DMS, increasing the AMPA/NMDA ratio in D1 but not D2 SPNs⁶⁶. This suggests that the reduced glutamatergic input from OFC could affect the D1 SPN plasticity that may be necessary to sustain goal-directed control. Interestingly, previous work has found that suppression of habitual control is accompanied by decreased output of the direct pathway in DLS⁶⁷, suggesting that direct pathway activity in DLS or DMS is necessary for habitual and goal-directed behavior, respectively. Our finding that CIE results in a selective disruption to corticostriatal circuits in a projection and cell-type-specific manner emphasizes the need for highly specific circuit interrogation in the examination of disordered action selection in disease states.

The present findings highlight the effect drug dependence has on cortical-based decision-making. Although the focus has largely been on bottom-up processes underlying the transition from goal-directed to habitual control²,³,¹⁴, the present findings highlight the contribution of top-down processes to this biased use of habitual processes. Alteration of OFC function has frequently been observed in drug dependence²⁵,²⁹,³¹,⁶⁸, and in particular, ethanol dependence alters OFC function and alcohol-related behaviors³⁶,⁶⁹,⁷⁰. Reducing OFC activity increased quinine-insensitive alcohol drinking in ethanol-dependent mice³⁶, further suggesting that reduced OFC control in dependence escalates habitual-like behaviors. Here we found that increasing the activity of OFC neurons was sufficient to restore goal-directed control and overcome the bias towards habitual action control (Fig. 4). This non-physiological increase in OFC activity was neither temporally nor spatially specific, as we aimed to counter the observed decrease in OFC excitability as well as reduced OFC-DMS transmission. Therefore, the effectiveness of increasing OFC activity in restoring goal-directed control suggests OFC is a viable area to target in the treatment of drug dependence.

Methods

Mice

All experiments were approved by the Institutional Animal Care and Use Committees of the University of California San Diego. C57BL/6J and B6.129S2-Emx1^tm1(cre)Krj/J (Emx1-Cre) mice⁷¹ were used for behavioral experiments. B6.FVB(Cg)-Tg(Drd1-cre)EY266Gsat/Mmucd (D1-Cre), B6.FVB(Cg)-Tg(Adora2a-cre)KG139Gsat/Mmucd (A2a-Cre)⁷², and B6.Cg-Tg(Drd1a-tdTomato)6Calak/J (D1-tdTomato)⁷³ were used for electrophysiological recordings. All mouse lines were obtained from Jackson Laboratory or Mutant Mouse Resource and Research Center (MMRRC) and bred with C57BL/6J mice (Jackson Laboratory) for one generation, in-house. Adult (>8 weeks) male and female mice were housed in groups of one to four, with mouse chow and water ad libitum unless stated otherwise, and were kept on a 14 h light/10 h dark cycle.

Viral injections

Mice were anesthetized with isoflurane and were given stereotaxically guided injections into the OFC (coordinates from Bregma in mm: anterior [A], 2.70; medial [M], ±1.65; ventral [V], 2.65). Emx1-Cre and C57BL/6J mice used for behavioral experiments were injected with 200 nl of AAV5-hSyn-DIO-hM3D-mCherry or AAV5-hSyn-DIO-mCherry in the OFC. D1-Cre, A2a-Cre and D1-tdTomato mice were used for patch-clamp recordings and were coinjected with 100 nl AAV5-CamKIIa-GFP-Cre and 100 nl AAV5-Ef1a-DIO-ChR2-eYFP in the OFC. To identify D1 and D2 SPNs, D1-Cre and A2a-Cre mice were also injected with 200 nl AAV5-hSyn-DIO-mCherry in the DMS ([A], 0.5; [M], ±1.5; [V], 3.25). Viral spread was assessed by imaging the extent of fluorescence in brain slices (Olympus MVX10).

Chronic intermittent ethanol exposure and repeated withdrawal

Multiple cohorts of mice were exposed to four rounds of ethanol vapor or air (behavioral experiments cohort n = 3, ex vivo experiments cohort n = 4, and for OFC activation cohort n = 3). Each round consisted of 16 h of vapor exposure followed by an 8 h withdrawal, repeated for 4 consecutive days. Ethanol was volatilized by bubbling air through a flask containing 95% ethanol at a rate of 2–3 l/min. The resulting ethanol vapor was combined with a separate air stream to give a total flow rate of approximately 10 l/min, which was delivered to the mice housed in Plexiglas chambers (Plas Labs Inc). Blood ethanol concentrations were collected at the end of each round from sentinel mice (mean BEC = 34.7 ± 2.0 mM). No pyrazole or loading ethanol injections were given prior to placement in vapor chambers.
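The exposure parameters above lend themselves to a short arithmetic check. The following is a minimal sketch, not part of the original analysis: it converts the reported mean BEC from mM to mg/dl using the molar mass of ethanol (46.07 g/mol), totals the hours of vapor exposure, and gives the share of the delivered flow that passed through the ethanol flask. The function names are hypothetical and chosen for illustration only.

```python
ETHANOL_MOLAR_MASS_G_PER_MOL = 46.07  # C2H5OH


def bec_mm_to_mg_per_dl(bec_mm):
    """Convert a blood ethanol concentration from mM to mg/dl."""
    mg_per_l = bec_mm * ETHANOL_MOLAR_MASS_G_PER_MOL  # mmol/l * mg/mmol = mg/l
    return mg_per_l / 10.0  # 10 dl per litre


def total_vapor_hours(rounds=4, days_per_round=4, hours_per_day=16):
    """Total hours in vapor: rounds x consecutive days per round x hours per day."""
    return rounds * days_per_round * hours_per_day


def ethanol_stream_fraction(ethanol_flow_l_min, total_flow_l_min=10.0):
    """Fraction of the delivered flow that was bubbled through the ethanol flask."""
    return ethanol_flow_l_min / total_flow_l_min


if __name__ == "__main__":
    print(f"Mean BEC: {bec_mm_to_mg_per_dl(34.7):.1f} mg/dl")
    print(f"Total vapor exposure: {total_vapor_hours()} h")
    print(f"Ethanol stream share: {ethanol_stream_fraction(2.0):.0%} to "
          f"{ethanol_stream_fraction(3.0):.0%} of total flow")
```

Under these values, the reported mean BEC of 34.7 mM corresponds to roughly 160 mg/dl, and four rounds of four 16 h exposures total 256 h in vapor.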

Operant training

Training was conducted as previously described¹⁹,²¹. Two days prior to training, mice were food restricted and maintained at ~90% of their baseline body weight throughout training and testing. Mice were placed in sound attenuating operant boxes (Med-Associates) and were trained to press a single lever (left or right) for a food reinforcer (chow pellet or 20% sucrose solution). Each mouse was trained in two contexts daily, differentiated by the presence of clear Plexiglas side-walls or black and white striped Plexiglas side-walls. In each context, mice were first trained to retrieve the outcome, in the absence of levers, under a random time schedule (RT60) in which the outcome was delivered on average every 60 s. Mice were then trained on a continuous reinforcement schedule (CRF) in each context, in which each lever press produced a single outcome; the maximum number of reinforcers that could be earned across the 3 CRF sessions was 5, 15, and 30, respectively. Following CRF training, mice were trained under an RI schedule in one context and an RR schedule in the remaining context. Mice received 2 days of training in RI30 (the first lever press after an average 30 s produces the outcome) and RR10 (on average the 10th lever press produces the outcome), followed by four days under RI60 and RR20. Sessions ended in each context after 15 reinforcers were earned or after 60 min had elapsed. Each day 1–4 h following schedule training, mice had 1 h exposure to a separate outcome (20% sucrose solution or food pellets) in their home cage.
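The difference between the two schedules can be illustrated with a small simulation. This is a sketch under stated assumptions, not the Med-Associates program used in the study: the random ratio rule is modeled as reinforcing each press with probability 1/N, and the random interval rule as drawing an exponentially distributed waiting time (mean T seconds) after which the next press is reinforced. Both function names and the exponential timing are assumptions made for illustration.

```python
import random


def random_ratio_press(mean_ratio, rng):
    """Random ratio (RR): each press is reinforced with probability 1/mean_ratio."""
    return rng.random() < 1.0 / mean_ratio


def random_interval_session(press_times_s, mean_interval_s, rng, max_reinforcers=None):
    """Random interval (RI): the first press after a random wait is reinforced.

    Waiting times are drawn from an exponential distribution with the given
    mean (an assumption; the source only states the average interval).
    Returns the times of reinforced presses.
    """
    reinforced = []
    available_at = rng.expovariate(1.0 / mean_interval_s)
    for t in sorted(press_times_s):
        if max_reinforcers is not None and len(reinforced) >= max_reinforcers:
            break  # session cap reached, e.g. 15 reinforcers
        if t >= available_at:
            reinforced.append(t)
            # Start a new random wait from the moment of reinforcement.
            available_at = t + rng.expovariate(1.0 / mean_interval_s)
    return reinforced


if __name__ == "__main__":
    rng = random.Random(0)
    presses = [float(s) for s in range(1, 3601)]  # hypothetical: one press per second for 60 min
    rr20 = sum(random_ratio_press(20, rng) for _ in presses)
    ri60 = random_interval_session(presses, 60, rng)
    print(f"RR20 reinforcers without a cap: {rr20}")
    print(f"RI60 reinforcers without a cap: {len(ri60)}")
```

The key contrast is that under the ratio rule, pressing faster earns proportionally more reinforcers, whereas under the interval rule the reinforcement rate is bounded by elapsed time regardless of press rate.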

Devaluation testing through sensory-specific satiation was conducted across 2 days: a valued day and a devalued day. Mice were allowed to prefeed for 1 h on the home-cage control outcome (valued day), or the outcome previously earned by lever pressing (devalued day). Mice that did not consume pellets or sucrose during prefeeding were excluded from subsequent analysis. Each day immediately following prefeeding, mice were placed into each context for 5 min, during which the number of lever presses made was recorded but no outcome was delivered. For groups that received CNO or saline (1 mg/kg, 10 ml/kg), mice were given an intraperitoneal injection 15–30 min prior to prefeeding. Mice in all experimental groups were counterbalanced for schedule exposure, order of devaluation testing, outcome, and lever position. The order of context exposure was kept constant across training and testing. Investigators were not blind to the experimental groups. A subset of Air and CIE mice underwent a post-test feeding assay for 1 h immediately after operant testing on each of the devaluation days.
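The CNO dosing stated above (1 mg/kg delivered at 10 ml/kg) implies a fixed solution concentration and a weight-dependent injection volume. The sketch below works through that arithmetic; the 25 g body weight is a hypothetical value chosen for illustration, and the function name is not from the source.

```python
def injection_plan(body_weight_g, dose_mg_per_kg=1.0, volume_ml_per_kg=10.0):
    """Return (dose in mg, injection volume in ml, solution concentration in mg/ml)."""
    weight_kg = body_weight_g / 1000.0
    dose_mg = dose_mg_per_kg * weight_kg
    volume_ml = volume_ml_per_kg * weight_kg
    concentration_mg_per_ml = dose_mg_per_kg / volume_ml_per_kg
    return dose_mg, volume_ml, concentration_mg_per_ml


if __name__ == "__main__":
    # Hypothetical 25 g mouse.
    dose, volume, conc = injection_plan(25.0)
    print(f"Dose: {dose:.3f} mg, volume: {volume:.2f} ml, solution: {conc:.1f} mg/ml")
```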

Brain slice preparation

Mice were at least 16 weeks of age at the time of slice preparation. Coronal slices (250 μm thick) containing the OFC or DMS were prepared using a Pelco easiSlicer (Ted Pella Inc., Redding, CA). Mice were anesthetized by inhalation of isoflurane, and brains were rapidly removed and placed in 4 °C oxygenated ACSF containing the following (in mM): 210 sucrose, 26.2 NaHCO₃, 1 NaH₂PO₄, 2.5 KCl, 11 dextrose, bubbled with 95% O₂/5% CO₂. Slices were transferred to an ACSF solution for incubation containing the following (in mM): 120 NaCl, 25 NaHCO₃, 1.23 NaH₂PO₄, 3.3 KCl, 2.4 MgCl₂, 1.8 CaCl₂, 10 dextrose. Slices were continuously bubbled with 95% O₂/5% CO₂ at pH 7.4, 32 °C, and were maintained in this solution for at least 60 min prior to recording.

Patch-clamp electrophysiology

Whole-cell patch-clamp recordings were made in identified D1 and D2 SPNs of the DMS and pyramidal cells of the OFC. In D1-tdTomato mice, D1+ cells were identified as D1 SPNs and D1− cells as D2 SPNs. Cells were identified using an Olympus BX51WI microscope mounted on a vibration isolation table. Prior to patching onto a cell, tdTomato expression was used to verify cell type, and eYFP expression was used to verify terminal expression of ChR2. eYFP expression was never observed in SPN cell bodies. Recordings were made in ACSF containing (in mM): 120 NaCl, 25 NaHCO₃, 1.23 NaH₂PO₄, 3.3 KCl, 0.9 MgCl₂, 2.0 CaCl₂, and 10 dextrose, bubbled with 95% O₂/5% CO₂. ACSF was continuously perfused at a rate of 2.0 ml/min and maintained at a temperature of 32 °C. Picrotoxin (50 µM) was included in the recording ACSF to block GABA_A receptor-mediated synaptic currents. For experiments measuring asynchronous release, Ca²⁺ was replaced with 2 mM Sr²⁺ in the recording solution⁵⁴. Recording electrodes (thin-wall glass, WPI Instruments) were made using a PC-10 puller (Narishige International, Amityville, NY) to yield resistances of 3–6 MΩ. For current clamp experiments, electrodes were filled with (in mM): 135 KMeSO₄, 12 NaCl, 0.5 EGTA, 10 HEPES, 2 Mg-ATP, 0.3 Tris-GTP, 260–270 mOsm (pH 7.3). For voltage clamp experiments, electrodes were filled with (in mM): 120 CsMeSO₄, 15 CsCl, 8 NaCl, 10 HEPES, 0.2 EGTA, 10 TEA-Cl, 4 Mg-ATP, 0.3 Na-GTP, 0.1 spermine, and 5 QX-314-Cl. Access resistance was monitored throughout the experiments. Cells in which access resistance varied more than 20% were not included in the analysis.
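The access resistance inclusion criterion can be expressed as a simple check. This is a sketch only: the source does not state the reference point for the 20% limit, so the example assumes it is measured relative to the first reading of the recording, and the function name is hypothetical.

```python
def access_resistance_stable(ra_mohm, max_change=0.20):
    """Return True if every reading stays within max_change of the first reading.

    Assumption: variation is expressed relative to the initial access
    resistance; the source only states 'varied more than 20%'.
    """
    if not ra_mohm:
        raise ValueError("at least one access resistance reading is required")
    baseline = ra_mohm[0]
    return all(abs(r - baseline) / baseline <= max_change for r in ra_mohm)


if __name__ == "__main__":
    # Hypothetical readings in megaohms.
    print(access_resistance_stable([15.0, 16.0, 17.0]))  # largest change is about 13%
    print(access_resistance_stable([15.0, 17.0, 19.0]))  # largest change is about 27%
```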

Data acquisition

Glutamatergic afferents were stimulated either electrically or optically. For electrical stimulation, a stainless steel bipolar stimulating electrode (FHC, Inc.) was placed dorsal to the recording electrode, about 150–300 μm from the cell body. Optical stimulation was done using 470 nm blue light (4 ms) delivered via field illumination using a high-power LED (LED4D067, Thor Labs). Light intensity was adjusted to produce optically evoked excitatory post-synaptic currents (oEPSCs) with a magnitude of 100–300 pA. Recordings were made using a MultiClamp 700B amplifier (Molecular Devices, Union City, CA), filtered at 2 kHz, digitized at 10 kHz with Instrutech ITC-18 (HEKA Instruments, Bellmore, NY), and displayed and saved using AxographX (Axograph, Sydney, Australia). For PPR, 2 EPSCs were evoked separated by an ISI of 50–250 ms for 5–10 trials, collected at 0.1 Hz. Measurements of frequency and amplitude of asynchronous release in Sr²⁺ containing solution were restricted to 50–2050 ms after stimulus onset for 30 trials, collected at 0.05 Hz. Data from each neuron within a treatment group was combined and presented as mean ± SEM.
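The paired-pulse ratio (PPR) and the mean ± SEM summary described above can be sketched as follows. The source does not state whether PPR was taken as the ratio of trial-averaged amplitudes or the average of per-trial ratios; this example assumes the former. Absolute amplitudes are used because inward EPSCs are typically recorded as negative currents. All names and the example values are hypothetical.

```python
import math
import statistics


def paired_pulse_ratio(first_amps_pa, second_amps_pa):
    """PPR as mean |EPSC2| divided by mean |EPSC1| across trials (an assumption)."""
    mean_first = statistics.fmean(abs(a) for a in first_amps_pa)
    mean_second = statistics.fmean(abs(a) for a in second_amps_pa)
    return mean_second / mean_first


def mean_sem(values):
    """Return (mean, standard error of the mean) using the sample standard deviation."""
    mean = statistics.fmean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


if __name__ == "__main__":
    # Hypothetical amplitudes (pA) for one cell at a single inter-stimulus interval.
    epsc1 = [-200.0, -210.0, -190.0]
    epsc2 = [-150.0, -160.0, -140.0]
    print(f"PPR: {paired_pulse_ratio(epsc1, epsc2):.2f}")

    # Hypothetical per-cell PPR values within one treatment group.
    group_mean, group_sem = mean_sem([0.75, 0.80, 0.70, 0.85])
    print(f"Group PPR: {group_mean:.3f} ± {group_sem:.3f}")
```

A PPR below 1 indicates paired-pulse depression and above 1 indicates facilitation; shifts in PPR between groups are commonly interpreted as changes in release probability.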

Statistical analysis

Statistical significance was defined as p < 0.05. To ensure reproducibility of any observed effects, multiple cohorts (at least three cohorts) of Air and CIE exposure were used for all experiments except the experiment looking at asynchronous release (where only two cohorts were used). No effect of cohort was observed. Sample size was determined from previous studies and power analyses of the sample size necessary to detect action shifting in control mice. Statistical analysis was performed using GraphPad Prism 6 (GraphPad Software) and JASP. Acquisition data, including lever presses, response rate, rewards earned, and head entries, were analyzed using three-way repeated measures ANOVA (day × context × CIE exposure or group). For outcome devaluation testing, three-way repeated measures ANOVAs (context × devaluation state × CIE exposure or group) were performed to examine differences in the pattern of responding. For the outcome devaluation test, lever presses in the valued and devalued states were normalized to total lever pressing (valued + devalued) in each context. In addition, one-sample t tests were conducted on the distribution of lever presses to examine whether mice differentially distributed lever presses between valued and devalued states (i.e., whether the distribution differed from 0.5, which indicates equal lever presses made between valued and devalued states). We also calculated a devaluation index for each mouse in each context by applying the following equation: [(valued lever presses – devalued lever presses)/(valued lever presses + devalued lever presses)]. We then applied a two-way repeated measures ANOVA (CIE exposure × context) to assess whether there was a within-subject shift in the degree of goal-directed control between contexts, followed by Bonferroni-corrected post hoc follow-ups performed between contexts within a group. We then used a one-sample t test against a hypothetical value of 0 (indicating equal pressing between states) to examine the degree to which lever pressing was goal-directed. For patch-clamp experiments, action potential firing and PPR data were analyzed using a two-way ANOVA with Bonferroni-corrected post hoc analyses. Frequency and amplitude of asynchronous release were analyzed using a two-tailed Student’s t test. Data are presented as mean ± SEM. No significant differences in the spread of variance were observed between groups, and all data were normally distributed.
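The normalization and devaluation index described above reduce to a few lines of arithmetic. The following is a minimal sketch using only the Python standard library; it is not the GraphPad Prism or JASP workflow used in the study. It computes the normalized lever presses, the devaluation index, and the one-sample t statistic against a reference value (0 for the index, 0.5 for normalized presses). The function names and example counts are hypothetical, and the p-value lookup is deliberately omitted.

```python
import math
import statistics


def normalized_presses(valued, devalued):
    """Proportion of total presses made in the valued and devalued states."""
    total = valued + devalued
    if total == 0:
        raise ValueError("no lever presses recorded in either state")
    return valued / total, devalued / total


def devaluation_index(valued, devalued):
    """(valued - devalued) / (valued + devalued); 0 means equal pressing."""
    total = valued + devalued
    if total == 0:
        raise ValueError("no lever presses recorded in either state")
    return (valued - devalued) / total


def one_sample_t(values, reference):
    """Return (t statistic, degrees of freedom) for a one-sample t test."""
    n = len(values)
    sem = statistics.stdev(values) / math.sqrt(n)
    return (statistics.fmean(values) - reference) / sem, n - 1


if __name__ == "__main__":
    # Hypothetical (valued, devalued) press counts for four mice in one context.
    counts = [(30, 10), (24, 12), (18, 14), (40, 20)]
    indices = [devaluation_index(v, d) for v, d in counts]
    t_stat, df = one_sample_t(indices, reference=0.0)
    print("Devaluation indices:", [round(i, 3) for i in indices])
    print(f"t({df}) = {t_stat:.2f} against 0")
```

An index near 0 indicates insensitivity to devaluation (habitual responding), while a positive index indicates that responding decreased for the devalued outcome (goal-directed responding).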

Data availability

All relevant data are available from the authors upon request.

References

Belin, D., Belin-Rauscent, A., Murray, J. E. & Everitt, B. J. Addiction: failure of control over maladaptive incentive habits. Curr. Opin. Neurobiol. 23, 564–574 (2013).
Everitt, B. J. & Robbins, T. W. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nat. Neurosci. 8, 1481–1489 (2005).
Gremel, C. M. & Lovinger, D. M. Associative and sensorimotor cortico-basal ganglia circuit roles in effects of abused drugs. Genes Brain. Behav. 16, 71–85 (2017).
Hogarth, L., Balleine, B. W., Corbit, L. H. & Killcross, S. Associative learning mechanisms underpinning the transition from recreational drug use to addiction. Ann. N. Y. Acad. Sci. 1282, 12–24 (2013).
Belin, D. & Everitt, B. J. Cocaine seeking habits depend upon dopamine-dependent serial connectivity linking the ventral with the dorsal striatum. Neuron 57, 432–441 (2008).
Fuchs, R. A., Branham, R. K. & See, R. E. Different neural substrates mediate cocaine seeking after abstinence versus extinction training: a critical role for the dorsolateral caudate-putamen. J. Neurosci. 26, 3584–3588 (2006).
Murray, J. E., Belin, D. & Everitt, B. J. Double dissociation of the dorsomedial and dorsolateral striatal control over the acquisition and performance of cocaine seeking. Neuropsychopharmacology 37, 2456–2466 (2012).
See, R. E., Elliott, J. C. & Feltenstein, M. W. The role of dorsal vs ventral striatal pathways in cocaine-seeking behavior after prolonged abstinence in rats. Psychopharmacology 194, 321–331 (2007).
Vanderschuren, L. J. M. J., Di Ciano, P. & Everitt, B. J. Involvement of the dorsal striatum in cue-controlled cocaine seeking. J. Neurosci. 25, 8665–8670 (2005).
Corbit, L. H., Nie, H. & Janak, P. H. Habitual responding for alcohol depends upon both AMPA and D2 receptor signaling in the dorsolateral striatum. Front. Behav. Neurosci. 8, 301 (2014).
Corbit, L. H., Nie, H. & Janak, P. H. Habitual alcohol seeking: time course and the contribution of subregions of the dorsal striatum. Biol. Psychiatry 72, 389–395 (2012).
Zapata, A., Minney, V. L. & Shippenberg, T. S. Shift from goal-directed to habitual cocaine seeking after prolonged experience in rats. J. Neurosci. 30, 15457–15463 (2010).
Belin, D., Jonkman, S., Dickinson, A., Robbins, T. W. & Everitt, B. J. Parallel and interactive learning processes within the basal ganglia: Relevance for the understanding of addiction. Behav. Brain. Res. 199, 89–102 (2009).
Everitt, B. J. & Robbins, T. W. Drug addiction: updating actions to habits to compulsions ten years on. Annu. Rev. Psychol. 67, 23–50 (2016).
Yin, H. H., Knowlton, B. J. & Balleine, B. W. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur. J. Neurosci. 19, 181–189 (2004).
Yin, H. H., Ostlund, S. B., Knowlton, B. J. & Balleine, B. W. The role of the dorsomedial striatum in instrumental conditioning. Eur. J. Neurosci. 22, 513–523 (2005).
Yin, H. H., Knowlton, B. J. & Balleine, B. W. Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav. Brain Res. 166, 189–196 (2006).
Hilario, M., Holloway, T., Jin, X. & Costa, R. M. Different dorsal striatum circuits mediate action discrimination and action generalization. Eur. J. Neurosci. 35, 1105–1114 (2012).
Gremel, C. M. & Costa, R. M. Orbitofrontal and striatal circuits dynamically encode the shift between goal-directed and habitual actions. Nat. Commun. 4, 2264 (2013).
Yin, H. H., Knowlton, B. J. & Balleine, B. W. Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur. J. Neurosci. 22, 505–512 (2005).
Gremel, C. M. et al. Endocannabinoid modulation of orbitostriatal circuits gates habit formation. Neuron 90, 1312–1324 (2016).
Ersche, K. D. et al. Carrots and sticks fail to change behavior in cocaine addiction. Science 352, 1468–1471 (2016).
Sjoerds, Z. et al. Behavioral and neuroimaging evidence for overreliance on habit learning in alcohol-dependent patients. Transl. Psychiatry 3, e337 (2013).
Balleine, B. W. & O’Doherty, J. P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 35, 48–69 (2010).
Goldstein, R. Z. & Volkow, N. D. Dysfunction of the prefrontal cortex in addiction: neuroimaging findings and clinical implications. Nat. Rev. Neurosci. 12, 652–669 (2011).
McKim, T. H., Bauer, D. J. & Boettiger, C. A. Addiction history associates with the propensity to form habits. J. Cogn. Neurosci. 28, 1024–1038 (2016).
Boettiger, C. A. et al. Immediate reward bias in humans: fronto-parietal networks and a role for the catechol-O-methyltransferase 158(Val/Val) genotype. J. Neurosci. 27, 14383–14391 (2007).
Loeber, S. et al. Impairment of cognitive abilities and decision making after chronic use of alcohol: the impact of multiple detoxifications. Alcohol Alcohol 44, 372–381 (2009).
Duka, T. et al. Unique brain areas associated with abstinence control are damaged in multiply detoxified alcoholics. Biol. Psychiatry 70, 545–552 (2011).
Oscar-Berman, M. & Marinković, K. Alcohol: effects on neurobehavioral functions and the brain. Neuropsychol. Rev. 17, 239–257 (2007).
Crews, F. T. & Boettiger, C. A. Impulsivity, frontal lobes and risk for addiction. Pharmacol. Biochem. Behav. 93, 237–247 (2009).
Becker, H. C. Positive relationship between the number of prior ethanol withdrawal episodes and the severity of subsequent withdrawal seizures. Psychopharmacology 116, 26–32 (1994).
Becker, H. C. & Lopez, M. F. Increased ethanol drinking after repeated chronic ethanol exposure and withdrawal experience in C57BL/6 mice. Alcohol. Clin. Exp. Res. 28, 1829–1838 (2004).
Lopez, M. F. & Becker, H. C. Effect of pattern and number of chronic ethanol exposures on subsequent voluntary ethanol intake in C57BL/6J mice. Psychopharmacology 181, 688–696 (2005).
Griffin, W. C. 3rd, Lopez, M. F. & Becker, H. C. Intensity and duration of chronic ethanol exposure is critical for subsequent escalation of voluntary ethanol drinking in mice. Alcohol. Clin. Exp. Res 33, 1893–1900 (2009).
den Hartog, C. et al. Inactivation of the lateral orbitofrontal cortex increases drinking in ethanol-dependent but not non-dependent mice. Neuropharmacology 107, 451–459 (2016).
Renteria, R., Buske, T. R. & Morrisett, R. A. Long-term subregion-specific encoding of enhanced ethanol intake by D1DR medium spiny neurons of the nucleus accumbens. Addict. Biol. https://doi.org/10.1111/adb.12526 (2017).
Gourley, S. L. et al. The orbitofrontal cortex regulates outcome-based decision-making via the lateral striatum. Eur. J. Neurosci. 38, 2382–2388 (2013).
Rhodes, S. E. V. & Murray, E. A. Differential effects of amygdala, orbital prefrontal cortex, and prelimbic cortex lesions on goal-directed behavior in Rhesus Macaques. J. Neurosci. 33, 3380–3389 (2013).
Bradfield, L. A., Dezfouli, A., van Holstein, M., Chieng, B. & Balleine, B. W. Medial orbitofrontal cortex mediates outcome retrieval in partially observable task situations. Neuron 88, 1268–1280 (2015).
Stalnaker, T. A., Cooch, N. K. & Schoenbaum, G. What the orbitofrontal cortex does not do. Nat. Neurosci. 18, 620–627 (2015).
Dias-Ferreira, E. et al. Chronic stress causes frontostriatal reorganization and affects decision-making. Science 325, 621–625 (2009).
Pereira, E. F., Aracava, Y., Aronstam, R. S., Barreiro, E. J. & Albuquerque, E. X. Pyrazole, an alcohol dehydrogenase inhibitor, has dual effects on N-methyl-d-aspartate receptors of hippocampal pyramidal cells: agonist and noncompetitive antagonist. J. Pharmacol. Exp. Ther. 261, 331–340 (1992).
Becker, H. C., Diaz-Granados, J. L. & Weathersby, R. T. Repeated ethanol withdrawal experience increases the severity and duration of subsequent withdrawal seizures in mice. Alcohol 14, 319–326 (1997).
Adams, C. D. & Dickinson, A. Instrumental responding following reinforcer devaluation. Q. J. Exp. Psychol. B 33, 109–121 (1981).
Adams, C. D. Variations in the sensitivity of instrumental responding to reinforcer devaluation. Q. J. Exp. Psychol. B 34, 77–98 (1982).
Dickinson, A. & Balleine, B. Motivational control of goal-directed action. Anim. Learn. Behav. 22, 1–18 (1994).
Dickinson, A. Actions and habits: the development of behavioural autonomy. Philos. Trans. R. Soc. B Biol. Sci. 308, 67–78 (1985).
Valentin, V. V., Dickinson, A. & O’Doherty, J. P. Determining the neural substrates of goal-directed learning in the human brain. J. Neurosci. 27, 4019–4026 (2007).
Wall, N. R., De La Parra, M., Callaway, E. M. & Kreitzer, A. C. Differential innervation of direct- and indirect-pathway striatal projection neurons. Neuron 79, 347–360 (2013).
Gerfen, C. R. & Surmeier, D. J. Modulation of striatal projection systems by dopamine. Annu. Rev. Neurosci. 34, 441–466 (2011).
Tecuapetla, F., Jin, X., Lima, S. Q. & Costa, R. M. Complementary contributions of striatal projection pathways to action initiation and execution. Cell 166, 703–715 (2016).
Ding, J., Peterson, J. D. & Surmeier, D. J. Corticostriatal and thalamostriatal synapses have distinctive properties. J. Neurosci. 28, 6483–6492 (2008).
Sciamanna, G., Ponterio, G., Mandolesi, G., Bonsi, P. & Pisani, A. Optogenetic stimulation reveals distinct modulatory properties of thalamostriatal vs corticostriatal glutamatergic inputs to fast-spiking interneurons. Sci. Rep. 5, 16742 (2015).
Sciamanna, G. et al. Cholinergic dysfunction alters synaptic integration between thalamostriatal and corticostriatal inputs in DYT1 dystonia. J. Neurosci. 32, 11991–12004 (2012).
Alexander, G. M. et al. Remote control of neuronal activity in transgenic mice expressing evolved G protein-coupled receptors. Neuron 63, 27–39 (2009).
Corbit, L. H., Chieng, B. C. & Balleine, B. W. Effects of repeated cocaine exposure on habit learning and reversal by N-acetylcysteine. Neuropsychopharmacology 39, 1893–1901 (2014).
DePoy, L. M., Zimmermann, K. S., Marvar, P. J. & Gourley, S. L. Induction and blockade of adolescent cocaine-induced habits. Biol. Psychiatry 81, 595–605 (2016).
Nelson, A. & Killcross, S. Amphetamine exposure enhances habit formation. J. Neurosci. 26, 3805–3812 (2006).
Nimitvilai, S., Lopez, M. F., Mulholland, P. J. & Woodward, J. J. Chronic intermittent ethanol exposure enhances the excitability and synaptic plasticity of lateral orbitofrontal cortex neurons and induces a tolerance to the acute inhibitory actions of ethanol. Neuropsychopharmacology 41, 1112–1127 (2016).
Nimitvilai, S. et al. Orbitofrontal neuroadaptations and cross-species synaptic biomarkers in heavy drinking macaques. J. Neurosci. 37, 3646–3660 (2017).
Wang, J. et al. Alcohol elicits functional and structural plasticity selectively in dopamine D1 receptor-expressing neurons of the dorsomedial striatum. J. Neurosci. 35, 11634–11643 (2015).
Cheng, Y. et al. Distinct synaptic strengthening of the striatal direct and indirect pathways drives alcohol consumption. Biol. Psychiatry 81, 918–929 (2017).
Cuzon Carlson, V. C. et al. Synaptic and morphological neuroadaptations in the putamen associated with long-term, relapsing alcohol drinking in primates. Neuropsychopharmacology 36, 2513–2528 (2011).
Hunnicutt, B. J. et al. A comprehensive excitatory input map of the striatum reveals novel functional organization. Elife 5, e19103 (2016).
Shan, Q., Ge, M., Christie, M. J. & Balleine, B. W. The acquisition of goal-directed actions generates opposing plasticity in direct and indirect pathways in dorsomedial striatum. J. Neurosci. 34, 9196–9201 (2014).
O’Hare, J. K. et al. Pathway-specific striatal substrates for habitual behavior. Neuron 89, 472–479 (2016).
Volkow, N. D. & Fowler, J. S. Addiction, a disease of compulsion and drive: involvement of the orbitofrontal cortex. Cereb. Cortex. 10, 318–325 (2000).
Badanich, K. A., Becker, H. C. & Woodward, J. J. Effects of chronic intermittent ethanol exposure on orbitofrontal and medial prefrontal cortex-dependent behaviors in mice. Behav. Neurosci. 125, 879–891 (2011).
McGuier, N. S., Padula, A. E., Lopez, M. F., Woodward, J. J. & Mulholland, P. J. Withdrawal from chronic intermittent alcohol exposure increases dendritic spine density in the lateral orbitofrontal cortex of mice. Alcohol 49, 21–27 (2015).
Gorski, J. A. et al. Cortical excitatory neurons and glia, but not GABAergic neurons, are produced in the Emx1-expressing lineage. J. Neurosci. 22, 6309–6314 (2002).
Durieux, P. F. et al. D2R striatopallidal neurons inhibit both locomotor and drug reward processes. Nat. Neurosci. 12, 393–395 (2009).
Shuen, J. A., Chen, M., Gloss, B. & Calakos, N. Drd1a-tdTomato BAC transgenic mice for simultaneous visualization of medium spiny neurons in the direct and indirect pathways of the basal ganglia. J. Neurosci. 28, 2681–2685 (2008).

Acknowledgements

This work was funded by National Institutes of Health grants R01AA026077 and R00AA021780-02 awarded to C.M.G., as well as funding from the Brain and Behavior Research (NARSAD) and Whitehall Foundations awarded to C.M.G.

Author information

Authors and Affiliations

Department of Psychology, University of California San Diego, La Jolla, CA, 92093, USA
Rafael Renteria, Emily T. Baltz & Christina M. Gremel
The Neurosciences Graduate Program, University of California San Diego, La Jolla, CA, 92093, USA
Christina M. Gremel

Contributions

This study was designed by R.R. and C.M.G. The manuscript was written by R.R. and C.M.G. Experiments were performed by R.R., C.M.G., and E.T.B. The data were analyzed by R.R. and C.M.G.

Corresponding author

Correspondence to Christina M. Gremel.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Renteria, R., Baltz, E.T. & Gremel, C.M. Chronic alcohol exposure disrupts top-down control over basal ganglia action selection to produce habits. Nat Commun 9, 211 (2018). https://doi.org/10.1038/s41467-017-02615-9

Received: 29 March 2017
Accepted: 13 December 2017
Published: 15 January 2018
DOI: https://doi.org/10.1038/s41467-017-02615-9
