How can we estimate QALYs based on PHQ-9 scores? Equipercentile linking analysis of PHQ-9 and EQ-5D

Toshi A Furukawa; Stephen Z Levine; Claudia Buntrock; David D Ebert; Simon Gilbody; Sally Brabyn; David Kessler; Cecilia Björkelund; Maria Eriksson; Annet Kleiboer; Annemieke van Straten; Heleen Riper; Jesus Montero-Marin; Javier Garcia-Campayo; Rachel Phillips; Justine Schneider; Pim Cuijpers; Eirini Karyotaki

doi:10.1136/ebmental-2020-300240

Health economics

Original research

Toshi A Furukawa1 (http://orcid.org/0000-0003-2159-3776),
Stephen Z Levine2 (http://orcid.org/0000-0002-5544-0420),
Claudia Buntrock3,
David D Ebert4,
Simon Gilbody5,
Sally Brabyn5,
David Kessler6,
Cecilia Björkelund7,
Maria Eriksson7,
Annet Kleiboer4,
Annemieke van Straten4,
Heleen Riper4,
Jesus Montero-Marin8,
Javier Garcia-Campayo9,10,
Rachel Phillips11,
Justine Schneider12,
Pim Cuijpers4,
Eirini Karyotaki4

¹ Department of Health Promotion and Human Behavior, Kyoto University Graduate School of Medicine / School of Public Health, Kyoto, Japan
² Department of Community Mental Health, Faculty of Social Welfare and Health Sciences, University of Haifa, Haifa, Israel
³ Department of Clinical Psychology and Psychotherapy, Friedrich-Alexander-University Erlangen-Nuremberg, Erlangen, Germany
⁴ Department of Clinical, Neuro and Developmental Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
⁵ Department of Health Sciences, University of York, York, UK
⁶ Population Health Sciences & National Institute for Health Research Bristol Biomedical Research Centre, University of Bristol, Bristol, UK
⁷ Primary Health Care, School of Public Health and Community Medicine, Institute of Medicine, University of Gothenburg, Gothenburg, Sweden
⁸ Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, UK
⁹ Aragon Institute for Health Research (IIS Aragón), Miguel Servet University Hospital, Zaragoza, Spain
¹⁰ Primary Care Prevention and Health Promotion Research Network, RedIAPP, Madrid, Spain
¹¹ Faculty of Medicine, School of Public Health, Imperial College London, London, UK
¹² School of Sociology & Social Policy and Institute of Mental Health, University of Nottingham, Nottingham, UK

Correspondence to Professor Toshi A Furukawa, Department of Health Promotion and Human Behavior, Kyoto University Graduate School of Medicine / School of Public Health, Kyoto, Japan; furukawa{at}kuhp.kyoto-u.ac.jp

Abstract

Background Quality-adjusted life years (QALYs) are widely used to measure the impact of various diseases on both the quality and quantity of life and in their economic valuations. It will be clinically important and informative if we can estimate QALYs based on measurements of depression severity.

Objective To construct a conversion table from the Patient Health Questionnaire-9 (PHQ-9), the most frequently used depression scale in recent years, to the Euro-Qol Five Dimensions Three Levels (EQ-5D-3L), one of the most commonly used instruments to assess QALYs.

Methods We obtained individual participant data of randomised controlled trials of internet cognitive-behavioural therapy which had administered depression severity scales and the EQ-5D-3L at baseline and at end of treatment. Scores from depression scales were all converted into the PHQ-9 according to the validated algorithms. We used equipercentile linking to establish correspondences between the PHQ-9 and the EQ-5D-3L.

Findings Individual-level data from seven trials (total N=2457) were available. Subthreshold depression (PHQ-9 scores between 5 and 10) corresponded with EQ-5D-3L index values of 0.9–0.8, mild major depression (10–15) with 0.8–0.7, moderate depression (15–20) with 0.7–0.5 and severe depression (20 or higher) with 0.6–0.0. A five-point improvement in PHQ-9 corresponded approximately with an increase in EQ-5D-3L score by 0.03 and a ten-point improvement by approximately 0.25.

Conclusions and Clinical Implications The conversion table between the PHQ-9 and the EQ-5D-3L scores will enable fine-grained assessment of burden of depression at its various levels of severity and of impacts of its various treatments.

depression & mood disorders

Data availability statement

Data are available upon reasonable request. The overall database used for this IPD is restricted due to data sharing agreements with the research institutes where the studies were conducted. IPD from individual studies are available from the individual study authors.

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Introduction

Quality-adjusted life years (QALYs) have been increasingly used in general medicine and in psychiatry to evaluate the impact of a disease on both the quantity and quality of life.1 One QALY is equal to 1 year in perfect health; the value assigned to a year of life can range down to zero (death) or may even be negative (states worse than death). QALYs can be used to compare the burdens of various diseases, to appreciate the impact of their interventions, to help set priorities in resource allocations across different diseases and interventions and to inform personal decisions.

The representative methods for evaluating QALYs are generic, preference-based measures of health, including the Euro-Qol Five Dimensions (EQ-5D)2 3 and the SF-6D, which is based on the Short Form Survey-36 (SF-36).4 5 Of these, the EQ-5D is the most frequently used and is the instrument preferred by the National Institute for Health and Care Excellence in the UK. While the responsiveness of such generic measures to various mental conditions, especially severe mental illnesses, has been questioned,6 their validity and responsiveness to common mental disorders including depression and anxiety have been generally established.7 8

However, the traditional focus of measurements in mental health has centred mainly on symptoms. Many trials have, therefore, not administered the generic health-related quality of life measures. This has hindered comparison of impacts of mental disorders vis-à-vis other medical conditions on the one hand and also evaluation of values of their interventions on the other.9 10

We have been collecting individual participant-level data from randomised controlled trials of internet cognitive-behavioural therapies (iCBT) for depression,11 several of which administered both symptomatologic scales and generic health status scales simultaneously. This study, therefore, attempts to link the depression-specific measure onto the generic measure of health in order to enable estimation of QALYs for depressive states and their changes. Such cross-walking should facilitate assessment of the burden of depression at its various levels of severity and of the impacts of its various treatments.

Methods

Database

We have been accumulating a data set of individual participant data of randomised controlled trials of iCBT among adults with depressive symptoms, as established by specified cut-offs on self-report scales or by diagnostic interviews.11 For this study, we have selected studies that have administered the EQ-5D and depression severity scales at baseline and at end of treatment. We excluded patients if they had missing data in either of the two scales at baseline or at endpoint. We excluded studies that focused on patients with general medical disorders (eg, diabetes, glioma) and depressive symptoms.

Measures

EQ-5D-3L

The EQ-5D-3L comprises five dimensions of mobility, self-care, usual activities, pain/discomfort and anxiety/depression, each rated on three levels corresponding with 1=no problems, 2=some/moderate problems or 3=extreme problems/unable to do. This produces 3^5=243 different health states, ranging from no problem at all in any dimension (11111) to extreme problems on all dimensions (33333). Each of these 243 states is provided with a preference-based score, as determined through the time trade-off (TTO) technique in a sample of the general population. In TTO, respondents are asked to give the relative length of time in full health that they would be willing to sacrifice for the poor health states as represented by each of the 243 combinations above. EQ-5D scores range from 1 (full health) through 0 (death) to negative values (states worse than death), bounded by −1. The scoring algorithm for the UK is based on TTO responses of a random sample (n=2997) of noninstitutionalised adults. Over the years, value sets for EQ-5D-3L have been produced for many countries/regions.2 3 7
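As an illustration of how the 243 health states arise, the following minimal Python sketch enumerates every combination of three levels across the five dimensions. The names DIMENSIONS, LEVELS and all_health_states are hypothetical and chosen only for this example; the sketch does not assign index values, which require a country-specific value set.

```python
from itertools import product

# The five EQ-5D-3L dimensions, in the conventional order of the state code.
DIMENSIONS = (
    "mobility",
    "self-care",
    "usual activities",
    "pain/discomfort",
    "anxiety/depression",
)
LEVELS = (1, 2, 3)  # 1=no problems, 2=some/moderate problems, 3=extreme problems


def all_health_states():
    """Return every health state as a five-digit string such as '11111'."""
    return [
        "".join(str(level) for level in combo)
        for combo in product(LEVELS, repeat=len(DIMENSIONS))
    ]


states = all_health_states()
print(len(states))            # number of distinct health states
print(states[0], states[-1])  # best and worst states
```

Each digit in a state code gives the level reported on the corresponding dimension, so 21111 would denote some problems with mobility and no problems elsewhere.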

Depression severity scales

We included any validated depression severity measures. The scale scores were converted into the most frequently used scale, namely, the Patient Health Questionnaire-9 (PHQ-9),12 using the established conversion algorithms13 14 for the Beck Depression Inventory, second edition (BDI-II)15 or the Centre for Epidemiologic Studies Depression Scale (CES-D).16

The PHQ-9 consists of the nine diagnostic criteria items of major depression from the DSM-IV, each rated on a scale between 0 and 3, making the total score range 0–27. The instrument has demonstrated excellent reliability, validity and responsiveness. The cut-offs have been proposed as 0–4, 5–9, 10–14, 15–19 and 20–27 for no, mild, moderate, moderately severe and severe depression, respectively.12
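The scoring rule and the proposed cut-offs can be expressed compactly. The sketch below is an illustration only, with hypothetical function names (phq9_total, phq9_severity); it sums nine item ratings of 0 to 3 and assigns the total to one of the five bands listed above.

```python
def phq9_total(item_scores):
    """Sum the nine PHQ-9 item ratings (each 0-3) into a total of 0-27."""
    if len(item_scores) != 9:
        raise ValueError("PHQ-9 has exactly nine items")
    if any(score not in (0, 1, 2, 3) for score in item_scores):
        raise ValueError("each item must be rated 0, 1, 2 or 3")
    return sum(item_scores)


def phq9_severity(total):
    """Map a PHQ-9 total score to the proposed severity band."""
    if not 0 <= total <= 27:
        raise ValueError("PHQ-9 total must lie between 0 and 27")
    if total <= 4:
        return "none"
    if total <= 9:
        return "mild"
    if total <= 14:
        return "moderate"
    if total <= 19:
        return "moderately severe"
    return "severe"


example_items = [2, 2, 1, 2, 1, 1, 2, 1, 0]  # made-up responses for illustration
example_total = phq9_total(example_items)
print(example_total, phq9_severity(example_total))
```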

Statistical analyses

We first calculated Spearman correlation coefficients between PHQ-9 and EQ-5D total scores at baseline, at end of treatment and their changes, to establish if the linking is justified. Correlations were considered weak if the absolute coefficients were <0.3, moderate if they were ≥0.3 and <0.7 and strong if they were ≥0.7.17 Correlations ≥0.3 have been recommended to establish linking.18 We then applied the equipercentile linking procedure,19 which identifies scores on PHQ-9 and EQ-5D or their changes with the same percentile ranks and allows for a nominal translation from PHQ-9 to EQ-5D by using their percentile values. This approach has been used successfully for scales in depression, schizophrenia or Alzheimer’s disease.14 20–22 We analysed all trials collectively rather than by trial to maximise the sample size, ensure variability in the included populations and attain robust estimates.
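To make the idea of equipercentile linking concrete, the following is a simplified Python sketch under stated assumptions; it is not the algorithm implemented in the R package equate that was used for the analyses. It assumes mid-percentile ranks, linear interpolation between observed scores, no smoothing, and a reversal of the percentile scale because higher PHQ-9 scores indicate worse health whereas higher EQ-5D values indicate better health. All function names and the data are hypothetical.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(sorted_scores, x):
    """Mid-percentile rank (0-100) of x within an ascending list of scores."""
    below = bisect_left(sorted_scores, x)
    at_or_below = bisect_right(sorted_scores, x)
    return 100.0 * (below + 0.5 * (at_or_below - below)) / len(sorted_scores)


def score_at_percentile(sorted_scores, pct):
    """Linearly interpolated score at a given percentile (0-100)."""
    pos = (pct / 100.0) * (len(sorted_scores) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_scores) - 1)
    frac = pos - lo
    return sorted_scores[lo] * (1.0 - frac) + sorted_scores[hi] * frac


def equipercentile_link(x_scores, y_scores, x_values, reverse=True):
    """Map each value in x_values to the y score with the matching percentile.

    reverse=True pairs the p-th percentile of x with the (100-p)-th
    percentile of y, for scales that run in opposite directions.
    """
    if not x_scores or not y_scores:
        raise ValueError("both score samples must be non-empty")
    xs, ys = sorted(x_scores), sorted(y_scores)
    table = {}
    for x in x_values:
        pct = percentile_rank(xs, x)
        if reverse:
            pct = 100.0 - pct
        table[x] = score_at_percentile(ys, pct)
    return table


# Synthetic, perfectly ordered data for illustration only (not trial data).
synthetic_phq9 = list(range(28))
synthetic_eq5d = [1.0 - score / 27.0 for score in synthetic_phq9]
linked = equipercentile_link(synthetic_phq9, synthetic_eq5d, range(28))
for phq in (5, 10, 15, 20):
    print(phq, round(linked[phq], 2))
```

With real data the two samples would be the observed PHQ-9 and EQ-5D scores of the same participants, and the resulting table would play the role of a conversion table.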

We conducted a sensitivity analysis by excluding studies that required the conversion of other depression severity scores into PHQ-9.

All the analyses were conducted in R V.4.0.2, with the package equate V.2.0.7.23

Ethics statement

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. Ethical approval was not required for this study as it used only deidentified patient data.

Findings

Included studies

We identified seven RCTs of iCBT (total n=2457), which administered validated depression scales and EQ-5D both at baseline and at endpoint (online supplemental eTable 1). Three studies included only patients with major depressive disorder (MDD), one only patients with subthreshold depression and the remaining three included both. All the studies administered EQ-5D-3L. PHQ-9 scores were converted from the BDI-II in three studies24–26 and from the CES-D in one study.27 The mean age of the participants was 41.8 (SD=12.3) years, 66.0% (1622/2457) were women and they scored 14.0 (5.4) on PHQ-9 and 0.74 (0.20) on EQ-5D at baseline and 9.1 (6.0) and 0.79 (0.21), respectively, at endpoint. When using the standard cut-offs of the PHQ-9,12 2.4% (60/2449) suffered from no depression (PHQ-9 scores <5), 20.2% (492/2449) from subthreshold depression (5≤PHQ-9 scores <10), 33.5% (820/2449) from mild depression (10≤PHQ-9 scores <15), 26.5% (649/2449) from moderate depression (15≤PHQ-9 scores <20) and 17.3% (424/2449) from severe depression (20≤PHQ-9 scores) at baseline.

Equipercentile linking

Spearman’s correlation coefficient between the PHQ-9 and the EQ-5D scores was r=−0.29 at baseline, increased to r=−0.50 after intervention and was r=−0.38 for change scores.
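For orientation, these coefficients can be checked against the thresholds given under Statistical analyses. The sketch below is illustrative only and uses hypothetical function names; it classifies each coefficient by its absolute value and reports whether it reaches the recommended minimum of 0.3 for linking.

```python
def correlation_strength(r):
    """Classify a correlation by its absolute value: <0.3, 0.3 to <0.7, >=0.7."""
    magnitude = abs(r)
    if magnitude < 0.3:
        return "weak"
    if magnitude < 0.7:
        return "moderate"
    return "strong"


def supports_linking(r, threshold=0.3):
    """True if the absolute correlation reaches the recommended threshold."""
    return abs(r) >= threshold


# Coefficients as reported in the text above.
reported = {"baseline": -0.29, "end of treatment": -0.50, "change scores": -0.38}
for label, r in reported.items():
    print(label, r, correlation_strength(r), supports_linking(r))
```

On this reading, the end-of-treatment and change-score correlations fall in the moderate band, while the baseline coefficient lies just below the threshold.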

Figure 1 shows the equipercentile linking between PHQ-9 and EQ-5D total scores at baseline and at endpoint. Figure 2 shows the same between their change scores. Table 1 summarises the correspondences between the two scales.

Figure 1

PHQ-9 and EQ-5D total scores at baseline and endpoint. EQ-5D, Euro-Qol Five Dimensions; PHQ-9, Patient Health Questionnaire-9.

Figure 2

PHQ-9 change scores and EQ-5D change scores. EQ-5D, Euro-Qol Five Dimensions; PHQ-9, Patient Health Questionnaire-9.

Table 1

Conversion table from PHQ-9 to EQ-5D total and change scores

Sensitivity analysis

When we limited the samples to the three studies28–30 that administered PHQ-9 (total n=1375), the linking results were replicated (online supplemental eFigure 1).

Discussion

This is the first study to link a depression severity measure with the EQ-5D-3L both for total and change scores. To summarise, subthreshold depression corresponded with EQ-5D-3L index values of 0.9–0.8, mild major depression with 0.8–0.7, moderate depression with 0.7–0.5 and severe depression with 0.6–0.0. A five-point improvement in PHQ-9 corresponded approximately with an increase in EQ-5D-3L index values by 0.03, and a ten-point improvement can lead to an increase by approximately 0.25.

A systematic review of utility values for depression31 found that the pooled mean (SD) utilities based on studies using the standard gamble as a direct valuation method were 0.69 (0.14) for mild, 0.52 (0.28) for moderate and 0.27 (0.26) for severe major depression. The estimates based on studies using EQ-5D as an indirect valuation method were 0.56 (0.16) for mild, 0.52 (0.28) for moderate and 0.25 (0.15) for severe depression. One recent study regressed PHQ-9 on SF-6D scores among 394 patients in the Improving Access to Psychological Therapies (IAPT) cohort7 32 and estimated none/mild depression on PHQ-9 to be worth 0.73 SF-6D scores, moderate depression 0.65 and severe depression 0.56. Our results are largely in line with these studies.

There was a consistent difference of about 0.07 EQ-5D scores for the same PHQ-9 score depending on whether it represented the baseline or the endpoint measurement (figure 1). This is understandable because a patient would rate their health status as less satisfactory if they remained as symptomatic after the treatment as before and also because it means that they continued to suffer from depression for longer. It is, therefore, reasonable to use the conversion table at baseline for relatively new cases of depression and that at end of treatment for more chronic cases (table 1).

An effect size to be typically expected after 2 months of antidepressant pharmacotherapy33 or psychotherapy27 34 over the pill placebo condition is 0.3. Given that the average SD of PHQ-9 in the studies was about 6, an effect size of 0.3 corresponds to a difference of about two points on PHQ-9. The differences in EQ-5D scores corresponding with end-of-treatment PHQ-9 scores of x versus x+2, where x is between 5 and 15 (table 1), range between 0.08 and 0.13, giving an approximate average of 0.1 EQ-5D scores. If we assume that the same difference would continue for the ensuing 10 months, the gain in QALY per year would be equal to 0.09 QALY; if we assume that the difference would eventually wear out over the course of the year due to naturalistic improvements to be expected in the control group, the gain in QALY per year would be equal to 0.05 QALY. (See figure 3 for a schematic drawing to help understand the calculation of QALYs based on changing EQ-5D scores. In reality, the changes will be more smoothly curvilinear but the calculation will be similar.) Since one QALY is typically valued at US$50 000 or 30 000 pounds sterling,35 such therapies would be cost-effective if they cost US$2500 to US$4500 (1500 to 2700 pounds) or less. If a 1-day fill of generic selective serotonin reuptake inhibitor antidepressants costs US$1–3 and a 1-year prescription costs US$400–1200, or if 8–16 sessions of psychotherapy cost US$1600–3200, both therapies would be deemed largely cost-effective. An individual’s decision, by contrast, will and should be more variable, and no one can categorically reject or require such treatments for all patients.

Figure 3

A schematic graph showing gains in QALY due to typical pharmacotherapies or psychotherapies. A patient may start with PHQ-9 of 20, corresponding with EQ-5D index value of 0.5. Then they may improve after 2 months of antidepressant therapy to EQ-5D score of 0.9 (solid line), while they may improve to EQ-5D score of 0.8 even if on placebo (dashed line). If we assume that the same difference would continue for the ensuing 10 months while showing slow gradual improvement in both cases, the gain in QALY per year would be equal to 0.09 QALY; if we assume that the difference would eventually wear out over the course of the year due to naturalistic improvements to be expected in the control group, the gain in QALY per year would be equal to 0.05 QALY. Please note that this is a schematic drawing for illustrative purposes: in reality, the changes will be more smoothly curvilinear but the calculation will be similar. EQ-5D, Euro-Qol Five Dimensions; PHQ-9, Patient Health Questionnaire-9; QALY, quality-adjusted life years.
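The arithmetic behind the two QALY figures can be reproduced as the area between two piecewise-linear utility curves, as drawn schematically in figure 3. The Python sketch below is an illustration under stated assumptions: the month-12 utilities are hypothetical (only the difference between the curves matters), the willingness-to-pay value is the US$50 000 per QALY quoted in the text, and the function names are invented for this example.

```python
def qaly_area(trajectory):
    """Area under a piecewise-linear utility curve, in QALYs.

    trajectory: list of (month, utility) points in time order.
    """
    total = 0.0
    for (t0, u0), (t1, u1) in zip(trajectory, trajectory[1:]):
        # Trapezoid between consecutive points; months are converted to years.
        total += (u0 + u1) / 2.0 * (t1 - t0) / 12.0
    return total


def qaly_gain(treated, control):
    """Difference in QALYs between a treated and a control trajectory."""
    return qaly_area(treated) - qaly_area(control)


# Both arms start at 0.5; at 2 months they reach 0.9 (active) and 0.8 (placebo).
# Month-12 values are hypothetical and chosen only to define the two scenarios.
treated = [(0, 0.5), (2, 0.9), (12, 0.95)]
control_persisting = [(0, 0.5), (2, 0.8), (12, 0.85)]   # 0.1 gap is maintained
control_catching_up = [(0, 0.5), (2, 0.8), (12, 0.95)]  # gap closes by month 12

gain_persisting = qaly_gain(treated, control_persisting)
gain_wearing_out = qaly_gain(treated, control_catching_up)

WTP_USD_PER_QALY = 50_000  # threshold quoted in the text
for label, gain in [("difference persists", gain_persisting),
                    ("difference wears out", gain_wearing_out)]:
    break_even = round(gain, 2) * WTP_USD_PER_QALY
    print(f"{label}: {gain:.2f} QALY, break-even cost about US${break_even:.0f}")
```

Under these assumptions the two scenarios give gains of about 0.09 and 0.05 QALY, which is where the US$4500 and US$2500 break-even costs come from.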

Several caveats should be considered when interpreting the results. First, our sample was limited to participants of trials of iCBT. It may be argued that the results, therefore, would not apply to patients with depression undergoing other therapies or in other settings. Second, the correlations between PHQ-9 and EQ-5D were strong enough for total scores at endpoint and for change scores to justify linking but were somewhat weaker at baseline, probably due to limited variability in PHQ-9 scores at baseline because some studies required minimum depression scores. However, the overall correspondence between PHQ-9 scores and EQ-5D had the same shape between baseline and endpoint, which will increase credibility of the linking at baseline as well. Third, we were able to compare PHQ-9 to EQ-5D-3L only. The EQ-5D-5L, which measures health in five levels instead of three, has been developed to be more sensitive to change and to milder conditions.36 When data become available, we will need to link PHQ-9 and EQ-5D-5L to examine if we can obtain similar conversion values.

Our study also has several important strengths. First, our sample included patients with subthreshold depression and major depression, recruited from the community, the workplace and primary care. Furthermore, they encompassed mild through severe major depression in approximately equal proportions. Second, all the patients in our sample received iCBT or control interventions including care as usual. Potential side effects of different antidepressants, repetitive brain stimulation, electroconvulsive therapy and other more aggressive therapies must of course be taken into consideration when evaluating their impacts, but our estimates, arguably independent of major side effects, can better inform such considerations. Finally, unlike any prior studies, we were able to link specific PHQ-9 scores and their change scores to EQ-5D-3L index values.

Conclusion and clinical implications

In conclusion, we constructed a conversion table linking the EQ-5D, the representative generic preference-based measure of health status, and the PHQ-9, one of the most popular depression severity rating scales, for both its total scores and change scores. The table will enable fine-grained assessment of the burden of depression at its various levels of severity and of the impacts of its various treatments, which may bring various degrees of improvement at the expense of some potential side effects.

References

1. Drummond MF, Sculpher MJ, Claxton K. Methods for the economic evaluation of health care programmes. Oxford, UK: Oxford University Press, 2015.
2. EuroQol Group. EuroQol--a new facility for the measurement of health-related quality of life. Health Policy 1990;16:199–208. doi:10.1016/0168-8510(90)90421-9 pmid:10109801
3. Devlin NJ, Brooks R. EQ-5D and the EuroQol group: past, present and future. Appl Health Econ Health Policy 2017;15:127–37. doi:10.1007/s40258-017-0310-5 pmid:28194657
4. Ware JE, Snow KK, Kolinski M. SF-36 health survey manual and interpretation guide. Boston, MA: The Health Institute, New England Medical Center, 1993.
5. Brazier JE, Roberts J. The estimation of a preference-based measure of health from the SF-12. Med Care 2004;42:851–9. doi:10.1097/01.mlr.0000135827.18610.0d pmid:15319610
6. Brazier J. Is the EQ-5D fit for purpose in mental health? Br J Psychiatry 2010;197:348–9. doi:10.1192/bjp.bp.110.082453 pmid:21037210
7. Brazier J, Connell J, Papaioannou D, et al. A systematic review, psychometric analysis and qualitative assessment of generic preference-based measures of health in mental health populations and the estimation of mapping functions from widely used specific measures. Health Technol Assess 2014;18:vii–viii, xiii–xxv, 1–188. doi:10.3310/hta18340 pmid:24857402
8. Payakachat N, Ali MM, Tilford JM. Can the EQ-5D detect meaningful change? A systematic review. Pharmacoeconomics 2015;33:1137–54. doi:10.1007/s40273-015-0295-6 pmid:26040242
9. Brazier JE, Yang Y, Tsuchiya A, et al. A review of studies mapping (or cross walking) non-preference based measures of health to generic preference-based measures. Eur J Health Econ 2010;11:215–25. doi:10.1007/s10198-009-0168-z pmid:19585162
10. Aceituno D, Pennington M, Iruretagoyena B, et al. Health state utility values in schizophrenia: protocol for a systematic review and meta-analysis. Evid Based Ment Health 2019;22:142–4. doi:10.1136/ebmental-2019-300089 pmid:31126911
11. Karyotaki E, Efthimiou O, Miguel C, et al. Internet-based cognitive behavioral therapy for depression: a systematic review and individual patient data network meta-analysis. JAMA Psychiatry 2021. doi:10.1001/jamapsychiatry.2020.4364. [Epub ahead of print: 20 Jan 2021]. pmid:33471111
12. Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med 2001;16:606–13. doi:10.1046/j.1525-1497.2001.016009606.x pmid:11556941
13. Wahl I, Löwe B, Bjorner JB, et al. Standardization of depression measurement: a common metric was developed for 11 self-report depression measures. J Clin Epidemiol 2014;67:73–86. doi:10.1016/j.jclinepi.2013.04.019 pmid:24262771
14. Furukawa TA, Reijnders M, Kishimoto S, et al. Translating the BDI and BDI-II into the HAMD and vice versa with equipercentile linking. Epidemiol Psychiatr Sci 2019;29:1–13. doi:10.1017/S2045796019000088 pmid:30867082
15. Beck AT, Steer RA, Brown GK. BDI-II: Beck depression inventory. Second Edition. Manual. San Antonio: The Psychological Corporation, 1996.
16. Radloff LS. The CES-D scale: a self-report depression scale for research in the general population. Applied Psychological Measurement 1977;1:385–401.
17. Mulhern B, Mukuria C, Barkham M, et al. Using generic preference-based measures in mental health: psychometric validity of the EQ-5D and SF-6D. Br J Psychiatry 2014;205:236–43. doi:10.1192/bjp.bp.112.122283 pmid:24855127
18. Revicki D, Hays RD, Cella D, et al. Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. J Clin Epidemiol 2008;61:102–9. doi:10.1016/j.jclinepi.2007.03.012 pmid:18177782
19. Linn RL. Linking results of distinct assessments. Applied Measurements in Education 1993;6:83–102.
20. Leucht S, Kane JM, Kissling W, et al. What does the PANSS mean? Schizophr Res 2005;79:231–8. doi:10.1016/j.schres.2005.04.008 pmid:15982856
21. Leucht S, Fennema H, Engel RR, et al. Translating the HAM-D into the MADRS and vice versa with equipercentile linking. J Affect Disord 2018;226:326–31. doi:10.1016/j.jad.2017.09.042 pmid:29031182
22. Levine SZ, Yoshida K, Goldberg Y, et al. Linking the Mini-Mental state examination, the Alzheimer's disease assessment Scale-Cognitive Subscale and the severe impairment battery: evidence from individual participant data from five randomised clinical trials of donepezil. Evid Based Ment Health 2021;24:56–61. doi:10.1136/ebmental-2020-300184 pmid:33023920
23. Albano AD. An R package for observed-score linking and equating. J Stat Softw 2016;74.
24. Kessler D, Lewis G, Kaur S, et al. Therapist-delivered Internet psychotherapy for depression in primary care: a randomised controlled trial. Lancet 2009;374:628–34. doi:10.1016/S0140-6736(09)61257-5 pmid:19700005
25. Kivi M, Eriksson MCM, Hange D, et al. Internet-based therapy for mild to moderate depression in Swedish primary care: short term results from the PRIM-NET randomized controlled trial. Cogn Behav Ther 2014;43:289–98. doi:10.1080/16506073.2014.921834 pmid:24911260
26. Montero-Marín J, Araya R, Pérez-Yus MC, et al. An Internet-based intervention for depression in primary care in Spain: a randomized controlled trial. J Med Internet Res 2016;18:e231. doi:10.2196/jmir.5695 pmid:27565118
27. Cuijpers P, Noma H, Karyotaki E, et al. A network meta-analysis of the effects of psychotherapies, pharmacotherapies and their combination in the treatment of adult depression. World Psychiatry 2020;19:92–107. doi:10.1002/wps.20701 pmid:31922679
28. Gilbody S, Brabyn S, Lovell K, et al. Telephone-supported computerised cognitive-behavioural therapy: REEACT-2 large-scale pragmatic randomised controlled trial. Br J Psychiatry 2017;210:362–7. doi:10.1192/bjp.bp.116.192435 pmid:28254959
29. Kleiboer A, Donker T, Seekles W, et al. A randomized controlled trial on the role of support in Internet-based problem solving therapy for depression and anxiety. Behav Res Ther 2015;72:63–71. doi:10.1016/j.brat.2015.06.013 pmid:26188373
30. Phillips R, Schneider J, Molosankwe I, et al. Randomized controlled trial of computerized cognitive behavioural therapy for depressive symptoms: effectiveness and costs of a workplace intervention. Psychol Med 2014;44:741–52. doi:10.1017/S0033291713001323 pmid:23795621
31. Mohiuddin S, Payne K. Utility values for adults with unipolar depression: systematic review and meta-analysis. Med Decis Making 2014;34:666–85. doi:10.1177/0272989X14524990 pmid:24695961
32. Parry G, Barkham M, Brazier J. An evaluation of a new service model: improving access to psychological therapies demonstration sites 2006–2009. Programme NSDaO, 2011.
33. Cipriani A, Furukawa TA, Salanti G, et al. Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: a systematic review and network meta-analysis. Lancet 2018;391:1357–66. doi:10.1016/S0140-6736(17)32802-7 pmid:29477251
34. Furukawa TA, Weitz ES, Tanaka S, et al. Initial severity of depression and efficacy of cognitive-behavioural therapy: individual-participant data meta-analysis of pill-placebo-controlled trials. Br J Psychiatry 2017;210:190–6. doi:10.1192/bjp.bp.116.187773 pmid:28104735
35. Shiroiwa T, Sung Y-K, Fukuda T, et al. International survey on willingness-to-pay (WTP) for one additional QALY gained: what is the threshold of cost effectiveness? Health Econ 2010;19:422–37. doi:10.1002/hec.1481 pmid:19382128
36. Herdman M, Gudex C, Lloyd A, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res 2011;20:1727–36. doi:10.1007/s11136-011-9903-x pmid:21479777

Footnotes

Twitter @Toshi_FRKW, @szlevine
TAF and SZL contributed equally.
Contributors TAF and EK conceived the study. TAF and SZL designed the study. PC and EK selected the studies and collected, cleaned and combined the IPD. CBu, DDE, SG, SB, DK, MK, CBj, AK, AvS, HR, JM-M, JG-C, RP and JS contributed to the IPD. SZL and TAF analysed the data and interpreted the results. TAF wrote the initial draft manuscript, and all authors provided critical input and revisions to the draft manuscript and approved the final manuscript.
Funding This study was supported in part by JSPS Grant-in-Aid for Scientific Research (grant number 17K19808) to TAF. EK was supported by the Netherlands Organisation for Health Research and Development (NWO; project number 019.182SG.001). JM-M is supported by the Wellcome Trust Grant (104908/Z/14/Z).
Disclaimer The views expressed are those of the authors and not necessarily those of any of the funding agencies listed above.
Competing interests TAF reports grants and personal fees from Mitsubishi-Tanabe, personal fees from MSD, personal fees from Shionogi, outside the submitted work; In addition, TAF has a patent 2018-177688 concerning smartphone CBT apps pending, and intellectual properties for Kokoro-app licensed to Tanabe-Mitsubishi. JMM is supported by the Wellcome Trust Grant (104908/Z/14/Z).
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

