Introduction

Interstitial lung disease (ILD) is frequent in systemic sclerosis (SSc), with a prevalence reported between 35 and 52% [1]. Considering that ILD represents a major cause of mortality in SSc [2], preventing the progression of respiratory impairment is of utmost importance to improve life expectancy of SSc patients and it has been the target of several investigational studies. Pulmonary function tests (PFTs) were always, although variably, included among the assessed outcomes, and, in clinical practice, PFTs have an established role in screening and follow-up of SSc patients [3].

Lung involvement is also a major contributor to morbidity in SSc [4], thus it is conceivable that pulmonary complications may affect health-related quality of life (HRQoL). Among the generic instruments available to assess HRQoL in patients with SSc, short-form 36 (SF-36), health assessment questionnaire (HAQ), and EQ-5D are widely used for their reliability and validity. Feasibility, acceptability, and construct validity of EQ-5D have been demonstrated in SSc patients and, when compared with the scleroderma HAQ and with SF-36, the EQ-5D was shown to cover all domains [5]. Tools to assess HRQoL are also adopted in randomized trials, typically as secondary outcomes, remarking the importance of evaluating the efficacy of novel therapies also in terms of HRQoL improvement. An association between impaired pulmonary function and decreased EQ-5D has been described in SSc [6], in patients with other respiratory diseases [7], and also in lung-healthy individuals [8]. Nevertheless, the extent of contribution given by changes in PFTs over time to the evolution of HRQoL in SSc patients is largely unknown.

Therefore, the aim of our study is to retrospectively evaluate how changes in PFTs contribute to longitudinal variations in HRQoL assessed using the multidimensional questionnaire EQ-5D in a large cohort of patients with SSc, at first considering the whole population and then only patients with ILD. Secondly, based on previously described minimum clinically important difference (MCID) [9], we determined the number of patients in the entire cohort experiencing relevant fluctuations in EQ-5D utility score, and whether these could be explained by changes in PFTs or not.

Materials and methods

We conducted a retrospective observational study including adult patients enrolled in the Leiden Combined Care in SSc (CCISS) cohort [10] and having a first visit performed between 2 April 2009 and 30 April 2018. Since 2009, at Leiden University Medical Center (LUMC), SSc patients are followed in a standardized care pathway, consisting of yearly multidisciplinary medical consultations and extensive work-up, comprising PFTs, x-ray and high-resolution computed tomography (HRCT) of the thorax, electrocardiogram, echocardiography, and 6-min walk test. As part of this comprehensive evaluation, dedicated questionnaires are proposed on an annual basis to all patients, with the aim of assessing HRQoL. For the purpose of this study, follow-up data collected until 2 May 2019 were analysed. Being interested in longitudinal variations, we selected patients with at least one baseline and one follow-up visit in which PFT measurements were performed and complete EQ-5D utility scores and VAS were obtained. All patients had to fulfil the 2013 ACR/EULAR classification criteria for SSc [11]. For cases enrolled before their availability, the abovementioned criteria were retrospectively applied.

Of the 697 patients enrolled in the CCISS cohort until 30 April 2018, 321 were excluded because of not meeting the 2013 ACR/EULAR classification criteria or lack of at least one follow-up visit with complete PFTs and EQ-5D obtained before 2nd May 2019 (Fig. 1).

Fig. 1
figure 1

Patients included in the study. CCISS: Combined Care in Systemic Sclerosis; PFTs: pulmonary function tests; HRCT: high-resolution computed tomography; ILD: interstitial lung disease; FVC: forced vital capacity

Research on the CCISS cohort is approved by the Ethics Committee of LUMC (approval number P09.003), and all patients gave written informed consent. The study was conducted in accordance with the principles of the Declaration of Helsinki.

All data were collected in an electronic database dedicated to the CCISS cohort. In case of missing variables, medical records were checked. Demographic and clinical characteristics were gathered at the first visit included in the study and at each following time point. Disease duration was defined as time since the onset of first sign or symptom attributable to SSc different from Raynaud’s phenomenon. Patients were categorized as never, former, or current smokers, and the presence of pulmonary comorbidities, namely chronic obstructive pulmonary disease, asthma, and a current or recent history of lung cancer, was also described. Disease subset, presence of ILD or pulmonary arterial hypertension (PAH) and skin fibrosis involving the anterior chest or abdomen were reported. Forced vital capacity (FVC%) and single-breath diffusing capacity of the lungs for carbon monoxide corrected for haemoglobin (DLCO%) were expressed as percentages of the predicted reference values in accordance with published standards [12, 13]. The diagnosis of ILD was based on evidence of compatible interstitial abnormalities detected at HRCT of the thorax and reported by an experienced thoracic radiologist.

In the CCISS cohort, the EQ-5D-3L version is used. The questionnaire consists of two distinct parts. The first part is composed by five domains: mobility, self-care, usual activities, pain/discomfort, anxiety/depression. Patients can choose, for each dimension, among three severity levels: no problems, some problems, extreme problems. A sum utility score is calculated using nation-specific algorithms [14]. The Dutch tariff [15] was applied in the present study. Results vary from − 0.59 to 1. Negative scores indicate patient’s perception of a health status worse than death, while a score of 1 means perfect health. The second part of the questionnaire consists of a single visual analogue scale (VAS) through which patients are asked to rate their health of the day from 0 to 100. Higher values represent better health. For the EQ-5D utility score, MCID in SSc has been determined [9], corresponding to an increase of 0.05 to 0.10 for minimally improved categories, and to a decrease of 0.12 to 0.14 for minimally deteriorated categories.

Patients’ characteristics were reported using descriptive statistics with mean and standard deviation, or median and interquartile range (IQR), expressed when appropriate. The association over time between PFTs and EQ-5D was analysed using generalized estimating equations (GEEs). A model with linear response and autoregressive correlation structure was chosen. Two longitudinal models were set to analyse the entire population, with either EQ-5D utility score or VAS as dependent variable. Both models included FVC% and DLCO% as predictors, and were adjusted for age, disease duration, smoking status, presence of pulmonary comorbidities, and skin fibrosis involving anterior chest or abdomen. In subanalyses, the same models were applied to all patients with ILD and to the subgroup of patients with ILD and FVC <70%. A p < 0.05 was considered significant.

Results

We included 378 patients (301 women, 77 men) with a minimum of 2 and a maximum of 11 annual visits, accounting for a median follow-up of 3.7 years (IQR 1.9–6.4) and a total of 1619 measurements (Table 1). Mean age at first visit included in the study was 53.1 ± 15.4 years, and median disease duration was 3.8 years (IQR 1.1–10.5). The majority of patients had the limited cutaneous disease subset (57%), and ILD was present in 156 cases (41%), whereas PAH was considerably less common (3%). At baseline, mean FVC% was 99.1 ± 20.0% and mean DLCO% was 66.1 ± 16.7%. Regarding HRQoL, median EQ-5D utility score was 0.77 (IQR 0.65–0.84) and median EQ-5D VAS was 70 (IQR 55–84).

Table 1 Baseline characteristics of the studied population

The GEE models (Table 2) showed a significant association between increase of FVC% and improvement of EQ-5D over time, in both its utility score and VAS components. The β coefficients demonstrated how a patient with a 1% higher FVC% was expected to have a 0.001 (95% CI 0.000 to 0.002; p = 0.003) higher EQ-5D utility score and a 0.188 (95% CI 0.111 to 0.264; p < 0.001) higher VAS. Furthermore, a significant association between increased DLCO% and improvement of EQ-5D utility score was found, while the results with VAS as dependent variable were non-significant. For each 1% increase in DLCO%, a 0.001 (95% CI 0.000 to 0.002; p = 0.038) growth in utility score and a 0.020 increase in VAS (95% CI -0.079 to 0.120; p = 0.690) were expected.

Table 2 Association of lung function with EQ-5D over time in the entire population

In the first subanalysis, 156 patients with ILD and 715 measurements were included. The GEE models (Table 3) showed a significant association between increase of FVC% and improvement of VAS over time (β coefficient = 0.167; 95% CI 0.058 to 0.277; p = 0.003). In the second subanalysis, 19 patients with ILD and FVC <70% were included, accounting for 94 measurements. The GEE models (Table 3) did not show any significant association between changes in FVC% or DLCO% and evolution of EQ-5D.

Table 3 Association of lung function with EQ-5D over time in 156 patients with ILD at baseline and in the subgroup of 19 patients with ILD and FVC < 70% at baseline

In the studied population, 72% of patients experienced one or more EQ-5D utility score changes consistent with MCID for improvement (≥0.05) or worsening (≤0.12), between any two consecutively available time points. In particular, utility score increase was observed in 111 (30%) and decline in 39 (10%), while 122 patients (32%) experienced both variations during follow-up. EQ-5D changes compatible with MCID were identified in a total of 519 measurements. Of these, 317 (61%) were improvements and 202 (39%) were worsening. According to the β coefficients shown by the GEE models in the entire population, in one case only, the FVC% increase alone could account for the ≥0.05 change in EQ-5D compatible with relevant improvement.

Discussion

We analysed the extent of contribution given by PFTs to change in HRQoL over time in all SSc patients enrolled in the CCISS cohort, then in SSc patients with ILD, and in the subgroup of patients with ILD and FVC < 70%. Our findings demonstrate a statistically significant, but minor, impact of improvement in FVC% and DLCO% on HRQoL in SSc. Specifically, in the studied population, improvement of FVC% is associated with increase in both EQ-5D utility score and VAS over time, and improvement of DLCO% is associated with increase in EQ-5D utility score. Interestingly, in patients with SSc-ILD, the only significant finding was the association between changes of FVC% and variations in VAS, while no significant result was observed in the subgroup of patients with ILD and FVC < 70%. A possible explanation is that, in the subanalyses, we were underpowered to detect other small but significant associations similar to those found in the entire population.

Analysing all patients, we found that for a 1% increase in either FVC% or DLCO%, a 0.001 improvement in utility score is expected. According to the previously described MCID for EQ-5D utility score [9], our results indicate that a substantial variation in FVC% or in DLCO% would be needed to determine a change in EQ-5D utility score approaching MCID. In particular, the magnitude of change required to reach MCID for worsening is unreasonable, being equal to a 120% deterioration in FVC% or DLCO%, whereas the 50% increase necessary to explain MCID for improvement is extremely rare in clinical practice. Taken as a whole, these results indicate that other aspects contribute largely to SSc patients’ perception of quality of life, even when ILD with ventilatory impairment is present. Consequently, we cannot expect drugs specifically targeting SSc-associated ILD to have an impact on general HRQoL large enough to approach EQ-5D utility score MCID.

Analysing data from Scleroderma Lung Study I (SLS-I) and II (SLS-II), a MCID for FVC% changes after 12 months in patients with SSc and ILD has been estimated. It corresponds to a change between 3 and 5.3% for improvement, and between − 3 and − 3.3% for worsening [16]. When subjects who improved or worsened were compared, a statistically significant difference in patient reported outcomes was noted, suggesting a relationship between better pulmonary function and improvement of HRQoL. However, this analysis included a selected trial population, with evidence of ILD and symptoms of breathlessness, and the tools used to assess HRQoL were both generic and specific for respiratory manifestations, but EQ-5D was not comprised. Conversely, we studied an unselected cohort, with data collected over a considerably longer time span, and focusing the analysis on EQ-5D variation.

In line with our findings, a large international survey observed that patients with SSc frequently complain of impaired HRQoL but lung involvement is not among the most important factors influencing disease severity and daily life from the patient’s perspective [17]. In the ASTIS trial [18], a significant difference in mean change of EQ-5D utility score was shown after 2 years for patients treated with haematopoietic stem cell transplantation compared with cyclophosphamide. When other patient-reported outcomes are considered, as in the recent SENSCIS trial [19], the superiority of antifibrotic therapy over placebo in lowering the annual rate of FVC% decline was not reflected by a concurrent change in HRQoL measures.

Beyond SSc, EQ-5D has been used also in other rheumatic diseases. In rheumatoid arthritis it is valid, reliable, and responsive to change [20]. EQ-5D is associated with disease activity, functional impairment, coronary artery syndrome, depression, and anxiety [21]. Moreover, EQ-5D has been included among patient reported outcomes investigated in trials of new treatments [22] and to assess the efficacy of disease modifying antirheumatic drugs, indicating the need to achieve control of disease activity to improve patients’ quality of life [23]. However, although EQ-5D has been widely applied to rheumatic diseases, the extent of contribution given by respiratory impairment to HRQoL in patients with lung involvement secondary to autoimmune rheumatic diseases is largely unknown.

Our research is not without limitations. The EQ-5D was not originally developed for SSc, neither for respiratory diseases. It is thus possible that EQ-5D is not sensitive enough to SSc-specific factors. Moreover, we used the EQ-5D-3L version which, compared with the 5L version, is less sensitive, less precise, and more prone to ceiling effect [24]. Finally, we took several factors into account, but we cannot exclude that in the complicated relationship between lung physiology and HRQoL, other confounders might play a role. Nevertheless, we also believe that our study has strengths. First, all PFTs were standardly and centrally obtained in the same department of our academic centre, ensuring quality of the measurements. Secondly, according to the inclusion criteria, we had no missing data in either dependent or independent variables and we could contribute information from a large number of patients with heterogeneous disease characteristics. As such we were able to include 1619 time points describing 378 patients.

In conclusion, studying a large and unselected cohort of SSc patients we demonstrated that changes in PFTs have a significant, although minor, impact on HRQoL in SSc. Our findings again underline the importance of treating lung involvement in SSc and also indicate that generic questionnaires might not be sensitive enough to evaluate the impact on HRQoL of therapies targeting specific SSc manifestations.