Introduction

Disease control is the primary goal of asthma therapy being linked to absence of symptoms and exacerbations1. Regrettably, up to half of the patients are poorly controlled2,3. Despite therapeutic advances, numbers remained fairly unaltered during the last decade.

The clinical and pathophysiological explanations associated with poor disease control are heterogenous. In general, more severe disease is related to more frequent exacerbations, health care contacts3 and symptoms4. Lung function is also impaired in severe disease indicated by a lower forced expiratory volume in one second (FEV1) and lower forced vital capacity (FVC)3. Both parameters represent rather central sites of obstruction. However, involvement of peripheral airways is common in the majority of asthmatic patients5. This holds true across the whole spectrum of severity6 and may be a consequence of several influencing factors. These include inflammation, wall thickening, smooth muscle hypertrophy, and mucus7,8,9,10. However, changes in the lung periphery are still often missed by commonly used techniques such as spirometry.

Impulse oscillometry (IOS) is an inexpensive non-invasive technique to measure airway resistance. It was shown to identify small airway obstruction11, the related characteristics of disease control12,13 and response to acute bronchodilator therapy14. Small airway dysfunction was shown to impair quality of life in smokers15. This even applies in absence of obstructive lung disease.

Other techniques aiming at an early diagnosis or phenotyping have also been investigated. Most recently, Sugawara and co-workers could identify three subtypes of cough variant asthma. They were able to demonstrate a differential therapeutic effect of inhaled corticosteroids (ICS)16. Eosinophilic airway inflammation is common in asthma and associated with increased corticosteroid responsiveness. Blood eosinophilia is already a widely used parameter in patients with severe disease planned for targeted antibody therapy in personalized medicine17. Single surrogate markers such as blood eosinophil count, immunoglobulin E, and fractional exhaled nitric oxide (FeNO) have overall moderate diagnostic accuracy18. However, it was recently demonstrated that FeNO predicts response to ICS in patients with non-specific respiratory symptoms19. These may represent a distinct phenotype or stage of asthmatic disease not detected using current diagnostic work-up. Ventilation heterogeneity is also frequent in obstructive lung disease and can be assessed using multiple breath wash-out testing (MBW). The technique was shown to be feasible in adult patients with chronic obstructive pulmonary disease (COPD)20,21,22 and pulmonary hypertension23. Asthmatic patients were found to have increased ventilation heterogeneity as compared to controls24,25 that could be improved by therapy24,26,27 and is worse during exacerbations28. Moreover, previous investigations also suggest a link to poorer asthma control27,29. An association of ventilation heterogeneity with airway responsiveness was seen in asthma but not in COPD30.

Hence, MBW has great potential to improve phenotyping as well as differential diagnostics. Nevertheless, several metrological issues have still to be overcome. Most MBW data in bronchial asthma has been acquired in paediatric patients using nitrogen (N2) as tracer gas. The latter potentially suffers from technical issues such as N2-back diffusion, measurement inaccuracies and vulnerability to leak flows31,32,33,34. Mass spectrometry using open wash-in MBW sulphur hexafluoride (SF6) is considered the gold standard35. However, this has been associated with high costs and effort. Introduction of a photo-magneto-acoustic multi-gas analyser allowed the direct measurement of SF6 concentrations with high accuracy36. Recently, this led to construction of a closed circuit SF6-MBW setup considerably facilitating application and reducing costs37. We therefore set out to evaluate the feasibility of SF6-MBW in adult patients with bronchial asthma and derive potential clinical applications.

Methods

Subjects

We prospectively evaluated patients with known or first diagnosis of bronchial asthma as well as non-asthmatic controls. Classification criteria are given in detail in the online supplement. The study protocol was approved by the Medical Ethics Committee II of the Medical Faculty Mannheim (Heidelberg University), compliant with the Declaration of Helsinki and registered at clinicaltrials.gov (NCT03820427). Written informed consent was obtained from all participants and/or the respective legal guardian prior to inclusion.

Study protocol

Spirometry and whole-body plethysmography

All subjects underwent pre-bronchodilator conventional lung function testing including spirometry as well as whole-body plethysmography (MasterScreen Body) and were in stable clinical condition. Existing maintenance therapy was not withheld. Reversibility testing was then performed with doses of 40 µg ipratropium bromide and 100 µg fenoterol hydrobromide administered via soft-mist haler (Berodual Respimat, Boehringer Ingelheim Pharma GmbH & Co. KG, Germany). Indications include presence of either obstruction (FEV1/FVC < 70% of predicted), hyperinflation (residual volume/total lung capacity (RV/TLC) > 40%), and/or respective flow-volume-curves or flow-pressure curves. In addition to these parameters, maximum expiratory flow (MEF) at 75, 50 and 25% of FVC as well as functional residual capacity (FRC) were determined. Reference values were derived from the revised 1993 version of the European Community for Steel and Coal equations (ECSC)38.

Transfer factor for carbon monoxide (TLCO)

TLCO was determined pre-bronchodilator in single breath technique and corrected for ventilated alveolar volume (TLCO/VA). The difference was defined as ΔTLCO.

Multiple breath wash-out (MBW) testing

A commercially available closed-circuit system (Innocor, PulmoTrace ApS, Glamsbjerg, Denmark) was used for MBW measurements as previously described in detail37. The device consists of a 3-liter rebreathing bag filled with a mixture of room air and test gas (94% O2, 1% SF6 and 5% N2O, PulmoTrace ApS) from an on-board gas cylinder. Functional residual capacity (FRC), acinar (Sacin) and conductive (Scond) ventilation heterogeneity were derived from three consecutive wash-outs using proprietary software provided by the manufacturer (software version 8.0 beta 1). LCI was calculated for the 2.5% (LCI2.5) and 5% (LCI5) stopping points, respectively. Subjects were breathing tidally and the test was stopped when end tidal SF6 had fallen below 1/40 of the starting concentration for three consecutive breaths. Wash-out was always performed until the later LCI2.5 stopping point. To show the time saving potential of an earlier LCI5 stopping point, wash-out times were calculated separately. Only patients with at least two technically acceptable LCI measurements based on slightly modified ATS/ERS criteria (online supplement) were included in final analysis. All subjects underwent three consecutive pre-bronchodilator MBW tests in upright position.

Impulse oscillometry (IOS)

IOS measurements were performed during tidal breathing (MasterScreen IOS, CareFusion 234 GmbH, Höchberg, Germany). We conducted three consecutive pre-bronchodilator measurements and derived parameters of airway resistance (frequency dependence D5–20) and reactance (resonance frequency Fres, reactance area Ax), respectively. In patients undergoing reversibility testing according to above mentioned indications, three consecutive post-bronchodilator measurements were additionally performed.

All lung function tests were independently assessed by two experienced investigators. MBW and IOS were conducted prior to all forced lung function manoeuvres. The difference between FRC as determined by whole-body plethysmography and MBW was defined as ΔFRCpleth-MBW.

Statistical analysis

Mean values are given ± standard deviation (SD) unless stated otherwise. Differences between groups were assessed by Student’s t-test for continuous variables or Chi-squared test for categorical variables. The coefficient of variation (CV) was calculated as SD/mean from two or three valid MBW measurements, respectively. Sample size calculation was based on previously published data39. An overall cohort of 60 subjects would provide 95% power to detect a 1.8 ± 0.4 difference in LCI with a 2:1 allocation of patients and non-asthmatic controls, respectively. An alpha error of less than 5% in two-sided testing was considered statistically significant. R Statistical Software (v3.4.2, Foundation for Statistical Computing, Vienna, Austria) was used for all data analysis40. A subgroup of asthma patients with normal lung function in conventional testing was prespecified and defined as having normal FEV1 (>80% of predicted) and no obstruction in spirometry (FEV1/FVC > 70%). All lung function parameters differing between asthma patients with normal lung function and non-asthmatic controls in univariate analysis (p ≤ 0.05) were included in an elastic-net generalized linear model corrected for age. L1 regularization was used for variable selection based on a k-fold cross validation approach to find optimal hyper-parameters. Diagnostic performance was evaluated using receiver operating curve (ROC) analysis and calculation of area under the curve (AUC).

Results

Final analysis was performed in 91 patients with bronchial asthma (66%) and 47 non-asthmatic controls (34%) with valid data sets. Only two valid MBW measurements could be obtained in 11 non-asthmatic controls (23%) and 32 asthmatics (35%, p = 0.2). Five asthmatic patients (3.4%) had to be excluded due to the prespecified acceptability criteria for MBW testing whereas in another 4 patients (2.7%) only one valid MBW measurement could be obtained. Baseline characteristics are given in Table 1. Reversibility testing was performed in 63 patients (69% of asthmatic patients) with details given in Supplementary Table S1. Positive rates were 38% for FEV1 and Ax, 21% for RV and 29% for D5–20, respectively.

Table 1 Baseline characteristics.

All lung function parameters differed between asthmatic patients and non-asthmatic controls except for total lung capacity (TLC), functional residual capacity as determined by whole-body plethysmography (FRCpleth) and ΔTLCO (Table 2). Forty-one asthma patients had normal lung function in spirometry with details of differences given in Supplementary Table S2. LCI2.5 and LCI5 were both significantly higher in asthmatic patients as compared to non-asthmatic controls (Fig. 1A, p < 0.001). Likewise, Sacin and Scond were both elevated and significantly different (Fig. 1B, p < 0.001 and p < 0.01). CV was 3.4% for LCI2.5 and 3.5% for LCI5 in patients with asthma. No significant differences were found as compared to 3.0% for LCI2.5 (p = 0.28) and 3.2% for LCI5 (p = 0.37) in non-asthmatic controls. Parameters of local ventilation heterogeneity showed considerably larger variation. CVs were 42% for Sacin and 82% for Scond in asthma as compared to 39% (p = 0.74) and 127% (p = 0.17) in non-asthmatic controls, respectively. ΔFRCpleth-MBW was 0.04 ± 0.52 L in non-asthmatic controls as compared to 0.61 ± 0.75 L in asthmatic patients (Fig. 1C, p<<0.001). Asthmatic patients with normal spirometry also had an elevated ΔFRCpleth-MBW of 0.46 ± 0.62 as compared to non-asthmatic controls (p < 0.001).

Table 2 Lung function testing (whole cohort).
Figure 1
figure 1

MBW parameters. Box-and-whisker-plots for (A) LCI2.5 (p < 0.001), (B) Sacin (p < 0.001) and (C) ΔFRCpleth-MBW (p<<0.001) separated for asthma patients and non-asthmatic controls. Connectors indicate statistically significant (p < 0.05) differences.

In the regularized generalized linear model used for variable selection, FEV1/FVC, maximum expiratory flow at 25% of FVC (MEF25), LCI2.5, FRCMBW, area under the reactance curve (Ax), resonance frequency (Fres), TLCO/VA and ΔFRCpleth-MBW remained as predictors for asthma after adjustment for age and smoking status. ROC analysis revealed an AUC of 0.906 for the differentiation from non-asthmatic controls that exceeded diagnostic performance of the individual parameters (Fig. 2A). Diagnostic accuracy was significantly lower when only including parameters of conventional lung function testing (AUC = 0.819, p < 0.05, Fig. 2B). A model only consisting of conventional lung function testing and MBW but without IOS parameters yielded an improved AUC of 0.863 (p = 0.2, Fig. 2C). No differences were found between LCI5 (AUC = 0.683) and LCI2.5 (AUC = 0.697, p = 0.27) when used as distinct parameters. Sacin and Scond had lower diagnostic accuracy (AUC = 0.615 and 0.593).

Figure 2
figure 2

Diagnostic performance. ROC analysis for differentiation of asthmatic patients with normal conventional lung function testing (n = 41, FEV1 ≥ 80% predicted, FEV1/FVC ≥ 70%) and non-asthmatic controls (n = 47). (A) Good diagnostic performance (AUC = 0.906) of the model when including predictors identified during regularization. (B) Significantly lower accuracy (AUC = 0.819, p < 0.05) when including only parameters of conventional lung function testing. (C) Addition of MBW without IOS parameters also improves diagnostic performance (AUC = 0.863, p = 0.2).

Mean total wash-out time was 102 ± 37 s in non-asthmatic controls and 102 ± 39 s in patients with asthma for LCI2.5. When using the earlier LCI5 stopping point, wash-out times were significantly reduced by 24 s (p<<0.001) and 28 s (p<<0.001), respectively.

Discussion

We were able to demonstrate that MBW measurements are feasible in patients with bronchial asthma. Global and local ventilation heterogeneity is increased as compared to non-asthmatic controls. These differences persist when comparing asthmatic patients with normal conventional lung function parameters to non-asthmatic controls. MBW parameters therefore may contain additional information that can be used to identify asthmatic patients with abnormal ventilation missed by current tests. Including parameters of ventilation heterogeneity and small airway disease significantly improves diagnostic accuracy of our prediction model. Using an earlier stopping point does not affect diagnostic performance of LCI while considerably reducing wash-out duration.

Our findings are in accordance with previously published data in asthmatic children. Residual global ventilation heterogeneity could be identified despite well-controlled disease and normal FEV1. We also found significant differences between adults with asthma and non-asthmatic in acinar but not conductive ventilation heterogeneity. In contrast, parameters of regional inhomogeneity did not differ significantly between asthmatic and healthy children25. LCI was moderately, but statistically significantly elevated in a smaller cohort of asthmatic children while effect of acute bronchodilator therapy with salbutamol was detected. However, a very small subgroup of four patients with large improvement of airway obstruction were identified to show a decreased ventilation heterogeneity41. MBW indices were found not to be ideal for assessing bronchodilator reversibility in younger children with stable wheeze42. Although measurements using our setup themselves were demonstrated to be rather time effective, an additional interval should have been introduced after the forced breathing manoeuvres required during spirometry and plethysmography. We therefore decided to focus on the pre-bronchodilator in our investigation acknowledging the potential value of additional post-bronchodilator measurements to be further evaluated in adults. Abnormalities identified visually using magnetic resonance imaging (MRI) were shown to significantly correspond to poor asthma control43. In adult respiratory care, application of MBW comprises numerous potential scenarios. Kjellberg and co-workers were able to identify a “small airway subtype” in asthma patients. It was linked to reduced FEV1, smoking and blood eosinophilia and may potentially improve phenotyping. IOS and N2-MBW were found to give complementary information about peripheral airway dysfunction. These are attributable to different domains of small airway disease addressed by either test as recently demonstrated5. Altered values of D5–20 and LCI were found in 23% and 45% of asthmatic patients with normal FEV144. This favourably corresponds to our findings at hand in asthma and also with previous reports in COPD using IOS and MBW, respectively. Small airway disease was detected with IOS even in early stages and linked to typical symptoms such as dyspnoea45. In early stages, IOS parameters outperformed conventional spirometry when differentiating heavy smokers with normal lung function as well as early-stage COPD46. Likewise, LCI was shown to be elevated in COPD in general and increases with GOLD stage20. Even in absence of spirometric obstruction, LCI was already pathologically altered22. Differences in FRC as measured by plethysmography and MBW were described previously in patients with obstructive lung disease39,47,48. When using an exogenous tracer gas such as SF6, these can be easily explained as only ventilated areas will contribute to FRC determination. In contrast, all compressible gas volumes are measured in body plethysmography49. Correspondingly, we found an increased ΔFRCpleth-MBW in asthmatic patients even in absence of spirometric changes. Inclusion of this MBW parameter increases diagnostic accuracy already considerably. A model based on all objectively selected parameters including MBW and IOS further improves diagnostic accuracy. This outperforms both conventional tests and as well as distinct parameters from novel tests alone. Therefore, ΔFRCpleth-MBW itself may yield diagnostic information due to tracer gas not reaching non-ventilated areas when obstruction is present. Taken together, evidence exists that novel parameters of small airway disease improve diagnostic accuracy and have therapeutic implications.

In clinical and scientific routine, several metrological aspects should be considered. Recently, it was demonstrated that duplicate LCI measurements are sufficient in adults39 and preschool children50. Intra-test repeatability is not affected while total test time is reduced significantly. These findings could be replicated in pre-school children suffering from cystic fibrosis using an N2-based setup51 with all three studies opposing current general recommendations35. Although we were able to acquire at least two successful runs in at least 90% of asthmatic patients and all non-asthmatic controls, using an earlier stopping point may eventually contribute to even higher success rates. In accordance with our findings, LCI5 was shown to be reliable in childhood and adolescence52 with equal diagnostic performance to the traditional stopping point53. However, this approach requires separate normal values that may be overcome using predictive modelling. An algorithm was demonstrated to reliably extrapolate LCI2.5 from earlier stopping points with reasonable accuracy. Additionally, this has the potential to further reduce wash-out time by up to 41% when using a 10% cut-off for prediction54. A similar 43% reduction in wash-out time can be potentially achieved when estimating LCI using Bayesian modelling55. Meaningful differences in MBW outcome measures depending on the test gases have been described20,56. In general, nitrogen-based systems reproducibly yield higher absolute LCI readings than SF6-MBW49,57,58. N2 has a higher diffusion rate and smaller molar mass as compared to SF6 resulting in a more proximal diffusion-convection front35. While SF6 may not reach very poorly ventilated regions during wash-in, endogenous N2 may prolong wash-out from these regions resulting in higher LCI values. Hence, attention has to be paid to test gas choice when interpreting MBW results59. Despite the above-mentioned potential advantages of SF6 -MBW, it should be noted that tracer gas still is associated with considerable costs. The system we used is commercially available and has regulatory approval (FDA and CE mark). Nevertheless, SF6 is not allowed for medical use in several countries. While no solid reference values are available for closed-circuit SF6-MBW in adults to date, age was identified the major factor influencing factor for nitrogen-based LCI measurements in adults and therefore prespecified as a control variable in our model60. The influence of height seems to be negligible in patients older than six years while a nonlinear decrease pattern was described when using SF6 as tracer gas61. Single breath N2-wash-out can be seen as a potentially time efficient alternative to MBW testing. However, sensitivity was found to be lower in adult asthmatic patients as compared to MBW and may therefore be a marker of more severe disease62. Within-test repeatability found in our study is comparable to previously published data in stable adults showing CVs between 2.5 and 4.5% for LCI using SF6 as tracer gas37,39,63,64. When paediatric populations are investigated, CV tends to be slightly higher ranging from 3.2 to 8.5%50,54,65. Likewise, considerably larger CVs have been shown for Scond and Sacin with more variability in non-asthmatic controls than in COPD ranging from 17 to 73%66. This corresponds favourably to our findings. We found similar CV ranges in both groups except for Scond in non-asthmatic controls showing a higher repeatability of 127%. We can reasonably exclude poor quality control as all lung function tests were independently assessed by two experienced investigators. Rather this may be attributable to the overall lower values of Scond making it more vulnerable to variation in non-asthmatic controls. The fact that variation was largest in non-asthmatic controls lets us conclude that this does not negatively affect diagnostic performance in disease. In summary, our findings add well to previous reports in paediatric cohorts and different test gases and may further facilitate the routine use of MBW. Results obtained with different tracer gases should not be compared directly.

When interpreting our results, several points should be taken into consideration. First, we did not obtain structured information on asthma control, symptoms or medication. Moreover, diagnosis relied on a respiratory specialist evaluation according to Global Initiative for Asthma (GINA) criteria1. Although this clinical information would have been valuable, we do not consider this a major drawback of our study as we primarily set out for feasibility evaluation. The same holds for biomarkers such as laboratory parameters or FeNO. The latter did not show differences between asthmatic children and controls whereas LCI was significantly elevated25. Second, our cross-sectional design does not allow to provide estimates of the minimal clinical important difference for SF6-MBW. Due to the high intra-test repeatability, it should be possible to detect small longitudinal changes while further research is required. Nevertheless, our findings contribute to closing another knowledge gap in adult MBW testing. Personalized medicine cannot be reduced to a single biomarker or functional test. Assessment of ventilation heterogeneity using MBW may be one brick in the wall for improving phenotyping of asthmatic patients that warrants further investigation. Third, reference values were derived from the revised 1993 version of European Community for Steel and Coal (ECSC) equations38. This was mainly as the current Global Lung Function Initiative (GLI) reference values are restricted to spirometry. Requiring separate reference calculations for whole-body plethysmography, we aimed to avoid any mixing of the two statistically differing concepts.

Conclusions

SF6-MBW measurements are feasible in adult patients with bronchial asthma. Indices of global (LCI) and local ventilation heterogeneity (Sacin, Scond) both are elevated as compared to non-asthmatic controls. Differences persist in asthmatic patients without obstruction in spirometry. MBW parameters therefore may contain additional information that can be used to identify asthmatic patients with abnormal ventilation missed by current tests. For LCI as the most important MBW outcome measure, using an earlier stopping point does not affect diagnostic performance while considerably reducing the duration of the wash-out procedure. Reproducibility was high for both LCI stopping points. Sacin and Scond both showed larger variability.