Machine learning for the development of diagnostic models of decompensated heart failure or exacerbation of chronic obstructive pulmonary disease.

doi:10.21203/rs.3.rs-2782146/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 05 Aug, 2023

The published version is available in Scientific Reports.

Heart failure (HF) and chronic obstructive pulmonary disease (COPD) are two chronic diseases with the greatest adverse impact on the general population, and early detection of their decompensation is an important objective. However, very few diagnostic models have achieved adequate diagnostic performance. The aim of this study was to develop diagnostic models of decompensated HF or COPD exacerbation with machine learning techniques based on physiological parameters.

A total of 135 patients hospitalized for decompensated heart failure and/or COPD exacerbation were recruited. Each patient underwent three evaluations: one in the decompensated phase (during hospital admission) and two more consecutively in the compensated phase (at home, 30 days after discharge). In each evaluation, heart rate (HR) and oxygen saturation (Ox) were recorded continuously (through a pulse oximeter) during a period of walking for 6 minutes, followed by a recovery period of 4 minutes.

To develop the diagnostic models, predictive characteristics related to HR and Ox were initially selected through classification algorithms. Potential predictors also included age, sex and baseline disease (HF or COPD). Next, diagnostic classification models (compensated vs. decompensated phase) were developed through different machine learning techniques. The diagnostic performance of the developed models was evaluated according to sensitivity (S), specificity (E) and accuracy (A).

Data from 22 patients with decompensated heart failure, 25 with COPD exacerbation and 13 with both decompensated pathologies were included in the analyses. Of the 99 characteristics of HR and Ox initially evaluated, 19 were selected. Age, sex and baseline disease did not provide greater discriminative power to the models. The techniques with S and E values above 80% were logistic regression (S: 80.83%; E: 86.25%; A: 83.61%) and the support vector machine (S: 81.67%; E: 85%; A: 82.78%).

The diagnostic models developed achieved good diagnostic performance for decompensated HF or COPD exacerbation. To our knowledge, this study is the first to report diagnostic models of decompensation potentially applicable to both COPD and HF patients. However, these results are preliminary and warrant further investigation.

Health sciences/Cardiology

Health sciences/Diseases

Health sciences/Medical research

Heart failure (HF) and chronic obstructive pulmonary disease (COPD) are two chronic diseases with the greatest adverse impact on the general population^1–3. Decompensations (in HF) or exacerbations (in COPD) are especially important since they affect autonomy and quality of life and increase mortality and the need for hospital admissions or visits to emergency services^4–7. Therefore, developing methods that allow early detection of these decompensations is important since such detection allows faster recovery or avoids the need for a major intervention such as hospital admission^8,9.

The methods developed to date to detect early decompensation of both diseases are usually based on ambulatory monitoring of clinical parameters such as heart rate (HR) or oxygen saturation (Ox), using predictive models or diagnostic algorithms applied continuously or intermittently^10,11. Although various algorithms have been developed, very few exceed the sensitivity (S) and specificity (E) threshold of 80%^12–15. In addition, some of those that exceed this threshold are based on monitoring from invasive devices implanted in patients (such as pacemakers or defibrillators)¹² or on the introduction of specific devices in their homes such as indoor air quality analysers¹⁴, all of which restrict their widespread use.

Our group previously developed and reported diagnostic algorithms for the detection of COPD exacerbation (S: 90%, E: 89%) and decompensated HF (S: 85%, E: 75%) based on noninvasive monitoring of physiological parameters of patients in compensated and decompensated phases of their diseases¹⁶. The “expert rules” algorithms were developed based on an analysis of the mean physiological parameters evaluated and a strategy including parallel and serial tests¹⁷. Despite the good diagnostic performance observed, these algorithms suffer from some important limitations, such as inefficient exploitation of the data (although the data were collected continuously second by second, the analysis was based on reducing these data to their means) and the absence of validation. These difficulties limit the acceptance and application of these algorithms in routine clinical practice.

To overcome these limitations, we believe that machine learning (ML) techniques can be useful. This approach allows more efficient and individualized use of the vast amount of data produced by continuous monitoring of physiological parameters¹⁸, especially with respect to early detection of risky clinical situations^18,19. The individualized approach that these techniques allow has been proposed as a way to reduce false positives, which frequently occur when fixed and identical limits or thresholds are used for all patients^19,20. Likewise, these techniques are increasingly used to implement algorithms for monitoring physiological parameters in “real conditions”²¹, which may avoid the need for controlled situations or specific protocols when applying the developed algorithms. Finally, unlike the usual statistical techniques, where inference is usually the most important goal (that is, investigating the relationships between variables or understanding a phenomenon rather than identifying or detecting it), ML techniques have prediction or identification of a situation or event as their primary purpose (for example, identifying whether a patient is in the decompensated phase of a chronic disease)²².

In this study, we report the diagnostic performance of diagnostic algorithms based on physiological parameters (HR and Ox) and developed with ML techniques to classify patients’ disease phases (compensated or decompensated). The recommended guidelines for reporting this type of study have been considered^23–25.

Design

This is a prospective multicentre observational study. Unlike studies of prognostic models, in the present study, diagnostic models were developed, that is, models designed to determine whether a patient was in the compensated or decompensated phase of their disease (exacerbation of COPD and/or HF decompensation).

Sample

The criteria for admission to the study and the recruitment process have been previously reported¹⁶. Patients older than 55 years, able to walk at least 30 m, with a main diagnosis of decompensated HF and/or exacerbation of COPD, and hospitalized at the Department of Internal Medicine, Cardiology or Pneumology, were included. Participants with a pacemaker or intracardiac device, domiciliary oxygen therapy users prior to admission and patients with HF functional class IV of the New York Heart Association (NYHA) were excluded²⁶.

Four hospitals participated: two tertiary university hospitals (600–900 hospital beds) and two regional secondary care hospitals (150–400 hospital beds) from the provinces of Barcelona and Madrid.

Each centre had a trained interviewer, and each department had a reference physician who was accessible to the interviewer. Each day, the interviewer contacted the referring physicians to review the hospitalization census and identify patients with the diagnosis of interest. Next, the interviewer confirmed the main diagnosis (decompensated HF and/or exacerbation of COPD) with the physician responsible for the patient and then contacted the participant (the same day or the next day) to obtain informed consent and verify compliance with all admission criteria of the study. The sample was obtained through convenience sampling, and all patients were enrolled consecutively as they were identified.

The recruitment and follow-up periods lasted 18 months from November 2010.

Evaluation of the participants

Each patient received three identical evaluations: the first in the hospitalization unit (V1) and the other two consecutively and at least 24 hours apart in the participant's home at 30 days after hospital discharge (V2 and V3). Thus, each participant received one evaluation in the decompensated phase (V1) and two in the compensated phase (V2, V3) of their disease.

The evaluation protocol¹⁶ included documentation of symptoms (dyspnoea according to the NYHA²⁶ and Modified Medical Research Council (mMRC)²⁷ scales) and physiological parameters (HR and Ox) in two consecutive periods: effort (walking at a normal pace and on flat terrain for a maximum of 6 minutes) and recovery (seated for 4 minutes after the end of the effort period).

HR and Ox were considered as time series with a sample frequency of 1 Hz, and were collected throughout the evaluation through a pulse oximeter (Model 3100, brand Nonin Medical, Inc., Plymouth, MN, USA) placed on the left index finger.

Reference standard diagnostic test

Given the absence of a single standard diagnostic test to verify whether a patient was in the compensated or decompensated phase of their disease, the clinical judgment of the participant’s responsible physician was considered a standard diagnostic test. Thus, in the decompensated phase, the diagnosis of decompensated HF and/or COPD exacerbation corresponded to the confirmed diagnosis from the participant’s attending physician (in cases of diagnostic doubt, the patient was excluded). For the compensated phase, the standard diagnosis of compensated HF and/or stable COPD was confirmed by a study physician through telephone contact with the participant 30 days after hospital discharge. During this telephone interaction, the patient was considered to be in the compensated phase if none of the following events had occurred since hospital discharge: increased cough, sputum or dyspnoea; initiation of or an increase in corticosteroid use; initiation of antibiotic treatment and medical consultation for worsening of the clinical situation from any cause. In cases of doubt or if the compensated phase could not be confirmed, successive telephone contacts were made until the phase could be confirmed. The interviewer scheduled home visits for the respective evaluations (V2, V3) only after confirmation and within 24–48 hours of receiving confirmation.

Index test: diagnostic algorithms

Initial preparation of potentially predictive variables or characteristics

Given the objective of the study (development of an “online” algorithm capable of detecting the proximity of an exacerbation from HR and Ox data), various characteristics of each of the evaluations were extracted (V1, V2, V3). For this purpose, the effort phase (walking) and recovery phase of each evaluation were separated by verifying the times recorded manually in the data collection records at the beginning and end of each phase of the test and visually reviewing the signals to confirm the manual records. Once the signals were separated according to the evaluation phase, the corresponding characteristics of the available measures were extracted.

Numerous characteristics were extracted from the signals. During each of the tests, two different phases were considered: effort and recovery, which were treated separately. From each of the phases, three signals were considered: HR, Ox and the normalized difference between them. From each of these three temporal signals, characteristics of the temporal domain (the mean and standard deviation) and of the frequency domain (the characteristics of the first and second harmonics, the sum of all harmonics and the first six components of the principal component analysis [PCA] for the normalized fast Fourier transform [FFT] of the signal) were extracted. Accordingly, 13 characteristics of each phase were obtained (26 characteristics total for each evaluation).
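As a rough illustration of the temporal and frequency-domain characteristics described above, the following Python sketch computes the mean, the standard deviation, the two largest harmonics and the sum of all harmonics for a single 1 Hz signal. It is not the authors' code: the function names, the removal of the mean before the transform, the amplitude scaling and the use of the sample standard deviation are all assumptions, and the PCA-based characteristics are omitted because they would have to be fitted across all evaluations.

```python
import cmath
import math
import statistics


def dft_magnitudes(signal):
    """Magnitudes of the one-sided discrete Fourier transform (bin 0 excluded).

    The mean is removed first (assumption) so that the constant level of the
    signal does not dominate the spectrum.
    """
    n = len(signal)
    mean = statistics.fmean(signal)
    centred = [x - mean for x in signal]
    magnitudes = []
    for k in range(1, n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(centred)
        )
        magnitudes.append(abs(coeff) / n)
    return magnitudes


def extract_features(signal, fs=1.0):
    """Hypothetical feature set for one signal in one phase (walk or recovery)."""
    n = len(signal)
    mags = dft_magnitudes(signal)
    # Indices of the harmonics sorted from largest to smallest magnitude.
    order = sorted(range(len(mags)), key=lambda i: mags[i], reverse=True)
    first, second = order[0], order[1]
    return {
        "mean": statistics.fmean(signal),
        "std": statistics.stdev(signal),
        "freq_first_harmonic": (first + 1) * fs / n,  # in Hz
        "amp_first_harmonic": mags[first],
        "freq_second_harmonic": (second + 1) * fs / n,
        "amp_second_harmonic": mags[second],
        "sum_harmonics": sum(mags),
    }


# Synthetic 60-second signal (not patient data): two sinusoids around 90 bpm.
demo = [
    90 + 5 * math.sin(2 * math.pi * 3 * t / 60) + 2 * math.sin(2 * math.pi * 7 * t / 60)
    for t in range(60)
]
features = extract_features(demo)
```

The direct transform is quadratic in the signal length, which is acceptable for recordings of a few hundred samples such as a 6-minute walk at 1 Hz; a library FFT would be the natural choice for longer signals.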

Labelling and definition of the events to be detected

Given that the main objective of the study was detection of a transition from a state considered normal or stable (HF or COPD in the compensated phase [V2, V3]) to a state of decompensation or exacerbation (decompensated phase [V1]), a methodological scheme was applied based on calculation of the differences between the evaluations of each available characteristic. Thus, if a patient had three evaluations (V1, V2 and V3), six differences or useful comparative signals were obtained from these evaluations (V1-V2, V1-V3, V2-V1, V2-V3, V3-V1, V3-V2). The label of each of these comparative signals is illustrated in Fig. 1.

Although the differences V1-V2 and V1-V3 might be more appropriately considered as “decompensation recovery” rather than “no decompensation”, we decided to discard a third label category (“decompensation recovery”) due to the small sample size and because the main objective of the study was the detection of a decompensation.
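The comparison scheme can be sketched in Python as follows. The labels follow the definitions given in this paper (V2-V1 and V3-V1 are “a change to decompensation”, all other comparisons are “no decompensation”). The data layout, the function name and the reading of “X-Y” as the characteristics of X minus those of Y are assumptions made for illustration only.

```python
from itertools import permutations

# V1 is the hospital (decompensated) evaluation; V2 and V3 are home evaluations.
DECOMPENSATED = {"V1"}


def build_comparisons(evaluations):
    """Build labelled difference vectors from one patient's evaluations.

    evaluations: dict mapping "V1"/"V2"/"V3" to a dict of characteristics.
    Returns one row per ordered pair of available evaluations.
    """
    rows = []
    for a, b in permutations(sorted(evaluations), 2):
        # Assumption: "a-b" is the element-wise difference a minus b.
        diff = {name: evaluations[a][name] - evaluations[b][name]
                for name in evaluations[a]}
        # Label 1 only when a compensated evaluation is compared with V1.
        label = 1 if (b in DECOMPENSATED and a not in DECOMPENSATED) else 0
        rows.append({"pair": f"{a}-{b}", "label": label, "features": diff})
    return rows


# Invented values for a single characteristic, for illustration only.
example = {
    "V1": {"meanOxWalk": 91.0},
    "V2": {"meanOxWalk": 95.0},
    "V3": {"meanOxWalk": 96.0},
}
rows = build_comparisons(example)
positives = sorted(r["pair"] for r in rows if r["label"] == 1)
```

With three evaluations this yields six rows, two of them labelled 1. A patient with only V1 and one home evaluation yields two rows, one per label.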

Selection of predictor variables or characteristics

In a first approximation, potential predictive characteristics were selected using the random forest²⁸, gradient boosting classifier²⁸ and light gradient-boosting machine (LGBM)²⁹ classification algorithms, which rank characteristics by importance as part of the model fitting process.

Figure 2 shows an outline of the process for preparation and selection of the characteristics of the signals.

During the process of selecting characteristics, all those that were redundant or had very low variabilities were discarded. In this study, by definition, we did not have variables with perfect separation that could cause overestimation of the diagnostic capacity of the models (overfitting)²³.

In addition to the characteristics selected from the HR and Ox signals, the age, sex and baseline disease (HF or COPD) of the patients were considered potential predictors.

Development and validation of algorithms

For the development of the algorithms, the ML techniques most used in the studies of classification models were considered: (i) decision trees, (ii) random forest, (iii) k-nearest neighbour (KNN), (iv) support vector machine (SVM), (v) logistic regression, (vi) naive Bayes classifier, (vii) gradient-boosting classifier and (viii) LGBM.

For each of these techniques, hyperparameters were selected based on a brute force scheme using all available data through a cross validation scheme (K-fold cross validation, k = 5). A normalization process based on the median and interquartile ranges (IQRs) was applied to all characteristics²⁸.
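A normalization based on the median and IQR can be sketched in plain Python as below. This is an illustration under assumptions rather than the procedure actually used: the function names are hypothetical, the quartiles use the "inclusive" interpolation method of the standard library, and a zero IQR is replaced by 1 to avoid division by zero.

```python
import statistics


def fit_robust_scaler(column):
    """Return the (median, IQR) of one characteristic across observations."""
    q1, median, q3 = statistics.quantiles(column, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr == 0:
        iqr = 1.0  # assumption: leave constant columns unscaled
    return median, iqr


def robust_scale(column, median, iqr):
    """Centre on the median and divide by the IQR."""
    return [(x - median) / iqr for x in column]


# Invented values for illustration.
train_column = [1.0, 2.0, 3.0, 4.0, 5.0]
median, iqr = fit_robust_scaler(train_column)
scaled = robust_scale(train_column, median, iqr)
```

A common precaution is to fit the median and IQR on the training data only and reuse them on held-out data; the text does not state how this was handled here.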

Once the best hyperparameters of each technique were identified, internal validation was performed with a leave-one-patient-out method. Thus, a new model was fitted for each patient: that patient's data were held out as the validation set, and the model was trained on the data of all remaining patients. Figure 3 shows an outline of the training and validation process.
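A leave-one-patient-out split keeps all observation units of one patient together in the held-out set, so that no patient contributes to both training and validation in the same round. The following Python sketch shows only the splitting logic; the row layout and the function name are assumptions, and the model fitting step is left out.

```python
def leave_one_patient_out(rows):
    """Yield (patient_id, training_rows, held_out_rows), one round per patient.

    rows: list of dicts, each with a "patient" key identifying the patient
    that the observation unit belongs to (hypothetical layout).
    """
    patients = sorted({row["patient"] for row in rows})
    for patient in patients:
        train = [row for row in rows if row["patient"] != patient]
        held_out = [row for row in rows if row["patient"] == patient]
        yield patient, train, held_out


# Invented observation units: patient "A" has three units, "B" and "C" fewer.
units = [
    {"patient": "A", "pair": "V2-V1", "label": 1},
    {"patient": "A", "pair": "V1-V2", "label": 0},
    {"patient": "A", "pair": "V2-V3", "label": 0},
    {"patient": "B", "pair": "V2-V1", "label": 1},
    {"patient": "B", "pair": "V1-V2", "label": 0},
    {"patient": "C", "pair": "V3-V1", "label": 1},
]
splits = list(leave_one_patient_out(units))
```

In each round a classifier would be trained on the training rows and applied to the held-out rows; the predictions collected over all rounds are then used to compute the performance measures.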

The observation units (inputs) on which the algorithms were applied were the differences between two different evaluations, as illustrated in Fig. 1. Thus, the algorithms classified the evaluated difference as a state of “no decompensation” (label = 0) or “a change to decompensation” (label = 1). Therefore, the following parameters were defined:

True positive (TP): “a change to decompensation” as classification result for V3-V1 or V2-V1 comparison.
True negative (TN): “no decompensation” as classification result for V1-V2, V1-V3, V2-V3 or V3-V2 comparison.
False positive (FP): “change to decompensation” as classification result for V1-V2, V1-V3, V2-V3 or V3-V2 comparison.
False negative (FN): “no decompensation” as classification result for V3-V1 or V2-V1 comparison.

The parameters used to evaluate the diagnostic performance of the algorithms were the S, E and A. Each patient could have up to six observation units or inputs; therefore, up to six classification results were obtained, which were then defined as TP, TN, FP or FN. Then, the S, E and A were obtained for each patient. The final S, E and A of the entire sample were calculated from the mean of parameters obtained from each patient.
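The per-patient calculation followed by averaging can be sketched as follows. The data layout and function names are hypothetical, and skipping a patient for whom S or E is undefined (no positive or no negative units) is an assumption, since the text does not describe that case.

```python
from statistics import fmean


def patient_metrics(results):
    """results: list of (true_label, predicted_label) pairs for one patient."""
    tp = sum(1 for y, p in results if y == 1 and p == 1)
    fn = sum(1 for y, p in results if y == 1 and p == 0)
    tn = sum(1 for y, p in results if y == 0 and p == 0)
    fp = sum(1 for y, p in results if y == 0 and p == 1)
    return {
        "S": tp / (tp + fn) if (tp + fn) else None,
        "E": tn / (tn + fp) if (tn + fp) else None,
        "A": (tp + tn) / len(results),
    }


def sample_metrics(results_by_patient):
    """Mean of the per-patient S, E and A over all patients."""
    per_patient = [patient_metrics(r) for r in results_by_patient.values()]
    return {
        key: fmean([m[key] for m in per_patient if m[key] is not None])
        for key in ("S", "E", "A")
    }


# Invented classification results: patient "A" has six units, "B" has two.
results_by_patient = {
    "A": [(1, 1), (1, 1), (0, 0), (0, 0), (0, 0), (0, 1)],
    "B": [(1, 0), (0, 0)],
}
summary = sample_metrics(results_by_patient)
```

In this invented example the mean sensitivity over patients is 0.5, whereas pooling all units would give 2 of 3 positives detected. The two ways of summarizing differ whenever patients contribute different numbers of units.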

The predictive values were not considered because the proportions of evaluations in the decompensated phase (33% [V1]) and compensated phase (66% [V2, V3]) did not correspond to the usual proportion found in clinical practice (the vast majority of patients in the community are usually in the compensated phase).
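The dependence of the predictive values on the proportion of decompensated cases can be illustrated with Bayes' rule. In the Python sketch below, the sensitivity and specificity are those reported for logistic regression in this study, the 33% proportion is the one in the study sample, and the 5% proportion is a purely hypothetical value chosen to represent a setting where most patients are compensated.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Positive and negative predictive values for a given prevalence."""
    tp = sensitivity * prevalence
    fn = (1 - sensitivity) * prevalence
    tn = specificity * (1 - prevalence)
    fp = (1 - specificity) * (1 - prevalence)
    return tp / (tp + fp), tn / (tn + fn)


S, E = 0.8083, 0.8625  # logistic regression, as reported in this study

ppv_study, npv_study = predictive_values(S, E, 1 / 3)
ppv_low, npv_low = predictive_values(S, E, 0.05)  # hypothetical prevalence
```

Under these assumptions the positive predictive value falls from roughly 0.75 to roughly 0.24 as the proportion of decompensated cases drops from one third to 5%, which is why predictive values computed in the study sample would not transfer to routine practice.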

Missing data, excluded data and indeterminate results

Missing data were not included in the analysis, but patients with missing data were not excluded (all available patient data were included in the analysis). No imputation of missing data was performed.

During the review of the signals and the verification of the start and end times of each evaluation against the manual records, lost sections of HR and/or Ox data due to poor contact between the skin and the sensor were observed. Filters were therefore introduced to exclude these lost sections from the analysis. Thus, an evaluation was excluded if it had a loss rate (measures lost divided by the total number of measures) greater than 10% in any phase. In addition, evaluations performed at home (V2, V3) that did not reveal an improvement in the sensation of dyspnoea for the patient (of at least one point according to the mMRC scale²⁷) with respect to the decompensated phase evaluation (V1) were also excluded to ensure that home assessments were performed in the “compensated phase”.
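The two exclusion filters can be sketched in Python as below. The 10% loss threshold and the one-point mMRC improvement come from the text; everything else is an assumption made for illustration, including the function names, the representation of a lost measure as None, and the choice to compute the loss rate separately for each signal within each phase.

```python
def loss_rate(samples):
    """Measures lost divided by the total number of measures."""
    return sum(1 for s in samples if s is None) / len(samples)


def passes_signal_filter(evaluation, max_loss=0.10):
    """evaluation: {"walk": {"HR": [...], "Ox": [...]}, "recovery": {...}}.

    An evaluation is kept only if no phase has a loss rate above max_loss.
    """
    return all(
        loss_rate(samples) <= max_loss
        for phase in evaluation.values()
        for samples in phase.values()
    )


def passes_dyspnoea_filter(mmrc_hospital, mmrc_home, min_improvement=1):
    """Home evaluations need an mMRC improvement of at least one point."""
    return (mmrc_hospital - mmrc_home) >= min_improvement


# Invented recordings: 5% loss in one evaluation, 11% loss in the other.
good = {"walk": {"HR": [80] * 95 + [None] * 5, "Ox": [95] * 100},
        "recovery": {"HR": [75] * 100, "Ox": [96] * 100}}
bad = {"walk": {"HR": [80] * 100, "Ox": [95] * 100},
       "recovery": {"HR": [75] * 100, "Ox": [96] * 89 + [None] * 11}}

keep_good = passes_signal_filter(good)
keep_bad = passes_signal_filter(bad)
```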

No indeterminate results were noted in the index test (algorithms); in all cases, the model produced a “no decompensation” or “a change to decompensation” result. On the other hand, all evaluations were always performed after a definitive result of the standard diagnostic reference test: clinical diagnosis of the decompensated phase by the doctor responsible for the patient in the hospital evaluation (V1) and clinical diagnosis of the compensated phase by the doctor who contacted them by phone before home evaluations (V2, V3). Thus, the algorithms were developed and applied on evaluations clearly labelled as the compensated or decompensated phase by the reference diagnostic test.

Approval of the Ethics Committee

The study was developed according to the Declaration of Helsinki and approved by the Ethics and Research Committee (ERC) of the centre promoting the study (ERC of the Mataró Hospital, approval number 1851806). Informed consent was obtained from all participants and/or their legal guardians.

Participants and evaluations

A total of 135 patients were recruited. After excluding evaluations according to the criteria described above (patients with neither of the home evaluations [V2, V3]; signal loss greater than 10% in assessment V1 or in both V2 and V3; and no improvement of at least one point in dyspnoea in the compensated phase), 60 patients were available for inclusion in the analyses. Figure 4 shows the flow of study participants.

Of the 60 patients included, all performed the hospital evaluation (V1), but not all performed both home evaluations (V2, V3). Therefore, not all patients included had the six observation units derived from the three planned evaluations (V1, V2, V3). In total, 93 observation units of the “change to decompensation” type (label = 1) and 159 of the “no decompensation” type (label = 0) were obtained.

No relevant medical events occurred during the evaluations.

The baseline characteristics of the participants finally selected for model development, according to the underlying pathology, and the severity of the clinical picture on admission (dyspnoea according to the NYHA scale²⁶) are shown in Table 1.

Table 1. Baseline characteristics of the patients analysed. SD: standard deviation; IQR: interquartile range; HF: heart failure; COPD: chronic obstructive pulmonary disease.

Characteristic	Decompensated HF	Exacerbated COPD	Decompensation of both pathologies
Total N	22	25	13
Age, years (SD)	78 (8)	72 (8)	75 (11)
Male sex (n,%)	8 (36)	19 (76)	10 (77)
Body mass index (SD)	27 (5)	26 (4)	28 (9)
Type 2 diabetes mellitus (n,%)	9 (41)	6 (24)	5 (38)
Dyslipidaemia (n,%)	7 (32)	11 (44)	5 (38)
Active smoking (n,%)	0	7 (28)	1 (8)
Osteoarthritis (n,%)	16 (73)	13 (52)	6 (46)
Mean hospital stay in days (SD)	7.6 (4.3)	7.3 (3)	27.4 (43.5)
Previous admissions for HF/COPD, IQR [25,75]	1.0 IQR [0.0, 2.0]	1.0 IQR [1.0, 2.0]	1.0 IQR [1.0, 3.0]
Number of days prior to discharge at the V1 assessment (SD)	4.9 (16.7)	7.3 (15.4)	4.6 (7)
Dyspnoea according to the NYHA scale, IQR [25,75]	2.0 IQR [2.0, 2.75]	2.0 IQR [1.0, 3.0]	2.0 IQR [1.0, 2.0]
Dyspnoea according to the mMRC scale, IQR [25,75]	3.0 IQR [3.0, 4.0]	3.0 IQR [2.0, 3.0]	4.0 IQR [1.0, 4.0]

Selected characteristics or predictor variables

In terms of the selection of predictor variables, Table 2 shows the selected characteristics and their descriptions. Of 99 characteristics, 19 were ultimately selected. Using the 3 previously mentioned classification algorithms we found that the 3 most important predictive characteristics were the following: “meanHRminusOx Recovery”, “meanOxRecovery”, “meanHRminusOxWalk” (random forest); “PC2-Ox-Recovery”, “PC6-HRminusOx-Walk”, “stdOxWalk” (gradient boosting classifier); and “meanOxRecovery”, “meanHRminusOx Recovery”, “stdOxWalk” (LGBM).

None of the other predictors evaluated (age, sex and baseline disease) provided greater discriminative power to the models.

Table 2. Selected predictor characteristics or variables. HR: heart rate; Ox: oxygen saturation; PC: principal component; FFT: fast Fourier transform.

Nomenclature	Type	Signal	Phase	Scope
meanHRWalk	Mean	HR	Walk	Temporal
meanHRRecovery	Mean	HR	Recovery	Temporal
meanOxWalk	Mean	Ox	Walk	Temporal
meanOxRecovery	Mean	Ox	Recovery	Temporal
meanHRminusOxWalk	Mean	HR-Ox	Walk	Temporal
meanHRminusOx Recovery	Mean	HR-Ox	Recovery	Temporal
stdHRminusOxWalk	Standard deviation	HR-Ox	Walk	Temporal
stdOxRecovery	Standard deviation	Ox	Recovery	Temporal
stdOxWalk	Standard deviation	Ox	Walk	Temporal
frecFirstArmHRRecovery	Frequency of the largest harmonic	HR	Recovery	Frequency
frecSecArmHRminusOxWalk	Frequency of the second largest harmonic	HR-Ox	Walk	Frequency
frecSecArmHRminusOxRecovery	Frequency of the second largest harmonic	HR-Ox	Recovery	Frequency
frecSecArmHRWalk	Frequency of the second largest harmonic	HR	Walk	Frequency
seconArmOxWalk	Amplitude of the second largest harmonic	Ox	Walk	Frequency
sumAllArmHRminusOxRecovery	Sum of all harmonics	HR-Ox	Recovery	Frequency
sumAllArmOxWalk	Sum of all harmonics	Ox	Walk	Frequency
PC6-HRminusOx-Walk	Sixth principal component of the FFT	HR-Ox	Walk	Frequency
PC2-Ox-Recovery	Second principal component of the FFT	Ox	Recovery	Frequency
PC4-Ox-Recovery	Fourth principal component of the FFT	Ox	Recovery	Frequency

Diagnostic algorithms

The diagnostic performance of the algorithms developed according to the technique used is shown in Table 3. The techniques with S and E values above 80% were logistic regression and SVM.

Table 3. Diagnostic capacity of the algorithms developed according to the technique used.

Machine Learning Technique	True Positive	False Negative	True Negative	False Positive	Sensitivity *	Specificity *	Accuracy *
Random forest	75	18	138	21	78.3	88.8	83.6
Logistic Regression	74	19	129	30	80.8	86.3	83.6
Decision Tree	72	21	137	22	78.3	85.8	83.1
Naive Bayes	73	20	142	17	75	90.4	83.1
SVM	77	16	129	30	81.7	85	82.3
LGBM	70	23	132	27	73.3	87.5	80.6
Gradient-Boosting Classifier	64	29	137	22	69.2	88.3	80.3
KNN	52	41	133	26	53.3	84.2	70.8

(*): These parameters were obtained from the mean of all patients (since not all have the same number of evaluations, the mean does not necessarily correspond to that obtained from the total true positive, false negative, true negative and false positive data available in the entire sample).
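To make the footnote concrete, the short Python sketch below computes the pooled S, E and A directly from the total counts in Table 3 for two of the techniques. The counts are taken from the table; the function name is hypothetical. The pooled values differ from the per-patient means shown in the table, as the footnote anticipates.

```python
def pooled_metrics(tp, fn, tn, fp):
    """S, E and A (in %) computed from counts pooled over the whole sample."""
    sensitivity = 100 * tp / (tp + fn)
    specificity = 100 * tn / (tn + fp)
    accuracy = 100 * (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Counts from Table 3: (TP, FN, TN, FP).
table3_counts = {
    "Logistic Regression": (74, 19, 129, 30),
    "SVM": (77, 16, 129, 30),
}

pooled = {name: pooled_metrics(*counts) for name, counts in table3_counts.items()}

for name, (s, e, a) in pooled.items():
    print(f"{name}: pooled S={s:.1f}%, E={e:.1f}%, A={a:.1f}%")
```

For logistic regression, for example, the pooled sensitivity is 74/93, slightly below the per-patient mean of 80.8% reported in the table.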

Main results

The present study reports diagnostic models that have achieved a good detection capacity for exacerbation of COPD or HF decompensation (S and E greater than 80%). Although the data for S and E were slightly lower than those of two other studies (Vamos et al.¹² for HF and Wu et al.¹⁴ for COPD), we highlight that the models in our study, unlike these previous models, do not require complex devices such as intradomiciliary sensors or cardiac defibrillators for their implementation in clinical practice. A study that is potentially more comparable to ours in terms of the technology used and the method developed for the algorithms is that of Stehlik et al.¹³. The study reported HF decompensation detection models developed through ML from the monitoring of physiological parameters of 100 patients collected through a cutaneous patch at the thoracic level. The models developed obtained an S of 76 to 88% and an E of 85%, values similar to those of the models in our study. Recently, Morrill et al.³⁰ reported diagnostic models of decompensated HF developed with ML techniques with an S of 100% and E of 73% but based on simulated clinical situations and not real patients.

Another important result was that the underlying disease (COPD or HF) did not influence the development or diagnostic performance of the models; thus, to our knowledge, this is the first study reporting diagnostic models of decompensation potentially applicable to patients affected by COPD and by HF, which may be relevant given the increasing proportion of patients affected by both pathologies. However, our study can only be considered preliminary on this point because the sample size was modest and the design was not robust enough to confirm that this result is generalisable. Therefore, this result warrants further investigation. As a hypothesis, we propose the coexistence of pathophysiological mechanisms in the decompensation of both diseases, with HR, Ox and their relationship serving as parameters that could represent a relevant common denominator for decompensation of both pathologies. Ox has already shown considerable utility in the detection of acute HF in previous studies³¹ and has been considered the physiological parameter with the greatest discriminative power in COPD^10,32. In addition, the cutoff point of Ox for the detection of acute HF does not seem to be modified in patients who also have COPD³¹. This study also proposes HR and its relationship with Ox as parameters of interest in the pathophysiological mechanisms related to decompensation of both diseases because, although the largest group of characteristics chosen for the development of the models (eight of 19) was related only to Ox, four were exclusively related to HR, and the rest (seven of 19) were related to the combination of the two parameters (HR-Ox). In any case, further research should explore this hypothesis.

Validity

With the methodological approach considered, we believe that none of the selected characteristics or the other potentially predictive variables evaluated were associated with a possible phenomenon of “information leakage” from the outcome variable (compensated or decompensated phase) to the predictor variables (“outcome leakage”)²³. However, we must recognize a possible “validation leakage”²³ because we could not use a completely independent sample for validation of the diagnostic models developed (the sample size we had led us to prioritize the development of the models with the maximum available sample), and we must recognize the possibility of some overestimation in the diagnostic performance obtained.

We began the study with a cohort of patients in the decompensated phase. This design allowed us to have sufficient observations for both categories of the outcome variable and to develop and evaluate the models obtained (if we had started with a cohort of stable patients, only a small proportion would have presented with decompensation). In addition, the design allowed each patient to act as their own control. Although choosing hospitalization as the reference for the decompensated phase was not ideal because the ultimate goal of these algorithms is to detect clinical decompensation in an earlier phase, the evaluation during hospitalization was performed once the patients were clinically stable, when they were able to walk at least 30 m, so the hospital evaluation (V1) was actually carried out once the most acute phase of decompensation had passed.

The time interval between the standard diagnostic method (confirmation of the compensated or decompensated phase by a doctor) and data collection for the development of the diagnostic models was quite short (24–48 hours); therefore, we do not believe that considerable changes in the clinical states of the patients occurred between these events to influence the results for the diagnostic performance of the models developed.

In terms of the extrapolation of our results to other populations, the models are designed to detect the most severe exacerbations of the disease (those that motivate hospital admission) and not milder exacerbations (such as those requiring only outpatient management). The inclusion of centres with different levels of complexity in two different geographical areas allowed us to include a sample of patients representing a large part of the clinical spectrum of both pathologies.

Clinical implications

Pending external validation and demonstration of their efficacy in routine clinical practice, the models developed in this study are designed for implementation in minimally invasive or nondisruptive devices for routine, continuous out-of-hospital monitoring of certain patients. Although the data in this study were collected from a pulse oximeter, various commonly used devices (for example, smartwatches) are capable of continuously monitoring the physiological parameters included in the diagnostic models developed.

Limitations

In addition to the limitations mentioned in previous paragraphs, we must recognize the high proportion of evaluations lost or excluded from the analysis, which may have adversely influenced the diagnostic performance achieved by the models developed. Thus, we must accept the possibility of a non-negligible selection bias in the final sample available for analysis. We also emphasize that the conditions in which the assessments were performed were controlled (a specific protocol of walking and recovery was followed), and evaluations in more “real” conditions are still pending. Finally, although the high proportion of evaluations in the decompensated phase allowed us to enhance model development, this proportion is considerably higher than that in the real world (in usual conditions, most patients are in the compensated phase of their disease); therefore, a proportion of compensated to decompensated phases closer to real-world conditions should be considered in future studies to avoid a high false-positive rate precluding implementation in clinical practice.

The diagnostic models developed achieved good diagnostic performance for decompensated HF or COPD exacerbation.

To our knowledge, this study is the first to report diagnostic models of decompensation potentially applicable to both COPD and HF patients. However, these results are preliminary and warrant further investigation.

Declarations of interest: none.

FUNDING

This research was partially funded by the European Commission (enhanced Complete Ambient Assisted Living Experiment (eCaalyx) project; grant number AAL-2008-1-032).

The funders had no role in the study design, data collection and analysis, decision to publish or preparation of the manuscript.

COMPETING INTEREST STATEMENT

The author(s) declare no competing interests.

DATA AVAILABILITY

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

AUTHOR CONTRIBUTIONS STATEMENT

All authors made substantial contributions to the conception and design of the work and to the acquisition, analysis, and interpretation of data. CGB, CPL and ARM wrote the main manuscript text. FVA, FF, JR, RB, DC and CI participated in the recruitment process. All authors have reviewed and approved the manuscript.

References

Boult, C., Altmann, M., Gilbertson, D., Yu, C. & Kane, R. L. Decreasing disability in the 21st century: the future effects of controlling six fatal and nonfatal conditions. Am J Public Health 86, 1388–1393 (1996).
Mannino, D. M. COPD: epidemiology, prevalence, morbidity and mortality, and disease heterogeneity. Chest 121, 121S-126S (2002).
Muñiz García, J., Crespo Leiro, M. G. & Castro Beiras, A. [Epidemiology of heart failure in Spain and the importance of adhering to clinical practice guidelines]. Rev Esp Cardiol 6 Suppl F, 2–8 (2006).
Anzueto, A., Sethi, S. & Martinez, F. J. Exacerbations of chronic obstructive pulmonary disease. Proc Am Thorac Soc 4, 554–564 (2007).
Giamouzis, G. et al. Hospitalization epidemic in patients with heart failure: risk factors, risk prediction, knowledge gaps, and future directions. J Card Fail 17, 54–75 (2011).
Liao, L., Allen, L. A. & Whellan, D. J. Economic burden of heart failure in the elderly. Pharmacoeconomics 26, 447–462 (2008).
Zhang, Y. et al. A systematic review of how patients value COPD outcomes. Eur Respir J 52, 1800222 (2018).
Cotter, G. et al. Acute heart failure: a novel approach to its pathogenesis and treatment. Eur J Heart Fail 4, 227–234 (2002).
Wilkinson, T. M. A., Donaldson, G. C., Hurst, J. R., Seemungal, T. A. R. & Wedzicha, J. A. Early therapy improves outcomes of exacerbations of chronic obstructive pulmonary disease. Am J Respir Crit Care Med 169, 1298–1303 (2004).
Al Rajeh, A. M. & Hurst, J. R. Monitoring of Physiological Parameters to Predict Exacerbations of Chronic Obstructive Pulmonary Disease (COPD): A Systematic Review. J Clin Med 5, (2016).
Brons, M., Koudstaal, S. & Asselbergs, F. W. Algorithms used in telemonitoring programmes for patients with chronic heart failure: A systematic review. Eur J Cardiovasc Nurs 17, 580–588 (2018).
Vamos, M. et al. Refined heart failure detection algorithm for improved clinical reliability of OptiVol alerts in CRT-D recipients. Cardiol J 25, 236–244 (2018).
Stehlik, J. et al. Continuous Wearable Monitoring Analytics Predict Heart Failure Hospitalization: The LINK-HF Multicenter Study. Circ Heart Fail 13, e006513 (2020).
Wu, C.-T. et al. Acute Exacerbation of a Chronic Obstructive Pulmonary Disease Prediction System Using Wearable Device Data, Machine Learning, and Deep Learning: Development and Cohort Study. JMIR Mhealth Uhealth 9, e22591 (2021).
Singhal, A. & Cowie, M. R. The Role of Wearables in Heart Failure. Curr Heart Fail Rep 17, 125–132 (2020).
Gálvez-Barrón, C. et al. Effort Oxygen Saturation and Effort Heart Rate to Detect Exacerbations of Chronic Obstructive Pulmonary Disease or Congestive Heart Failure. J Clin Med 8, (2019).
Cebul, R. D., Hershey, J. C. & Williams, S. V. Using multiple tests: series and parallel approaches. Clin Lab Med 2, 871–890 (1982).
Krittanawong, C. et al. Integration of novel monitoring devices with machine learning technology for scalable cardiovascular management. Nat Rev Cardiol 18, 75–91 (2021).
Rush, B., Celi, L. A. & Stone, D. J. Applying machine learning to continuously monitored physiological data. J Clin Monit Comput 33, 887–893 (2019).
Drew, B. J. et al. Insights into the problem of alarm fatigue with physiologic monitor devices: a comprehensive observational study of consecutive intensive care unit patients. PLoS One 9, e110274 (2014).
Paganelli, A. I. et al. Real-time data analysis in health monitoring systems: A comprehensive systematic literature review. J Biomed Inform 127, 104009 (2022).
Sidey-Gibbons, J. A. M. & Sidey-Gibbons, C. J. Machine learning in medicine: a practical introduction. BMC Med Res Methodol 19, 64 (2019).
Luo, W. et al. Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View. J Med Internet Res 18, e5870 (2016).
Bossuyt, P. M. et al. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. BMJ 351, h5527 (2015).
Liu, Y., Chen, P.-H. C., Krause, J. & Peng, L. How to Read Articles That Use Machine Learning: Users’ Guides to the Medical Literature. JAMA 322, 1806–1816 (2019).
Dolgin, M. Nomenclature and criteria for diagnosis of diseases of the heart and great vessels. (Little, Brown, 1994).
Fletcher, C. Standardised questionnaire on respiratory symptoms: a statement prepared and approved by the MRC Committee on the Aetiology of Chronic Bronchitis (MRC breathlessness score). BMJ 2, 1665 (1960).
Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. J Mach Learn Res 12, 2825–2830 (2011).
Ke, G. et al. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. in Advances in Neural Information Processing Systems vol. 30 (Curran Associates, Inc., 2017).
Morrill, J. et al. A Machine Learning Methodology for Identification and Triage of Heart Failure Exacerbations. J Cardiovasc Transl Res 15, 103–115 (2022).
Masip, J. et al. Pulse oximetry in the diagnosis of acute heart failure. Rev Esp Cardiol (Engl Ed) 65, 879–884 (2012).
Al Rajeh, A. et al. Application of oxygen saturation variability analysis for the detection of exacerbation in individuals with COPD: A proof-of-concept study. Physiol Rep 9, e15132 (2021).


Editorial decision: Major revision (21 May, 2023)
Reviews received at journal (19 May, 2023)
Reviews received at journal (05 May, 2023)
Reviews received at journal (01 May, 2023)
Reviewers agreed at journal (25 Apr, 2023)
Reviewers agreed at journal (25 Apr, 2023)
Reviewers invited by journal (18 Apr, 2023)
Editor assigned by journal (18 Apr, 2023)
Editor invited by journal (11 Apr, 2023)
Submission checks completed at journal (11 Apr, 2023)
First submitted to journal (05 Apr, 2023)
