Neural network analysis of clinical variables predicts escalated care in COVID-19 patients: a retrospective study

Joyce Q. Lu; Benjamin Musheyev; Qi Peng; Tim Q. Duong

doi:10.7717/peerj.11205

Neural network analysis of clinical variables predicts escalated care in COVID-19 patients: a retrospective study

Joyce Q. Lu, Benjamin Musheyev, Qi Peng, Tim Q. Duong

Department of Radiology, Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, NY, USA

DOI: 10.7717/peerj.11205

Published: 2021-04-19
Accepted: 2021-03-11
Received: 2021-01-12

Academic Editor: Antonio Palazón-Bru

Subject Areas: Bioinformatics, Emergency and Critical Care, Epidemiology, Hematology, Infectious Diseases
Keywords: Machine learning, Coronavirus, Pneumonia, SARS-CoV-2, Predictive model

Copyright: © 2021 Lu et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Lu JQ, Musheyev B, Peng Q, Duong TQ. 2021. Neural network analysis of clinical variables predicts escalated care in COVID-19 patients: a retrospective study. PeerJ 9:e11205 https://doi.org/10.7717/peerj.11205

The authors have chosen to make the review history of this article public.

Abstract

This study sought to identify the most important clinical variables that can be used to determine which COVID-19 patients hospitalized in the general floor will need escalated care early on using neural networks (NNs). Analysis was performed on hospitalized COVID-19 patients between 7 February 2020 and 4 May 2020 in Stony Brook Hospital. Demographics, comorbidities, laboratory tests, vital signs and blood gases were collected. We compared those data obtained at the time in emergency department and the time of intensive care unit (ICU) upgrade of: (i) COVID-19 patients admitted to the general floor (N = 1203) vs. those directly admitted to ICU (N = 104), and (ii) patients not upgraded to ICU (N = 979) vs. those upgraded to the ICU (N = 224) from the general floor. A NN algorithm was used to predict ICU admission, with 80% training and 20% testing. Prediction performance used area under the curve (AUC) of the receiver operating characteristic analysis (ROC). We found that C-reactive protein, lactate dehydrogenase, creatinine, white-blood cell count, D-dimer and lymphocyte count showed temporal divergence between COVID-19 patients hospitalized in the general floor that were upgraded to ICU compared to those that were not. The NN predictive model essentially ranked the same laboratory variables to be important predictors of needing ICU care. The AUC for predicting ICU admission was 0.782 ± 0.013 for the test dataset. Adding vital sign and blood-gas data improved AUC (0.822 ± 0.018). This work could help frontline physicians to anticipate downstream ICU need to more effectively allocate healthcare resources.

Introduction

Since it was first reported in Wuhan, China in December 2019 (Huang et al., 2020; Li et al., 2020b; Zhu et al., 2020b), the coronavirus disease 2019 (COVID-19) has infected over 27 million people and killed more than 880,000 people worldwide (6 September 2020) (Johns Hopkin University Coronavirus Resource Center, 2021). There are recent spikes in COVID-19 cases and there will likely be second waves in many countries (Leung et al., 2020). To date, it is challenging for emergency room physicians to objectively and reliably determine which patients need escalated care (i.e., intensive care unit (ICU) admission) or anticipate ICU needs downstream for effective allocation of healthcare resources in part because much is still unknown about this disease.

Many studies have reported the use of patient demographics, clinical presentations, comorbidities, vital sign data and laboratory blood tests to predict in-hospital outcomes (see reviews (Brown et al., 2020; Rodriguez-Morales et al., 2020; Wynants et al., 2020)). Some earlier studies found that: (i) age and CRP thresholds are good predictor of mortality (Lu et al., 2020), (ii) age, lymphocyte count, lactate dehydrogenase (LDH) and SpO₂ are independent predictors of mortality (Xie et al., 2020), (iii) comorbidity, older age, lower lymphocyte and higher LDH at presentation to be independent high-risk factors for COVID-19 progression (Ji et al., 2020), (iv) mildly elevated alanine aminotransferase (ALT), myalgias and hemoglobin at presentation to be predictive of severe acute respiratory distress syndrome of COVID-19 with 70% to 80% accuracy (Jiang et al., 2020) and (v) LDH, procalcitonin (procal), SpO₂, smoking history and lymphocyte count were predictive of ICU admission, and heart failure, procal, LDH, chronic obstructive pulmonary disease (COPD), SpO₂, HR and age were predictive of mortality (Zhao et al., 2020). These studies have relatively small sample sizes.

Most of these published studies to predict outcomes associated COVID-19 used logistic regression. Machine learning (ML) is increasingly being used in medicine (Deo, 2015; Hwang, Kesselheim & Vokinger, 2019; Santos et al., 2019). ML uses computer algorithms to learn relationships amongst different data elements to relate to outcomes without the need to specify the exact relationship amongst these data elements to outcome variables. ML is ideally suited for analyzing large number of data elements. Machine learning methods outperform humans in many tasks in medicine (Killock, 2020). With increasing computing power and big data, ML is expected to play an important role in medicine. A neural network (NN), in particular, is an artificial NN with multiple layers between the input and output layers. NNs are based on algorithms inspired from the biological structure and functioning of a brain to aid machines with intelligence, consisting of neurons, synapses, weights, biases. A few studies have used ML to predict in-hospital outcomes associated with COVID-19 (Hou et al., 2021; Li et al., 2020a; Yan et al., 2020; Yuan et al., 2020; Zhu et al., 2020a).

Although some predictors of mortality and critical illnesses were shared amongst these studies, there is currently no consensus as to which clinical variables are most predictive of mortality or the needs for escalated care. Moreover, these prior studies predicted critical illness and mortality using data obtained at admission to the emergency department. COVID-19 patients came into the emergency department at various disease severity. We argue that it is more relevant to study hospitalized COVID-19 patients in the general floor who were subsequently upgraded to ICU to identify the clinical variables that predict escalated care.

Thus, the goal of this study was to identify the clinical variables that can be used to determine which patients hospitalized COVID-19 patients in the general floor will need to be upgraded to ICU early on by comparing between those not upgraded to the ICU from the general floor vs. those subsequently upgraded to the ICU. Clinical variables were obtained at the time of arrival to the emergency department and at the time of ICU upgrade. For references, we also compared between COVID-19 patients admitted to the general floor vs. those immediately admitted to ICU. As a secondary analysis, we employed a simple neural-network algorithm to these data to identify and predict the most important clinical variables that informed the need for escalated care.

Materials and Methods

Study population and data collection

This study is approved by Stony Brook University Institutional Review Board. Our IRB ID number was IRB2020-00207. Our IRB waived the need for informed consent from patients in the study. This retrospective study utilized the COVID-19 Persons Under Investigation registry (N = 6,678) of the Stony Brook Hospital ED from 7 February 2020 to 30 June 2020. There were 2,892 COVID-19 positive patients as determined by real-time polymerase chain reaction for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), of which 1,430 were hospitalized. Patients who were <18 years old, still in the hospital at the time of analysis, and did not have full codes were excluded.

The final sample sizes included 1,203 patients admitted to general floor (“general floor”, Group A) and 104 directly admitted to the ICU from the ED (“direct ICU”, Group B), 979 patients remained on the general floor (“no upgrade”, Group C) and 224 were upgraded from the general floor to the ICU (“upgrade ICU”, Group D) (Fig. 1).

Figure 1: Patient selection flowchart.
The final sample sizes included 1,203 patients admitted to general floor (“general floor”, Group A) and 104 directly admitted to the ICU from the ED (“direct ICU”, Group B), 979 patients remained on the general floor (“no upgrade”, Group C) and 224 were upgraded from the general floor to the ICU (“upgrade ICU”, Group D).

Download full-size image

DOI: 10.7717/peerj.11205/fig-1

Demographic information, chronic comorbidities, laboratory tests, vital signs and blood gases were collected. Demographics included age, gender, ethnicity and race. Chronic comorbidities included smoking, diabetes, hypertension, asthma, COPD, coronary artery disease, heart failure, cancer, immunosuppression and chronic kidney disease. Laboratory tests included C-reactive protein (CRP), D-dimer, ferritin, LDH, white blood cell count (WBC), lymphocytes count (lymph), procal, ALT, aspartate transaminase (AST), brain natriuretic peptide (BNP), creatinine (Cr) and troponin (TNT). Vital signs included heart rate (HR), respiratory rate (RR), pulse oxygen saturation (SpO₂), systolic blood pressure (SBP), diastolic blood pressure (DBP) and temperature (temp). Blood gas variables and others include pH, pO₂, pCO₂, bicarbonate, sodium, hematocrit (HCRIT) and potassium.

These clinical variables were collected for general floor admission (group A) vs. direct ICU (group B) at ED admission. Data were collected for the no-upgrade vs. upgraded group at ED admission to the general floor. Data were also collected one day prior to ICU upgrade (group D) or three days after hospitalization for the no-upgrade group (Group C). The “3rd day” was chosen for comparison because the median day for patients to be upgraded to the ICU from the general floor was 3 days.

Preprocessing and NN prediction model

Bicarbonates, pCO₂, pO₂, pH, HCRIT and TNT were not used in the ML analysis because invasive blood gas samples and TNT were not routinely obtained in our hospital on general floor patients. For the rest of the laboratory variables, missing data (<25%) were imputated using standard methods (Van Buuren & Groothuis-Oudshoorn, 2011).

We employed a simple NN with two fully connected dense layers using Jupyter Notebook, Tensorflow, and Keras (Fig. 2). Only two dense layers were used to avoid overfitting given large number of variables and small sample sizes. Two NN sequential models were built: one using 11 laboratory tests (excluding vitals and blood gases) and the other using laboratory tests, vitals and blood gases (total 18). The inputs consisted of the clinical variables for no-ICU vs. ICU patients: namely those of Group A (floor) at ED admission and Group C (no upgrade) at the corresponding time of upgrade vs. Group B (direct ICU) at ED admission and Group D (upgrade) at the time of upgrade. The output was ICU admission, which was binary. For both, the dataset was randomly split into 80% training data and 20% testing data. The first dense layer consisted of four nodes, and the second dense layer had three nodes, both using the rectified linear activation function (ReLU). A Softmax function for activation in the output layer was used. Training was performed for 50 epochs with a batch size of six. For the model using laboratory tests, a learning rate of 0.001 was used, whereas for the model using laboratory tests, vitals and blood gases, a learning rate of 0.0009 proved optimal. We found that these smaller learning rates resulted in the highest accuracy. The model was compiled using the Adam optimizer, an efficient gradient descent algorithm. The clinical variables were ranked using SHapley Additive exPlanations (SHAP), a Python package that explains the output of ML models based on game theory. SHAP explains the importance of a feature by calculating the contribution of each feature to the prediction. Specifically, it uses the KernelExplainer to build a weighted linear regression based on the model’s predictions and the actual values from the data. It then computes Shapley values from coalitional game theory to determine the variable’s importance. Lastly, we calculated cutoffs of the top 6 predictive clinical variables using a nonparametric kernel method to maximize the summation of sensitivity and specificity (Fluss, Faraggi & Reiser, 2005).

Figure 2: Architecture of both neural networks.
The model consists of one input layer, one output layer with two fully connected hidden layers.

Download full-size image

DOI: 10.7717/peerj.11205/fig-2

Statistical analysis and performance evaluation: Statistical analysis was performed using SPSS v26 (IBM, Armonk, NY, USA) and SAS v9.4 (SAS Institute, Cary, NC, USA). Group comparisons of categorical variables in frequencies and percentages were performed using the Chi-squared test or Fisher exact test. Group comparison of continuous variables in medians and interquartile ranges (IQR) used the Mann–Whitney U test. For all analyses, a p value < 0.05 was considered to be statistically significant.

For performance evaluation of NN, data were split 80% for training and 20% for testing. Prediction performance was evaluated by area under the curve (AUC) of the receiver operating characteristic (ROC) curve for the test data set. The average ROC curve and AUC were obtained with ten runs and standard deviations were obtained. A p value < 0.05 was taken to be statistically significant unless otherwise specified.

Results

Table 1 summarizes the demographics and comorbidities for the general floor (group A, N = 1,203) vs. direct ICU (group B, N = 104). Compared to the general floor group, the direct ICU group had more males (p = 0.005), smokers (p = 0.008), diabetics (p =0.047) and patients with heart failure (p = 0.016). Age, ethnicity, race and prevalence of hypertension, asthmas, COPD, coronary artery disease, cancer immunosuppression and chronic kidney disease were not statistically different between groups (p > 0.05).

Table 1:

Laboratory tests, vital signs and blood gases of patients: (i) admitted to general floor from the emergency department (“floor”) and (ii) admitted to ICU directly from emergency department (“direct ICU”)

	Floor N = 1203	Direct ICU N = 104	p Value
Demographics
Age, median (IQR)	60 (49, 73)	63 (52, 74)	0.529
Sex			0.005
Male	687 (57.1%)	74 (71.2%)
Female	516 (42.9%)	30 (28.%)
Ethnicity			0.175
Hispanic/Latino	333 (27.7%)	20 (19.2%)
Non-Hispanic/Latino	710 (59%)	69 (66.3%)
Unknown	160 (13.3%)	15 (14.4%)
Race			0.784
Caucasian	629 (52.3%)	57 (54.8%)
African American	90 (7.5%)	7 (6.7%)
Asian	42 (3.5%)	5 (4.8%)
American Indian/Alaska Native	3 (0.2%)	1 (1.0%)
Pacific Islander	1 (0.1%)	0
More Than One Race	7 (0.6%)	0
Unknown/Not Reported	431 (35.8%)	34 (32.7%)

Comorbidities
Smoking History			0.008
Current Smoker	49 (4.1%)	8 (7.7%)
Former Smoker	250 (20.8%)	24 (23.1%)
Never Smoker	839 (69.7%)	59 (56.7%)
Unknown	64 (5.3%)	12 (11.5%)
Diabetes	309 (25.7%)	36 (34.6%)	0.047
Hypertension	573 (47.6%)	55 (52.9%)	0.304
Asthma	81 (6.7%)	6 (5.8%)	0.705
COPD	96 (8.0%)	10 (9.6%)	0.558
Coronary artery disease	167 (13.9%)	15 (14.4%)	0.878
Heart failure	84 (7.0%)	14 (13.5%)	0.016
Cancer	108 (9.0%)	11 (10.6%)	0.586
Immunosuppression	91 (7.6%)	9 (8.7%)	0.688
Chronic kidney disease	112 (9.3%)	12 (11.5%)	0.457

DOI: 10.7717/peerj.11205/table-1

Note:

Group comparison of categorical variables in frequencies and percentages used χ² test or Fisher exact tests. Group comparison of continuous variables in medians and interquartile ranges (IQR) used the Mann–Whitney U test. Abbreviation: COPD, chronic obstructive pulmonary disease. IQR, interquartile range. SpO₂, O₂, oxygen saturation. Bold text indicate statistical significance.

Table 2 summarizes the demographics and comorbidities for the no-upgrade (group C, N = 979) vs. upgrade group (group D, N = 224). Compared to the no upgrade group, the upgrade ICU group had more males (p = 0.005), and patients with asthma (p = 0.008) but fewer patients with cancer (p = 0.004). Race was different between groups. Age, ethnicity and prevalence of smoking, hypertension, diabetes, COPD, coronary artery disease, heart failure immunosuppression and chronic kidney disease were not statistically different between groups (p > 0.05).

Table 2:

Laboratory tests, vital signs and blood gases of patients: (i) not upgraded (“no-upgrade”) and (ii) upgrade to ICU from general floor (“upgrade”).

	No-upgrade N = 979	Upgrade N = 224	p Value
Demographics
Age, median (IQR)	60 (49, 72)	60 (50, 70)	0.307
Sex			<0.001
Male	534 (54.5%)	153 (68.3%)
Female	445 (45.5%)	71 (31.7%)
Ethnicity			0.056
Hispanic/Latino	226 (21.2%)	67 (29.9%)
Non-Hispanic/Latino	589 (60.2%)	121 (54.0%)
Unknown	124 (12.8%)	36 (16.0%)
Race			0.005
Caucasian	531 (54.2%)	98 (43.8%)
African American	76 (7.8%)	14 (6.3%)
Asian	27 (2.8%)	15 (6.7%)
American Indian/Alaska Native	2 (0.2%)	1 (0.5%)
Pacific Islander	1 (0.1%)	0
More Than One Race	7 (0.7%)	0
Unknown/Not Reported	335 (34.2%)	96 (42.9%)

Comorbidities
Smoking History			0.247
Current Smoker	48 (4.5%)	5 (2.2%)
Former Smoker	237 (22.1%)	42 (18.7%)
Never Smoker	728 (67.8%)	164 (72.9%)
Unknown	60 (5.6%)	14 (6.2%)
Diabetes	246 (25.1%)	63 (28.1%)	0.354
Hypertension	464 (47.4%)	109 (46.7%)	0.732
Asthma	57 (5.8%)	24 (10.7%)	0.008
COPD	84 (8.6%)	12 (5.4%)	0.108
Coronary artery disease	132 (13.5%)	35 (15.6%)	0.403
Heart failure	68 (7.0%)	16 (7.1%)	0.917
Cancer	99 (10.1%)	9 (4.0%)	0.004
Immunosuppression	77 (7.9%)	14 (6.3%)	0.410
Chronic kidney disease	94 (9.6%)	18 (8.0%)	0.467

DOI: 10.7717/peerj.11205/table-2

Note:

Laboratory tests

Figure 3 plots the laboratory tests for general floor (group A) vs. direct ICU (group B) at ED admission, and no-upgrade (group C) vs. upgrade (group D) at ED admission and at the time of upgrade. WBC, LDH, CRP, TNT and ferritin were significantly different between the general floor and the direct ICU group at ED admission (red bars). Lymph, WBC, LDH, CRP, AST, CRT, ferritin and ALT were significantly different between the no-upgrade and upgrade group at the time of admission to the hospital (green bars). Lymph, WBC and CRP were significantly different between the no-upgrade and upgrade group at the day prior to upgrade (blue bars).

Table 3 integrates the comparison across different groups of Fig. 3. LDH, CRP and ferritin were significantly different for: (i) the general floor vs. direct ICU group at ED admission time point, (ii) no-upgrade vs. upgrade group at ED admission time point and (iii) no-upgrade vs. upgrade group at upgrade time point (Table 3, row 1–3). WBC stood out in that it was different for the general floor vs. direct ICU group at ED admission, the no-upgrade vs. upgrade at upgrade, but it was not different for the no-upgrade vs. upgrade at ED admission time point. WBC and CRP significantly decreased in the no-upgrade group (Table 4, 4th row). WBC, LDH and Cr increased while lymph decreased in the upgrade group (Table 4, 5th row).

Table 3:

Comparison of laboratory tests. This is a summary of the results in Fig. 3.

Row		lymh	WBC	D-dimer	LDH	CRP	Fer	Cr	TNT	AST	ALT
1	Group A vs. B at admission		↑↑↑		↑↑↑	↑↑↑	↑		↓↓↓
2	Group C vs. D at admission	↓↓↓			↑↑↑	↑↑↑	↑↑↑			↑
3	Group C vs. D at upgrade	↓↓↓	↑↑↑	↑	↑↑↑	↑↑↑	↑↑↑	↑		↑↑	↑
4	at admission vs. at upgrade for C		↓↓↓			↓↓↓
5	at admission vs. at upgrade for D	↓↓↓	↑↑↑	↑	↑↑↑			↑↑↑
6	C improved or plateau but D deteriorated	X	X	X	X	X		X

DOI: 10.7717/peerj.11205/table-3

Note:

Note that at upgrade means 1 day prior to upgrade, ↑ = significant increase where p < 0.05, ↑↑ = significant increase where p < 0.01, ↑↑↑ = significant increase where p < 0.005. ↓ = significant decrease where p < 0.05, ↓↓ = significant decrease where p < 0.01, ↓↓↓ = significant decrease where p < 0.005. X: denotes variables that showed group C improved or plateaued, but group D deteriorated between two time points.

Table 4:

Comparisons for vitals and blood gases. This is a summary of the results in Fig. 4.

Row		RR	HR	SpO₂	DBP	SBP	Temp	pO₂	pH	pCO₂	Hcrit	Blood Bicarb	Serum Bicarb	Na⁺
1	Group A vs. B at admission	↑↑↑		↓↓↓			↓	↑↑↑	↓↓↓			↓	↓↓↓
2	Group C vs. D at admission	↑↑↑		↓↓↓		↓↓↓	↑↑↑						↓↓↓	↓↓
3	Group C vs. D at upgrade	↑↑↑	↑↑↑	↓↓↓			↑↑↑		↓	↑	↑			↓↓
4	at admission vs. at upgrade for C	↓↓↓	↓↓↓	↑↑↑	↓↓↓	↓↓↓	↓↓↓				↓↓↓		↓↓↓	↑↑↑
5	at admission vs. at upgrade for D		↓↓↓	↑↑↑			↓↓↓				↓↓↓		↓	↑↑↑
6	C improved or unchanged but D deteriorated

DOI: 10.7717/peerj.11205/table-4

Note:

Note that at upgrade means 1 day prior to upgrade, ↑ = significant increase where p < 0.05, ↑↑ = significant increase where p < 0.01, ↑↑↑ = significant increase where p < 0.005. ↓ = significant decrease where p < 0.05, ↓↓ = significant decrease where p < 0.01, ↓↓↓ = significant decrease where p < 0.005. X: denotes variables that showed C improved or plateaued, but D deteriorated between two time points.

Lymph, WBC, D-dimer, LDH, CRP and Cr improved or did not deteriorate between the two time points in the no-upgrade group but deteriorated in the upgrade group (Table 4, 6th row). These findings suggest that some of these clinical variables are informative of COVID-19 patients hospitalized in the general floor will need escalated care early on.

Vitals and blood gases

Figure 4 plots the vital signs and blood gases for general floor vs. direct ICU at ED admission and no-upgrade vs. upgrade at ED admission and one day prior to upgrade. RR, SpO₂, temp, pO₂ and pH, were significantly different between the general floor vs. direct ICU group (red bars). RR, HR, SpO₂, temp, pH and pCO₂ were significantly different between the no-upgrade vs. upgrade group (green bars) at the time of admission to hospital. HR, SpO₂, DBP, SDP and temp were significantly different between the no-upgrade vs. upgrade group (blue bars) at the day prior to upgrade.

Vitals, blood gases, and others were collected: i) RR ii) HR iii) SpO2 iv) DBP v) SBP vi) Temperature vii) pO2 viii) pH ix) pCO2 x) BICARB (arterial blood) xi) HCRIT xii) BICARB (serum) xiii) K+ xiv) Na+. — Figure 4: Vitals, blood gases, and others were collected: i) RR ii) HR iii) SpO₂ iv) DBP v) SBP vi) Temperature vii) pO₂ viii) pH ix) pCO₂ x) BICARB (arterial blood) xi) HCRIT xii) BICARB (serum) xiii) K⁺ xiv) Na⁺.
*p < 0.05, **p < 0.01, ***p < 0.005.

Download full-size image

DOI: 10.7717/peerj.11205/fig-4

Table 4 integrates the comparison across different groups in Fig. 4. HR, SpO₂ and temp were significantly different for: (i) the general floor vs. direct ICU group at ED admission time point, (ii) no-upgrade vs. upgrade at ED admission time point, and (iii) no-upgrade vs. upgrade at upgrade time point (Table 4, row 1–3). pH stood out in that it was different for the general floor vs. direct ICU group at ED admission, no-upgrade vs. upgrade at upgrade but it was not different for no-upgrade vs. upgrade at ED admission.

For the no upgrade group, RR, HR, DBP, SBP significantly decreased and SpO₂ and temp increased (Table 4, 4th row), whereas for the upgrade group, HR and temp decreased and SpO₂ increased (Table 4, 5th row). Unlike the laboratory tests, none of the vitals and blood gases showed improvement in the no-upgrade group and deterioration in the upgrade group between the two time points (Table 4, 6th row). These findings suggest that some of these clinical variables are informative of COVID-19 patients hospitalized in the general floor will need escalated care early on.

Predictors of ICU upgrade

The NN model built using laboratory tests ranked CRP, LDH, Cr, WBC, D-dimer and lymph (in order of importance) to be the top predictors of ICU admission. This model yielded an accuracy of 86 ± 5%, sensitivity of 0.242, specificity of 0.966 and AUC of 0.782 ± 0.013 for the testing dataset (Fig. 5). Note that high specificity and low sensitivity were due to sample asymmetry in which patients of ICU upgrades were fewer than those not upgraded.

Figure 5: AUC of deep neural network model built using laboratory tests.

Download full-size image

DOI: 10.7717/peerj.11205/fig-5

The NN model built using laboratory tests, vitals and blood gases ranked RR, LDH, CRP, DBP, procal, WBC, D-dimer and O₂ (in order of importance) to be the top predictors of ICU admission. This model yielded an accuracy of 88 ± 7%, sensitivity of 0.364, specificity of 0.924 and an AUC of 0.822 ± 0.018 for the testing dataset (Fig. 6). Note that high specificity and low sensitivity were due to sample asymmetry in which patients of ICU upgrades were fewer than those not upgraded. Cutoffs for the top predictive variables using the training set were determined to be 49.8 mg/dL for CRP, 1.05 mg/dL for Cr, 285 ng/mL for d-dimer, 392 ng/mL for LDH, 12.8% for lymph and 8 × 10³/mL for WBC.

Figure 6: AUC of deep neural network model built using laboratory tests, vitals and blood gases.

Download full-size image

DOI: 10.7717/peerj.11205/fig-6

Discussion

This study investigated the clinical variables associated with direct ICU admission and upgrade to ICU from the general floor. We found that lymphocyte count, white-blood cell count, D-dimer, LDH, CRP and Cr (unranked) improved or did not deteriorate with time in patients who were not upgraded to the ICU but deteriorated in patients who were upgraded to the ICU, showing temporal divergence. The learning predictive model using laboratory tests ranked CRP, LDH, Cr, white-blood cell count, D-dimer and lymphocyte count (in orders of importance), showing substantial overlaps with those variables that exhibited temporal divergence. The performance of the predictive model using these top predictors yielded an AUC of 0.782 ± 0.013 for predicting ICU admission on the test dataset. Adding vitals and blood-gas data further improved prediction performance (0.822 ± 0.018).

Compared to the general floor group, the direct ICU group had significantly more males, smokers, diabetics and patients with heart failure. Compared to the no upgrade group, the upgrade ICU group had more males, and patients with asthma but fewer patients with cancer. Smokers, diabetics and patients with heart failure were more likely to receive escalated care at ED admission. Patients with asthma was the only comorbidity that were associated with ICU upgrade. Some major comorbidities were important factor for ICU admission especially at ED admission, but less so for ICU upgrade, suggesting that ED physicians might consider major comorbidities as factor needing escalated care.

Clinical variables associated with ICU admission

Essentially all the laboratory test results of hospitalized COVID-19 patients were outside of normative physiologic ranges. The normative ranges of major laboratory tests were: lymphs 25–33%, WBC 0.5–11.0 × 10⁹/L, D-dimer <250 ng/mL, LDH 45–90 U/L, CRP <10 mg/dL, AST 5–40 U/L, Cr 0.84–1.21 mg/dL, TNT <0.04 ng/mL, ferritin male: 15–200 ng/mL (male) and 12–150 ng/mL, BNP <100 pg/mL, procal <0.15 ng/mL and ALT 8–40 U/L . Elevated values of these laboratory tests indicate increased disease severity in COVID-19, except lymphocyte count where lower values are associated with worse prognosis (Brown et al., 2020). Note that these clinical variables could be dependent on sex, age, ethnicity and race and are shown here for reference and might not be of use clinically.

Many laboratory tests showed worse disease severity in the direct or upgrade ICU group compared to general floor and no-upgrade group. However, we found that these laboratory tests by themselves were inadequate to reliably determine which patients required ICU admission. Often time, there were no appreciable differences between those directly admitted or upgraded to the ICU and those admitted to the general floor. For example, LDH, CRP and ferritin were significantly different for the general floor vs. direct ICU group at ED admission, and no-upgrade vs. upgrade group for both ED admission and at time of the ICU upgrade, suggesting they might not be useful to distinguish ICU upgrade despite being abnormal due to COVID-19. WBC stood out in that it was different for the general floor vs. direct ICU group at ED admission and the no-upgrade vs. upgrade group at the time of upgrade, but not for the no-upgrade vs. upgrade group at ED admission, suggesting it is one of the most informative variables of ICU upgrade.

Our innovative approach was thus to identify the laboratory tests that showed improvement or plateau between the two time points in the no-upgrade group but deteriorated in the upgrade group. The laboratory tests that showed temporal divergence were identified to be lymphocyte count, white-blood cell count, D-dimer, LDH, CRP and Cr (unranked). By contrast, most vitals and blood gases did not show such temporal divergence between groups, suggesting that vital signs and blood gases might be overall less important when compared to laboratory tests. This appears counter intuitive because vitals are readily available and are often informative in emergency room situation. Possible explanations are: (i) SpO₂ might be affected by supplemental oxygen inhalation, (ii) RR, HR, SBP and DBP could be highly variables, (iii) these vital signs were within normal normative physiological ranges (Merck Manual for the Professional, 2020) although there were group differences. We concluded that vital signs and blood gases appear to be overall less informative in predicting ICU admission compared to laboratory tests.

NN analysis

To further explore whether the above-mentioned laboratory variables are predictive of direct and upgrade ICU admission, we developed a NN model, trained it on 80% of the data, and tested it independently on 20% of data that the model had not seen before. Our neural NN identified CRP, LDH, Cr, white-blood cell count, D-dimer and lymphocyte count (in orders of importance) to be the top predictors of ICU admission. These variables showed substantial overlaps with those variables exhibiting temporal divergence described above. The performance of the predictive model using these top predictors yielded an AUC of 0.782 for predicting ICU admission from the testing dataset. Note that high specificity and low sensitivity were due to sample asymmetry in which patients of ICU upgrades were fewer than those not upgraded. Adding vital and blood-gas data improved prediction performance, yielding an AUC of 0.822 for predicting ICU admission from the test dataset. It is worth noting that RR was one of the highly ranked variables. This is not surprising because COVID-19 patients usually exhibited respiratory distress. Taken together, there is corroborative evidence that a few laboratory tests and vital signs are amongst the most important predictors of severe illness that warrants escalated care.

Limitations

This study has several limitations. This is a retrospective study carried out in a single hospital. As in all observational studies, other residual confounders might exist that were not accounted for in our analysis. These findings need to be replicated in a large and multi-institutional setting for generalizability. It might be challenging however to achieve generalizability across hospitals because the COVID-19 pandemic circumstance is unusual and evolving and how we treat COVID-19 patient is also evolving. ICU admission rate could depend on countries, hospital practice, patient loads, available ICU beds, and when during the pandemic the data were collected, amongst others. At the time of our study, our hospital was not limited by available ICU beds. Inclusion of radiological imaging, such as chest x-ray, may be helpful in improving prediction (Cohen et al. 2020; Kikkisetti et al. 2020; Zhu et al. 2020c). It is conceivable that our model might not work for patients in Wuhan because their patients were more severe, amongst others. Our model also might not work on data from second COVID-19 wave. This is not because the model is wrong, but rather we believe that it is necessary to retrain predictive model with local data. We only explored NN. Alternatively, random forest, Xgboost, kernel support vector machine and other more sophisticated ML methods could also be explored. To date it is generally not trivial for hospitals to share clinical COVID-19 data because of lack of infrastructure to do so seamlessly or concerns about patient data privacy, amongst others. There is a national effort to share deidentified clinical COVID-19 data but this is not yet available.

Conclusions

This study provided corroborative evidence that WBC, lymphocyte count, D-dimer, LDH, CRP and Cr are amongst the most important predictors of severe illness requiring ICU care. This work could help frontline physicians to better manage COVID-19 patients by anticipating downstream ICU needs to more effectively allocate healthcare resources.

Supplemental Information

Categorial Data Codebook.

DOI: 10.7717/peerj.11205/supp-1

Download

Code for Neural Network.

DOI: 10.7717/peerj.11205/supp-2

Download

[1] Brown RAC, Barnard J, Harris-Skillman E, Harbinson B, Dunne B, Drake J, Roche S, Harris E, Gunnel J, Frost J, Angus B, Hodgson S. 2020. Lymphocytopaenia is associated with severe SARS-CoV-2 disease: a systematic review and meta-analysis of clinical data. MedRxiv

[2] Cohen JP, Dao L, Roth K, Morrison P, Bengio Y, Abbasi AF, Shen B, Mahsa HK, Ghassemi M, Li H, Duong TQ. 2020. Predicting COVID-19 Pneumonia Severity on Chest X-ray With Deep Learning. Cureus 12:e9448

[3] Deo RC. 2015. Machine learning in medicine. Circulation 132(20):1920-1930

[4] Fluss R, Faraggi D, Reiser B. 2005. Estimation of the youden index and its associated cutoff point. Biometrical Journal 47(4):458-472

[5] Hou H, Zhao Z, Chen A, Li H, Duong TQ. 2021. Machining learning predicts the need for escalated care and mortality in COVID-19 patients from clinical variables. International Journal of Medical Sciences 18(8):1739-1745

[6] Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, Cheng Z, Yu T, Xia J, Wei Y, Wu W, Xie X, Yin W, Li H, Liu M, Xiao Y, Gao H, Guo L, Xie J, Wang G, Jiang R, Gao Z, Jin Q, Wang J, Cao B. 2020. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet 395(10223):497-506

[7] Hwang TJ, Kesselheim AS, Vokinger KN. 2019. Lifecycle regulation of artificial intelligence- and machine learning-based software devices in medicine. JAMA 322(23):2285

[8] Ji D, Zhang D, Xu J, Chen Z, Yang T, Zhao P, Chen G, Cheng G, Wang Y, Bi J, Tan L, Lau G, Qin E. 2020. Prediction for progression risk in patients with COVID-19 Pneumonia: the CALL score. Clinical Infectious Diseases 71(6):1393-1399

[9] Jiang X, Coffee M, Bari A, Wang J, Jiang X, Huang J, Shi J, Dai J, Cai J, Zhang T, Wu Z, He G, Huang Y. 2020. Towards an artificial intelligence framework for data-driven prediction of coronavirus clinical severity. Computers, Materials & Continua 63(3):537-551

[10] Johns Hopkin University Coronavirus Resource Center. 2021. COVID-19 Dashboard by the Center for Systems Science and Engineering.

[11] Kikkisetti S, Zhu J, Shen B, Li H, Duong TQ. 2020. Deep-learning convolutional neural networks with transfer learning accurately classify COVID-19 lung infection on portable chest radiographs. PeerJ 8:e10309

[12] Killock D. 2020. AI outperforms radiologists in mammographic screening. Nature Reviews Clinical Oncology 17(3):134

[13] Leung K, Wu JT, Liu D, Leung GM. 2020. First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: a modelling impact assessment. Lancet 395(10233):1382-1393

[14] Li X, Ge P, Zhu J, Li H, Graham J, Singer A, Richman PS, Duong TQ. 2020a. Deep learning prediction of likelihood of ICU admission and mortality in COVID-19 patients using clinical variables. PeerJ 8(8844):e10337

[15] Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, Ren R, Leung KSM, Lau EHY, Wong JY, Xing X, Xiang N, Wu Y, Li C, Chen Q, Li D, Liu T, Zhao J, Liu M, Tu W, Chen C, Jin L, Yang R, Wang Q, Zhou S, Wang R, Liu H, Luo Y, Liu Y, Shao G, Li H, Tao Z, Yang Y, Deng Z, Liu B, Ma Z, Zhang Y, Shi G, Lam TTY, Wu JT, Gao GF, Cowling BJ, Yang B, Leung GM, Feng Z. 2020b. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected Pneumonia. New England Journal of Medicine 382:1199-1207

[16] Lu J, Hu S, Fan R, Liu Z, Yin X, Wang Q, Lv Q, Cai Z, Li H, Hu Y. 2020. ACP risk grade: a simple mortality index for patients with confirmed or suspected severe acute respiratory syndrome coronavirus 2 disease (COVID-19) during the early stage of outbreak in Wuhan, China. medRxiv

[17] Merck Manual for the Professional. 2020. COVID-19 resources. (accessed 31 August 2020)

[18] Rodriguez-Morales AJ, Cardona-Ospina JA, Gutierrez-Ocampo E, Villamizar-Pena R, Holguin-Rivera Y, Escalera-Antezana JP, Alvarado-Arnez LE, Bonilla-Aldana DK, Franco-Paredes C, Henao-Martinez AF, Paniz-Mondolfi A, Lagos-Grisales GJ, Ramirez-Vallejo E, Suarez JA, Zambrano LI, Villamil-Gomez WE, Balbin-Ramon GJ, Rabaan AA, Harapan H, Dhama K, Nishiura H, Kataoka H, Ahmad T, Sah R, Latin American Network of Coronavirus Disease C-REahwlo. 2020. Clinical, laboratory and imaging features of COVID-19: a systematic review and meta-analysis. Travel Medicine and Infectious Disease 34:101623

[19] Santos MK, Ferreira Junior JR, Wada DT, Tenorio APM, Barbosa MHN, Marques PMA. 2019. Artificial intelligence, machine learning, computer-aided diagnosis, and radiomics: advances in imaging towards to precision medicine. Radiologia Brasileira 52(6):387-396

[20] Van Buuren S, Groothuis-Oudshoorn K. 2011. Mice: multivariate imputation by chained equations in R. Journal of Statistical Software 45:1-67

[21] Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, Bonten MMJ, Damen JAA, Debray TPA, De Vos M, Dhiman P, Haller MC, Harhay MO, Henckaerts L, Kreuzberger N, Lohman A, Luijken K, Ma J, Andaur CL, Reitsma JB, Sergeant JC, Shi C, Skoetz N, Smits LJM, Snell KIE, Sperrin M, Spijker R, Steyerberg EW, Takada T, van Kuijk SMJ, van Royen FS, Wallisch C, Hooft L, Moons KGM, van Smeden M. 2020. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ 369:m1328

[22] Xie J, Hungerford D, Chen H, Abrams ST, Li S, Li X, Tong Z, Du B, Qiu H, Toh CH. 2020. Development and external validation of a prognostic multivariable model on admission for hospitalized patients with COVID-19. MedRxiv

[23] Yan L, Zhang HT, Goncalves J, Xiao Y, Wang M, Guo Y, Sun C, Tang X, Jing L, Zhang M, Huang X, Xiao Y, Cao H, Chen Y, Ren T, Wang F, Xiao Y, Huang S, Tan X, Huang N, Jiao B, Cheng C, Zhang Y, Luo A, Mombaerts L, Jin J, Cao Z. 2020. An interpretable mortality prediction model for COVID-19 patients. Nature Machine Intelligence 2:283-288

[24] Yuan Y, Sun C, Tang X, Cheng C, Mombaerts L, Wang M, Hu T, Sun C, Guo Y, Li X, Xu H, Ren T, Xiao Y, Xiao Y, Zhu H, Chen C, Liu Y, Liang Z, Cao Z, Zhang HT, Paschaldis IC, Liu Q, Gonalves J, Zhong Q, Yan L. 2020. Development and validation of a prognostic risk score system for COVID-19 inpatients: a multi-center retrospective study in China.

[25] Zhao Z, Chen A, Hou W, Graham JM, Li H, Richman PS, Thode HC, Singer AJ, Duong TQ. 2020. Prediction model and risk scores of ICU admission and mortality in COVID-19. PLOS ONE 15(7):e0236618