Abstract
Identifying risk factors associated with COVID-19 lethality is crucial in combating the ongoing pandemic. In this study, we developed lethality predictive models for each epidemiological wave and for the overall dataset using the Extreme Gradient Boosting technique and analyzed them using Shapley values to determine the contribution levels of various features, including demographics, comorbidities, medical units, and recent medical information from confirmed COVID-19 cases in Mexico between February 23, 2020, and April 15, 2022. The results showed that pneumonia and advanced age were the most important factors predicting patient death in all cohorts. Additionally, the medical unit where the patient received care acted as a risk or protective factor. IMSS medical units were identified as high-risk factors in all cohorts, except in wave four, while SSA medical units generally were moderate protective factors. We also found that intubation was a high-risk factor in the first epidemiological wave and a moderate-risk factor in the following waves. Female gender was a protective factor of moderate-high importance in all cohorts, while being between 18 and 29 years old was a moderate protective factor and being between 50 and 59 years old was a moderate risk factor. Additionally, diabetes (all cohorts), obesity (third wave), and hypertension (fourth wave) were identified as moderate risk factors. Finally, residing in municipalities with the lowest Human Development Index level represented a moderate risk factor. In conclusion, this study identified several significant risk factors associated with COVID-19 lethality in Mexico, which could aid policymakers in developing targeted interventions to reduce mortality rates.
Similar content being viewed by others
Data Availability
The Python notebook and datasets used in this work are available at web repository https://github.com/Above02/Covid19_Analysis.
References
WHO Coronavirus (COVID-19) Dashboard (2023) https://covid19.who.int/. Accessed 24 March 2023.
WHO characterizes COVID-19 as a pandemic. (2023) https://www.paho.org/en/news/11-3-2020-who-characterizes-covid-19-pandemic. Accessed 24 March 2023.
Chen XS, OA Laurent, NN Onur, Kleineberg, GR et al (2021) A systematic review of neurological symptoms and complications of COVID-19. J Neurol 268:392–402. https://doi.org/10.1007/s00415-020-10067-3
Interpretable Machine Learning. A Guide for Making Black Box Models Explainable (2023) https://christophm.github.io/interpretable-ml-book/. Accessed 24 March 2023.
Datos Abiertos de COVID-19 (2022) https://www.gob.mx/salud/documentos/datos-abiertos-bases-historicas-direccion-general-de-epidemiologia Accessed July 6, 2023.
Secretaría de Salud. Lineamiento estandarizado para la vigilancia epidemiológica y por laboratorio de la enfermedad respiratoria viral (2022) https://www.gob.mx/cms/uploads/attachment/file/715444/Lineamiento_VE_y_Lab_Enf_Viral_05042022.pdf Accessed June 24 2023.
Najera H, Ortega-Avila AG (2020) Health and Institutional Risk Factors of COVID-19 lethality in Mexico. Am J Prev Med 60:471–477. https://doi.org/10.1016/j.amepre.2020.10.015.
Bae SA, Kim SR, Kim MN et al (2021) Park, Impact of cardiovascular disease and risk factors on fatal outcomes in patients with COVID-19 according to age: A systematic review and meta-analysis. Heart 107:373–380. https://doi.org/10.1136/heartjnl-2020-317901.
Salinas-Aguirre JE, Sánchez-García C, Rodríguez-Sanchez R et al (2020) Clinical characteristics and comorbidities associated with lethality in patients with COVID-19 in Coahuila (Mexico. Rev Clin Esp 222:288–292. https://doi.org/10.1016/j.rce.2020.12.006.
Escobedo-de la Peña J, Rascón-Pacheco RA, de Jesús Ascencio-Montiel I et al (2021) Hypertension, Diabetes and Obesity, Major Risk Factors for Death in Patients with COVID-19 in Mexico. Arch Med Res 52:443–449. https://doi.org/10.1016/j.arcmed.2020.12.002.
ERA-EDTA Council (2021) ERACODA Working Group, Chronic kidney disease is a key risk factor for severe COVID-19: A call to action by the ERA-EDTA. Nephro Dial Transplant 36:87–94. https://doi.org/10.1093/ndt/gfaa314
Wong KCY, So HC (2020) Uncovering clinical risk factors and prediction of severe COVID-19: A machine learning approach based on UK Biobank data. MedRxiv 2020.09.18.20197319. https://doi.org/10.1101/2020.09.18.20197319
Tehrani S, Killander A, Åstrand P et al (2021) Risk factors for death in adult COVID-19 patients: Frailty predicts fatal outcome in older patients. Int J Infect Dis 102:415–421. https://doi.org/10.1101/2020.09.18.20197319
Chávez-Almazán LA, Díaz-González L., Rosales-rivera M (2022) Socioeconomic status and its effects on morbidity, lethality, and lethality due to COVID-19 in Mexico. Gac Méd Méx 158:4–11. https://doi.org/10.24875/gmm.21000302.
Quiroz-Juárez MA, Torres-Gómez A, Hoyo-Ulloa I et al (2021) Identification of high-risk COVID-19 patients using machine learning. PLoS One 16:1–21. https://doi.org/10.1371/journal.pone.0257234.
Solis J, Franco-Paredes C, Henao-Martinez AF, et al (2020) Structural vulnerability in the U.S. revealed in three waves of COVID-19. Am J Trop Med Hyg 103:25–278. https://doi.org/10.4269/ajtmh.20-0391.
Di Castelnuovo A, Bonaccio M, Costanzo S, et al (2020) Common cardiovascular risk factors and in-hospital lethality in 3,894 patients with COVID-19: survival analysis and machine learning-based findings from the multicentre Italian CORIST Study. Nutr. Metab. Cardiovasc. Dis. 30:1899–1913. https://doi.org/10.1016/j.numecd.2020.07.031.
Smith M, Alvarez F (2020) Identifying lethality factors from Machine Learning using Shapley values – a case of COVID19. Expert Syst Appl 176:1–12, https://doi.org/10.1016/j.eswa.2021.114832.
Booth AL, Abels E, McCaffrey P (2021) Development of a prognostic model for lethality in COVID-19 infection using machine learning. Mod Pathol 34:522–531, https://doi.org/10.1038/s41379-020-00700-x.
Davazdahemami B, Zolbanin HM, Delen D (2021) An explanatory machine learning framework for studying pandemics: The case of COVID-19 emergency department readmissions. Decis Support Syst 161, 113730 https://doi.org/10.1016/j.dss.2022.113730.
Informe de pobreza en los municipios de México 2015. México: Consejo Nacional de Evaluación de la Política de Desarrollo Social. https://www.coneval.org.mx/Medicion/Paginas/Pobreza-municipio-2010-2020.aspx. Accessed 24 March 2023.
Pérez-Tamayo R (2016) Patología de la pobreza. El Colegio Nacional, México.
Jing N, Shi Z, Hu Y, Yuan J (2022) Cross-sectional analysis and data-driven forecasting of confirmed COVID-19 cases. Appl Intell 52:3303–3318 https://doi.org/10.1007/s10489-021-02616-8.
Banerjee T, Paul A, Srikanth, Strümke I (2022) Socioeconomic Disparities and COVID-19: The Causal Connections. SSRN Electron J https://doi.org/10.2139/ssrn.4013119
Vaid A, Somani S, Russak AJ et al (2020) Machine learning to predict lethality and critical events in a cohort of patients with COVID-19 in New York City: Model development and validation. J Med Internet Res 22:1–19. https://doi.org/10.2196/24018.
Schapire RE (2014) Boosting: Foundations and Algorithms, Massachusetts Institute of Technology, USA.
Chen T, Guestrin C (2016) XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August:785–794 https://doi.org/10.1145/2939672.2939785
Pedragosa F (2019) Scikit-learn: Machine Learning in Python. J Mach Learn Res 127:2825–2830, https://doi.org/10.1289/EHP4713.
Berrar D (2018) Cross-validation. Encycl. Bioinforma. Comput. Biol ABC Bioinforma 1–3 542–545, https://doi.org/10.1016/B978-0-12-809633-8.20349-X.
Muschelli J (2019) ROC and AUC with a Binary Predictor: a Potentially Misleading Metric. J Classif 37:696–708. https://doi.org/10.1007/s00357-019-09345-1
Kornbrot D (2014) Point biserial correlation. John Wiley & Sons, Ltd and republished in Wiley StatsRef: Statistics Reference Online. https://doi.org/10.1002/9781118445112.stat06227
Rod JE, Oviedo-Trespalacios O, Cortes-Ramirez J (2020) A brief-review of the risk factors for covid-19 severity. Rev Saude Publica 54:1–11 https://doi.org/10.11606/S1518-8787.2020054002481.
Ya-dong G, Ding M, Dong X et al (2021) Risk factors for severe and critically ill COVID-19 patients: a review. Allergy 76:428–455. https://doi.org/10.1111/all.14657
Zhang Q, Bastard P, COVID Human Genetic Effort, et al (2022) Human genetic and immunological determinants of critical COVID-19 pneumonia. Nature 603:587–598 https://doi.org/10.1038/s41586-022-04447-0
Calendario de vacunación del Gobierno de México. (2023) https://vacunacovid.gob.mx/calendario-vacunacion/. Accessed March 24 2023.
Soto-Estrada G, Moreno-Altamirano L, Pahua-Díaz D (2016) Epidemiological overview of Mexico’s leading causes of morbidity and mortality. Rev Fac Med 59:8–22. http://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S0026-17422016000600008.
Carrillo-Vega MF, Salinas-Escudero G, García-Peña C et al (2020) Early estimation of the risk factors for hospitalization and mortality by COVID-19 in Mexico. PLoS ONE 15(9):e0238905 https://doi.org/10.1371/journal.pone.0238905
Guo W, Li M, Dong Y et al (2020) Diabetes is a risk factor for the progression and prognosis of COVID-19. Diabetes Metab Res Rev 36:1–9 https://doi.org/10.1002/dmrr.3319.
Acknowledgements
We are grateful for the computer resources provided by the project “National High-Performance Computing Laboratory (LANCAD-6-2023) - Machine and deep learning”. Additionally, we thank the anonymous referees for their valuable feedback and suggestions.
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
Alejandro Carvantes-Barrera: conceptualized, applied and evaluated the methodology, analyzed the results and prepared figures and tables. Lorena Díaz-González: conceptualized and supervised the study, analyzed the results, wrote and revised the manuscript. Mauricio Rosales-Rivera: conceptualized and supervised the study, analyzed the results and reviewed the manuscript. Luis A. Chávez: reviewed and edited the manuscript. All authors read and approved the manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Carvantes-Barrera, A., Díaz-González, L., Rosales-Rivera, M. et al. Risk Factors Associated with COVID-19 Lethality: A Machine Learning Approach Using Mexico Database. J Med Syst 47, 90 (2023). https://doi.org/10.1007/s10916-023-01979-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-023-01979-4