Skip to main content

Advertisement

Log in

Risk Factors Associated with COVID-19 Lethality: A Machine Learning Approach Using Mexico Database

  • Original Paper
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

Identifying risk factors associated with COVID-19 lethality is crucial in combating the ongoing pandemic. In this study, we developed lethality predictive models for each epidemiological wave and for the overall dataset using the Extreme Gradient Boosting technique and analyzed them using Shapley values to determine the contribution levels of various features, including demographics, comorbidities, medical units, and recent medical information from confirmed COVID-19 cases in Mexico between February 23, 2020, and April 15, 2022. The results showed that pneumonia and advanced age were the most important factors predicting patient death in all cohorts. Additionally, the medical unit where the patient received care acted as a risk or protective factor. IMSS medical units were identified as high-risk factors in all cohorts, except in wave four, while SSA medical units generally were moderate protective factors. We also found that intubation was a high-risk factor in the first epidemiological wave and a moderate-risk factor in the following waves. Female gender was a protective factor of moderate-high importance in all cohorts, while being between 18 and 29 years old was a moderate protective factor and being between 50 and 59 years old was a moderate risk factor. Additionally, diabetes (all cohorts), obesity (third wave), and hypertension (fourth wave) were identified as moderate risk factors. Finally, residing in municipalities with the lowest Human Development Index level represented a moderate risk factor. In conclusion, this study identified several significant risk factors associated with COVID-19 lethality in Mexico, which could aid policymakers in developing targeted interventions to reduce mortality rates.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data Availability

The Python notebook and datasets used in this work are available at web repository https://github.com/Above02/Covid19_Analysis.

References

  1. WHO Coronavirus (COVID-19) Dashboard (2023) https://covid19.who.int/. Accessed 24 March 2023.

  2. WHO characterizes COVID-19 as a pandemic. (2023) https://www.paho.org/en/news/11-3-2020-who-characterizes-covid-19-pandemic. Accessed 24 March 2023.

  3. Chen XS, OA Laurent, NN Onur, Kleineberg, GR et al (2021) A systematic review of neurological symptoms and complications of COVID-19. J Neurol 268:392–402. https://doi.org/10.1007/s00415-020-10067-3

    Article  CAS  PubMed  Google Scholar 

  4. Interpretable Machine Learning. A Guide for Making Black Box Models Explainable (2023) https://christophm.github.io/interpretable-ml-book/. Accessed 24 March 2023.

  5. Datos Abiertos de COVID-19 (2022) https://www.gob.mx/salud/documentos/datos-abiertos-bases-historicas-direccion-general-de-epidemiologia Accessed July 6, 2023.

  6. Secretaría de Salud. Lineamiento estandarizado para la vigilancia epidemiológica y por laboratorio de la enfermedad respiratoria viral (2022) https://www.gob.mx/cms/uploads/attachment/file/715444/Lineamiento_VE_y_Lab_Enf_Viral_05042022.pdf Accessed June 24 2023.

  7. Najera H, Ortega-Avila AG (2020) Health and Institutional Risk Factors of COVID-19 lethality in Mexico. Am J Prev Med 60:471–477. https://doi.org/10.1016/j.amepre.2020.10.015.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Bae SA, Kim SR, Kim MN et al (2021) Park, Impact of cardiovascular disease and risk factors on fatal outcomes in patients with COVID-19 according to age: A systematic review and meta-analysis. Heart 107:373–380. https://doi.org/10.1136/heartjnl-2020-317901.

    Article  CAS  PubMed  Google Scholar 

  9. Salinas-Aguirre JE, Sánchez-García C, Rodríguez-Sanchez R et al (2020) Clinical characteristics and comorbidities associated with lethality in patients with COVID-19 in Coahuila (Mexico. Rev Clin Esp 222:288–292. https://doi.org/10.1016/j.rce.2020.12.006.

    Article  CAS  Google Scholar 

  10. Escobedo-de la Peña J, Rascón-Pacheco RA, de Jesús Ascencio-Montiel I et al (2021) Hypertension, Diabetes and Obesity, Major Risk Factors for Death in Patients with COVID-19 in Mexico. Arch Med Res 52:443–449. https://doi.org/10.1016/j.arcmed.2020.12.002.

    Article  CAS  Google Scholar 

  11. ERA-EDTA Council (2021) ERACODA Working Group, Chronic kidney disease is a key risk factor for severe COVID-19: A call to action by the ERA-EDTA. Nephro Dial Transplant 36:87–94. https://doi.org/10.1093/ndt/gfaa314

    Article  CAS  Google Scholar 

  12. Wong KCY, So HC (2020) Uncovering clinical risk factors and prediction of severe COVID-19: A machine learning approach based on UK Biobank data. MedRxiv 2020.09.18.20197319. https://doi.org/10.1101/2020.09.18.20197319

  13. Tehrani S, Killander A, Åstrand P et al (2021) Risk factors for death in adult COVID-19 patients: Frailty predicts fatal outcome in older patients. Int J Infect Dis 102:415–421. https://doi.org/10.1101/2020.09.18.20197319

    Article  CAS  PubMed  Google Scholar 

  14. Chávez-Almazán LA, Díaz-González L., Rosales-rivera M (2022) Socioeconomic status and its effects on morbidity, lethality, and lethality due to COVID-19 in Mexico. Gac Méd Méx 158:4–11. https://doi.org/10.24875/gmm.21000302.

    Article  Google Scholar 

  15. Quiroz-Juárez MA, Torres-Gómez A, Hoyo-Ulloa I et al (2021) Identification of high-risk COVID-19 patients using machine learning. PLoS One 16:1–21. https://doi.org/10.1371/journal.pone.0257234.

    Article  CAS  Google Scholar 

  16. Solis J, Franco-Paredes C, Henao-Martinez AF, et al (2020) Structural vulnerability in the U.S. revealed in three waves of COVID-19. Am J Trop Med Hyg 103:25–278. https://doi.org/10.4269/ajtmh.20-0391.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Di Castelnuovo A, Bonaccio M, Costanzo S, et al (2020) Common cardiovascular risk factors and in-hospital lethality in 3,894 patients with COVID-19: survival analysis and machine learning-based findings from the multicentre Italian CORIST Study. Nutr. Metab. Cardiovasc. Dis. 30:1899–1913. https://doi.org/10.1016/j.numecd.2020.07.031.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Smith M, Alvarez F (2020) Identifying lethality factors from Machine Learning using Shapley values – a case of COVID19. Expert Syst Appl 176:1–12, https://doi.org/10.1016/j.eswa.2021.114832.

    Article  Google Scholar 

  19. Booth AL, Abels E, McCaffrey P (2021) Development of a prognostic model for lethality in COVID-19 infection using machine learning. Mod Pathol 34:522–531, https://doi.org/10.1038/s41379-020-00700-x.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Davazdahemami B, Zolbanin HM, Delen D (2021) An explanatory machine learning framework for studying pandemics: The case of COVID-19 emergency department readmissions. Decis Support Syst 161, 113730 https://doi.org/10.1016/j.dss.2022.113730.

    Article  Google Scholar 

  21. Informe de pobreza en los municipios de México 2015. México: Consejo Nacional de Evaluación de la Política de Desarrollo Social. https://www.coneval.org.mx/Medicion/Paginas/Pobreza-municipio-2010-2020.aspx. Accessed 24 March 2023.

  22. Pérez-Tamayo R (2016) Patología de la pobreza. El Colegio Nacional, México.

    Google Scholar 

  23. Jing N, Shi Z, Hu Y, Yuan J (2022) Cross-sectional analysis and data-driven forecasting of confirmed COVID-19 cases. Appl Intell 52:3303–3318 https://doi.org/10.1007/s10489-021-02616-8.

    Article  Google Scholar 

  24. Banerjee T, Paul A, Srikanth, Strümke I (2022) Socioeconomic Disparities and COVID-19: The Causal Connections. SSRN Electron J https://doi.org/10.2139/ssrn.4013119

    Article  Google Scholar 

  25. Vaid A, Somani S, Russak AJ et al (2020) Machine learning to predict lethality and critical events in a cohort of patients with COVID-19 in New York City: Model development and validation. J Med Internet Res 22:1–19. https://doi.org/10.2196/24018.

    Article  Google Scholar 

  26. Schapire RE (2014) Boosting: Foundations and Algorithms, Massachusetts Institute of Technology, USA.

    Google Scholar 

  27. Chen T, Guestrin C (2016) XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August:785–794 https://doi.org/10.1145/2939672.2939785

  28. Pedragosa F (2019) Scikit-learn: Machine Learning in Python. J Mach Learn Res 127:2825–2830, https://doi.org/10.1289/EHP4713.

    Article  Google Scholar 

  29. Berrar D (2018) Cross-validation. Encycl. Bioinforma. Comput. Biol ABC Bioinforma 1–3 542–545, https://doi.org/10.1016/B978-0-12-809633-8.20349-X.

    Article  Google Scholar 

  30. Muschelli J (2019) ROC and AUC with a Binary Predictor: a Potentially Misleading Metric. J Classif 37:696–708. https://doi.org/10.1007/s00357-019-09345-1

    Article  PubMed  PubMed Central  Google Scholar 

  31. Kornbrot D (2014) Point biserial correlation. John Wiley & Sons, Ltd and republished in Wiley StatsRef: Statistics Reference Online. https://doi.org/10.1002/9781118445112.stat06227

  32. Rod JE, Oviedo-Trespalacios O, Cortes-Ramirez J (2020) A brief-review of the risk factors for covid-19 severity. Rev Saude Publica 54:1–11 https://doi.org/10.11606/S1518-8787.2020054002481.

    Article  Google Scholar 

  33. Ya-dong G, Ding M, Dong X et al (2021) Risk factors for severe and critically ill COVID-19 patients: a review. Allergy 76:428–455. https://doi.org/10.1111/all.14657

    Article  CAS  Google Scholar 

  34. Zhang Q, Bastard P, COVID Human Genetic Effort, et al (2022) Human genetic and immunological determinants of critical COVID-19 pneumonia. Nature 603:587–598 https://doi.org/10.1038/s41586-022-04447-0

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Calendario de vacunación del Gobierno de México. (2023) https://vacunacovid.gob.mx/calendario-vacunacion/. Accessed March 24 2023.

  36. Soto-Estrada G, Moreno-Altamirano L, Pahua-Díaz D (2016) Epidemiological overview of Mexico’s leading causes of morbidity and mortality. Rev Fac Med 59:8–22. http://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S0026-17422016000600008.

    Google Scholar 

  37. Carrillo-Vega MF, Salinas-Escudero G, García-Peña C et al (2020) Early estimation of the risk factors for hospitalization and mortality by COVID-19 in Mexico. PLoS ONE 15(9):e0238905 https://doi.org/10.1371/journal.pone.0238905

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Guo W, Li M, Dong Y et al (2020) Diabetes is a risk factor for the progression and prognosis of COVID-19. Diabetes Metab Res Rev 36:1–9 https://doi.org/10.1002/dmrr.3319.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We are grateful for the computer resources provided by the project “National High-Performance Computing Laboratory (LANCAD-6-2023) - Machine and deep learning”. Additionally, we thank the anonymous referees for their valuable feedback and suggestions.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Alejandro Carvantes-Barrera: conceptualized, applied and evaluated the methodology, analyzed the results and prepared figures and tables. Lorena Díaz-González: conceptualized and supervised the study, analyzed the results, wrote and revised the manuscript. Mauricio Rosales-Rivera: conceptualized and supervised the study, analyzed the results and reviewed the manuscript. Luis A. Chávez: reviewed and edited the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Lorena Díaz-González.

Ethics declarations

Competing Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Carvantes-Barrera, A., Díaz-González, L., Rosales-Rivera, M. et al. Risk Factors Associated with COVID-19 Lethality: A Machine Learning Approach Using Mexico Database. J Med Syst 47, 90 (2023). https://doi.org/10.1007/s10916-023-01979-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10916-023-01979-4

Keywords

Navigation