Skip to main content
Log in

Exploring disease axes as an alternative to distinct clusters for characterizing sepsis heterogeneity

  • Original
  • Published:
Intensive Care Medicine Aims and scope Submit manuscript

Abstract

Purpose

Various studies have analyzed sepsis subtypes, yet the reproducibility of such results remains unclear. This study aimed to determine the reproducibility of sepsis subtypes across multiple cohorts.

Methods

The study examined 63,547 sepsis patients from six distinct cohorts who had similar sepsis-related characteristics (vital signs, lactate, sequential organ failure assessment score, bilirubin, serum, urine output, and Glasgow coma scale). Identical cluster analysis techniques were used, employing 27 clustering schemes, and normalized mutual information (NMI), a metric ranging from 0 to 1 with higher values indicating better concordance, was employed to quantify the clustering solutions' reproducibility. Principal component analysis (PCA) was utilized to obtain the disease axis, and its uniformity across cohorts was evaluated through patterns of feature loading and correlation.

Results

The reproducibility of sepsis clustering subtypes across the various studies was modest (median NMI ranging from 0.08 to 0.54). The top-down transfer learning method (model trained on cohorts with greater severity was transferred to cohorts with lower severity score) had a higher NMI value than the bottom-up approach (median [Q1, Q3]: 0.64 [0.49, 0.78] vs. 0.23 [0.2, 0.31], p < 0.001). The reproducibility was greater when the transfer solution was performed within United States (US) cohorts. The PCA analysis revealed that the correlation pattern between variables was consistent across all cohorts, and the first two disease axes were the "shock axis" and "systemic inflammatory response syndrome (SIRS) axis."

Conclusions

Cluster analysis of sepsis patients across various cohorts showed modest reproducibility. Sepsis heterogeneity is better characterized through continuous disease axes that coexist to varying degrees within the same individual instead of mutually exclusive subtypes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

Datasets can be obtained by contacting the corresponding author upon a reasonable request.

References

  1. Rajendran S, Xu Z, Pan W, Ghosh A, Wang F (2023) Data heterogeneity in federated learning with electronic health records: case studies of risk prediction for acute kidney injury and sepsis diseases in critical care. PLOS Digit Health 2:e0000117

    Article  PubMed  PubMed Central  Google Scholar 

  2. Wang W, Liu C-F (2023) Sepsis heterogeneity. World J Pediatr. https://doi.org/10.1007/s12519-023-00689-8

    Article  PubMed  PubMed Central  Google Scholar 

  3. Harhay MO, Casey JD, Clement M et al (2020) Contemporary strategies to improve clinical trial design for critical care research: insights from the first critical care clinical trialists workshop. Intensive Care Med 46:930–942

    Article  PubMed  PubMed Central  Google Scholar 

  4. François B, Lambden S, Fivez T et al (2023) Prospective evaluation of the efficacy, safety, and optimal biomarker enrichment strategy for nangibotide, a TREM-1 inhibitor, in patients with septic shock (ASTONISH): a double-blind, randomised, controlled, phase 2b trial. Lancet Respir Med. https://doi.org/10.1016/S2213-2600(23)00158-3

    Article  PubMed  Google Scholar 

  5. Reignier J, Plantefeve G, Mira J-P et al (2023) Low versus standard calorie and protein feeding in ventilated adults with shock: a randomised, controlled, multicentre, open-label, parallel-group trial (NUTRIREA-3). Lancet Respir Med. https://doi.org/10.1016/S2213-2600(23)00092-9

    Article  PubMed  Google Scholar 

  6. Zhang Z, Zhang G, Goyal H, Mo L, Hong Y (2018) Identification of subclasses of sepsis that showed different clinical outcomes and responses to amount of fluid resuscitation: a latent profile analysis. Crit Care 22:347

    Article  PubMed  PubMed Central  Google Scholar 

  7. Seymour CW, Kennedy JN, Wang S et al (2019) Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321:2003–2017

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Mosevoll KA, Hansen BA, Gundersen IM et al (2023) Systemic metabolomic profiles in adult patients with bacterial sepsis: characterization of patient heterogeneity at the time of diagnosis. Biomolecules 13:223

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Qin Y, Caldino Bohn RI, Sriram A et al (2023) Refining empiric subgroups of pediatric sepsis using machine-learning techniques on observational data. Front Pediatr 11:1035576

    Article  PubMed  PubMed Central  Google Scholar 

  10. Charrad M, Ghazzali N, Boiteau V, Niknafs A (2014) NbClust: an R package for determining the relevant number of clusters in a data set. J Stat Soft 61:1–36

    Article  Google Scholar 

  11. Kinney GL, Santorico SA, Young KA et al (2018) Identification of chronic obstructive pulmonary disease axes that predict all-cause mortality: the COPDGene study. Am J Epidemiol 187:2109–2116

    Article  PubMed  PubMed Central  Google Scholar 

  12. Thoral PJ, Peppink JM, Driessen RH et al (2021) Sharing ICU patient data responsibly under the society of critical care medicine/European society of intensive care medicine joint data science collaboration: the Amsterdam university medical centers database (AmsterdamUMCdb) example. Crit Care Med 49:e563–e577

    Article  PubMed  PubMed Central  Google Scholar 

  13. Zhang Z, Chen L, Liu H et al (2022) Gene signature for the prediction of the trajectories of sepsis-induced acute kidney injury. Crit Care 26:398

    Article  PubMed  PubMed Central  Google Scholar 

  14. Pollard TJ, Johnson AEW, Raffa JD, Celi LA, Mark RG, Badawi O (2018) The eICU collaborative research database, a freely available multi-center database for critical care research. Sci Data 5:180178

    Article  PubMed  PubMed Central  Google Scholar 

  15. Xu P, Chen L, Zhu Y et al (2022) Critical care database comprising patients with infection. Front Public Health 10:852410

    Article  PubMed  PubMed Central  Google Scholar 

  16. Johnson AEW, Bulgarelli L, Shen L et al (2023) MIMIC-IV, a freely accessible electronic health record dataset. Sci Data 10:1

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Singer M, Deutschman CS, Seymour CW et al (2016) The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA 315:801–810

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Sterne JAC, White IR, Carlin JB et al (2009) Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ 338:b2393

    Article  PubMed  PubMed Central  Google Scholar 

  19. Vesin A, Azoulay E, Ruckly S et al (2013) Reporting and handling missing values in clinical studies in intensive care units. Intensive Care Med 39:1396–1404

    Article  PubMed  Google Scholar 

  20. Djouzi K, Beghdad-Bey K (2019) A review of clustering algorithms for big data. In: 2019 International Conference on Networking and Advanced Systems (ICNAS). pp. 1–6

  21. Cervantes J, Garcia-Lamont F, Rodríguez-Mazahua L, Lopez A (2020) A comprehensive survey on support vector machine classification: applications, challenges and trends. Neurocomputing 408:189–215

    Article  Google Scholar 

  22. Vinh NX, Epps J, Bailey J (2009) Information theoretic measures for clusterings comparison: is a correction for chance necessary? In: Proceedings of the 26th Annual International Conference on Machine Learning. New York, NY, USA: Association for Computing Machinery, pp. 1073–80

  23. Lever J, Krzywinski M, Altman N (2017) Principal component analysis. Nat Methods 14:641–642

    Article  CAS  Google Scholar 

  24. Bhavani SV, Semler M, Qian ET et al (2022) Development and validation of novel sepsis subphenotypes using trajectories of vital signs. Intensive Care Med 48:1582–1592

    Article  PubMed  PubMed Central  Google Scholar 

  25. Rogers AJ, Leligdowicz A, Contrepois K et al (2021) Plasma metabolites in early sepsis identify distinct clusters defined by plasma lipids. Crit Care Explor 3:e0478

    Article  PubMed  PubMed Central  Google Scholar 

  26. Castaldi PJ, Benet M, Petersen H et al (2017) Do “COPD subtypes” really exist? Thorax 72:998–1006

    Article  PubMed  Google Scholar 

  27. Russell JA, Gordon AC, Williams MD, Boyd JH, Walley KR, Kissoon N (2021) Vasopressor therapy in the intensive care unit. Semin Respir Crit Care Med 42:59–77

    Article  PubMed  Google Scholar 

  28. Li Y, Sun P, Chang K et al (2022) Effect of continuous renal replacement therapy with the oXiris hemofilter on critically ill patients: a narrative review. J Clin Med 11:6719

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Funding

ZZ received funding from the Open Foundation of Key Laboratory of Digital Technology in Medical Diagnostics of Zhejiang Province (SZZD202206), Health Science and Technology Plan of Zhejiang Province (2021KY745), the Fundamental Research Funds for the Central Universities (226–2022-00148), National natural science foundation of China (82272180) and the Project of Drug Clinical Evaluate Research of Chinese Pharmaceutical Association NO.CPA-Z06-ZC-2021–004. YH received funding from the Key Research & Development project of Zhejiang Province (2021C03071).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhongheng Zhang.

Ethics declarations

Conflict of interest

There is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 836 KB)

Supplementary file2 (DOCX 91 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Z., Chen, L., Liu, X. et al. Exploring disease axes as an alternative to distinct clusters for characterizing sepsis heterogeneity. Intensive Care Med 49, 1349–1359 (2023). https://doi.org/10.1007/s00134-023-07226-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00134-023-07226-1

Keywords

Navigation