Dear Editor,
In the past decade, data-driven research utilizing advanced statistical methodology such as complex modeling or artificial intelligence (AI) has increasingly gained popularity in the field of intensive care medicine. One catalyst for these research endeavors has been the introduction of the first publicly accessible intensive care database, MIMIC-II, in 2011 [1]. MIMIC-II with its subsequent updates has facilitated over 1000 publications in various medical fields. Subsequently, these single-center datasets have been complemented by multicenter, United States (US) datasets (eICU-CRD), and most recently by two European single-center databases (HiRID, Amsterdam UMCdb) [2, 3]. Overall, 243,429 patient entries are currently available in the 4 mentioned datasets. However, the granularity of the data is comparatively poor (with exemption of the HiRID), complicating the utilization of AI applications.
Hence, we present the Salzburg Intensive Care database (SICdb), a publicly available, highly granular, once-per-minute critical care dataset [4]. SICdb (1.0.4) contains 27,386 admissions from 4 different intensive care units (ICUs) at 1 single tertiary care institution of the Department of Anesthesiology and Intensive Care Medicine at the Salzburger Landesklinik (SALK) and Paracelsus Medical University (PMU) between 2013 and 2021. SICdb provides highly granulated data at the minute level and is preprocessed hourly. Raw signals were collected utilizing the MetaVision (iMDSoft, Tel Aviv, Israel) patient data management (PDMS) software. Additionally, exports from ORBIS (Dedalus Healthcare GmbH, Bonn, Germany), containing information on admission, discharge, and ICD–10 codes, were included. The workflow to generate the dataset is shown in Fig. 1. The export process strictly adhered to the ACID (Atomicity, Consistency, Isolation, Durability) properties of database transactions. SICdb has been fully approved by the ethics commission of the Land Salzburg, Austria (EK Nr: 1115/2021).
All data are pseudo-anonymized as defined by the European General Data Protection Regulation. The de-identification strategy also complies with the US regulations for health data.
Overall, SICdb contains more than 1.2 billion data points. The mean length of stay in SICdb is 3.50 [SD 6.49] days. Twenty-six thousand four hundred and forty-six [96.55%] patients were alive at ICU discharge and twenty-five thousand two hundred and fifty-two [92.2%] were alive at hospital discharge, respectively. For 24,243 [88.5%] admissions, 1-year out-of-hospital survival data are available. Nineteen thousand six hundred and eighty-nine [81.21%] were still alive 1 year after hospital discharge. The main admission reason to the ICU was postoperative treatment. The mean Simplified Acute Physiology Score III (SAPS III) is 44.67 [SD 14.81]. Two thousand four hundred and six [8.79%] patients were mechanically ventilated > 24 h and one thousand and twenty-seven [3.75%] patients required renal replacement therapy (RRT).
A detailed description of the data, access credentials, and documentation on the removal of all protected health information (PHI), can be found at https://www.sicdb.com. To access and download SICdb, the online repository for physiological signals PhysioNet (https://physionet.org/content/sicdb/1.0.5/) was used [5]. It is mandatory to cite this paper in subsequent analyses of the database.
In conclusion, SICdb is among the largest ICU datasets currently available worldwide. SICdb is designed for continuous enhancement, developed, and growth, including consistently evolving patient cases. SICdb’s exceptional high granularity, coupled with its up-to-date case data, holds substantial value for future research, particularly in terms of potential AI applications.
Data availability
The raw dataset in .csv files is available on physionet.org for credentialed users. At the moment you need to request access using the provided form on PhysioNet.
References
Saeed M, Villarroel M, Reisner AT et al (2011) Multiparameter Intelligent Monitoring in Intensive Care II (MIMIC-II): a public-access intensive care unit database. Crit Care Med 39(5):952. https://doi.org/10.1097/CCM.0B013E31820A92C6
Pollard TJ, Johnson AEW, Raffa JD, Celi LA, Mark RG, Badawi O (2018) The eICU Collaborative Research Database, a freely available multi-center database for critical care research. Sci Data 5:180178. https://doi.org/10.1038/sdata.2018.178
Thoral PJ, Peppink JM, Driessen RH et al (2021) Sharing ICU patient data responsibly under the Society of Critical Care Medicine/European Society of Intensive Care Medicine Joint Data Science Collaboration: the Amsterdam University Medical Centers Database (AmsterdamUMCdb) Example. Crit Care Med 49(6):E563–E577. https://doi.org/10.1097/CCM.0000000000004916
Rodemund N, Kokoefer A, Wernly B, Cozowicz C (2023) “Salzburg Intensive Care database (SICdb), a freely accessible intensive care database” (version 1.0.5). PhysioNet. https://doi.org/10.13026/67ww-6m34
Goldberger AL, Amaral LA, Glass L et al (2000) PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. https://doi.org/10.1161/01.CIR.101.23.E215
Acknowledgements
CJ reports institutional support by the German Research Council (SFB 1116, B06) as well as the State of North Rhine Westphalia (Giga for Health: 5GMedizincampus. NRW, Project number 005-2008-0055 and Multiomics in data science, PROFILNRW-2020-107-A, TP4).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflicts of interest to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/.
About this article
Cite this article
Rodemund, N., Wernly, B., Jung, C. et al. The Salzburg Intensive Care database (SICdb): an openly available critical care dataset. Intensive Care Med 49, 700–702 (2023). https://doi.org/10.1007/s00134-023-07046-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00134-023-07046-3