Abstract
Purpose
Recent advancements in medical imaging have transformed diagnostic assessments, offering exciting possibilities for extracting biomarker-based information. This study aims to investigate the capabilities of a machine learning classifier that incorporates dual-energy computed tomography (DECT) radiomics. The primary focus is on discerning and predicting outcomes related to pulmonary embolism (PE).
Methods
The study included 131 participants who underwent pulmonary artery DECT angiography between January 2015 and March 2022. Among them, 104 patients received the final diagnosis of PE and 27 patients served as a control group. A total of 107 radiomic features were extracted for every case based on DECT imaging. The dataset was divided into training and test sets for model development and validation. Stepwise feature reduction identified the most relevant features, which were used to train a gradient-boosted tree model. Receiver operating characteristics analysis and Cox regression tests assessed the association of texture features with overall survival.
Results
The trained machine learning classifier achieved a classification accuracy of 0.94 for identifying patients with acute PE with an area under the receiver operating characteristic curve of 0.91. Radiomics features could be valuable for predicting outcomes in patients with PE, demonstrating strong prognostic capabilities in survival prediction (c-index, 0.991 [0.979–1.00], p = 0.0001) with a median follow-up of 130 days (IQR, 38–720). Notably, the inclusion of clinical or DECT parameters did not enhance predictive performance.
Conclusion
In conclusion, our study underscores the promising potential of leveraging radiomics on DECT imaging for the identification of patients with acute PE and predicting their outcomes. This approach has the potential to improve clinical decision-making and patient management, offering efficiencies in time and resources by utilizing existing DECT imaging without the need for an additional scoring system.
Similar content being viewed by others
Introduction
Pulmonary embolism (PE) represents a life-threatening condition caused by the sudden blockage of pulmonary arteries due to blood clots [1]. Given its diverse clinical presentations, accurate diagnosis and prognosis of PE remain sometimes challenging in clinical practices. With a high global prevalence, the complex nature of PE necessitates a nuanced approach to diagnosis and management. Traditional diagnostic methods, such as angiography and D-dimer assays, have demonstrated limitations in terms of sensitivity and specificity [2]. In this context, the integration of imaging biomarkers appears as a promising perspective to not only enhance diagnostic accuracy but also provide insights into individualized treatment approaches.
In recent years, imaging has emerged as a crucial approach for investigating PE with computed tomography pulmonary angiography (CTPA) serving as the established gold standard for diagnosis [3]. Advanced technologies offer the potential to extract specific biomarker-based information [4,5,6]. This development has heightened interest in the identification and validation of imaging biomarkers that not only contribute to the discrimination of PE but also aid in predicting treatment outcomes and prognostic parameters [7,8,9].
This study focuses on investigating the recent developments and advancements in dual-energy computed tomography (DECT) and radiomics for their utility in differentiating pulmonary embolism and predicting disease progression. Through the exploration of these modern technologies, our objective is to enhance the diagnosis and prognosis of PE, ultimately contributing to more effective patient management and care.
Methods
The study was approved by the local ethical committee and was carried out accordingly, which exempted the need for obtaining written informed consent. All analyses were conducted in compliance with local data protection regulations.
Study population
In this study, we compiled clinical data and CT datasets from a total of 131 patients who had undergone CTPA for suspicion of acute PE. The CT scans were performed using a third-generation dual-source dual-energy CT machine at the University Hospital Frankfurt (Frankfurt am Main, Germany). Data from this cohort have previously been reported [8]. The data collection period encompassed January 2015 to March 2022.
All patients were divided into two distinct groups for analysis. One group consisted of patients with central PE and peripheral PE, while the other group who had no PE served as the control group. Inclusion criteria were patients over 18 years with a confirmed PE and the presence of a dedicated CTPA examination depicting the complete lung vasculature. Exclusion criteria in both cohorts encompassed imaging artifacts in the pulmonary artery region (n = 11), inadequate visual delineation of the embolus (n = 7), incomplete examination protocols (n = 4) and insufficient data on disease progression (n = 28). Eligible participants were identified from the picture archiving and communication system (Centricity™ RIS-i 7.0‘ [Version 7.0.3.5, 11/2021] und ‚Centricity™ Universal Viewer‘ [Version 7.0, 08/2021], General Electric Healthcare, Chicago, Illinois, USA) by conducting searches using specific terms such as ‘pulmonary embolism’, ‘pulmonal artery embolism’, ‘central pulmonary embolism’ and ‘peripheral pulmonary embolism’. The process of selecting participants for the study is illustrated in Fig. 1.
Clinical and laboratory data
We obtained additional patient data and parameters by extracting information from medical reports and the hospital information system. Laboratory chemistry data were retrieved from recorded laboratory findings. The Wells and Pulmonary embolism severity index (PESI) scores were also calculated, further classifying patients into risk classes ranging from 1 to 5 based on the PESI score [2].
Additionally, a retrospective stratification of the patients into the four risk categories according the European Society of Cardiology (ESC) score was performed (low-risk PE, intermediate-low-risk PE, intermediate-high-risk PE, and high-risk PE) [2].
Overall survival was characterized as the duration from the DECT scan to the occurrence of death or the most recent documented follow-up.
Dual-energy CT protocol
The CT scans were conducted using a third generation DECT scanner equipped with a Somatom Force unit manufactured by Siemens Healthineers (Forchheim, Bavaria, Germany).
The examination parameters were as follows: tube A, 90 kVp and 190 mAs; tube B, Sn150 kVp and 95mAs. In tube B, an additional tin filter (Selective Photon Shield II, Siemens Healthineers) was used to reduce radiation exposure. Acquisition parameters were 0.25 s (s) rotation time, 1–2 s acquisition time, 192 × 0.6-millimeter (mm) collimation, and 2.5 pitch value. Scanning was performed in the craniocaudal direction in bolus-triggered arterial and venous phases with 80–120 milliliters (ml) of nonionized contrast agent (Imeron 400, Bracco, Milan, Italy) at an injection rate of 5–6 ml/s, a threshold of 120 HU and a delay of 7s.
Image segmentation and analysis
After anonymization, the CT datasets from all patients were extracted as Digital Imaging and Communications in Medicine (DICOM) datasets and uploaded into 3D Slicer (www.slicer.org, Version 5.0.2, Harvard University, Cambridge, USA).
The segmentation of the embolus and pulmonary trunk was performed in each patient using the interactive segmentation algorithm GrowCut [10,11,12]. An example of the segmentation is shown in Fig. 2. The segmentation in each case was evaluated by two experienced radiologist (V.K. and S.S.M., board-certified radiologists with four and seven years of experience in experimental imaging, respectively).
In case of any disagreement with the initial segmentation, the process was repeated, and necessary adjustments were made. All radiologists participating in the assessment were kept entirely unaware of the clinical data pertaining to the patients.
Extraction of radiomics features
Radiomic features were extracted through the utilization of the PyRadiomics extension package incorporated into the 3D Slicer software (Version 5.1.0–2022-05–20). This extraction process generated a total of 107 features for each segmentation (Table S1) [13]. All extracted features were subsequently classified into seven distinct groups, as follows: Gray-Level Dependence Matrix (GLDM), Gray-Level Co-Occurrence Matrix (GLCM), Gray-Level Run Length Matrix (GLRLM), Gray-Level Size Zone Matrix (GLSZM), Neighboring Gray Tone Difference Matrix (NGTDM), Shape, and First Order [13].
To provide a comprehensive assessment of the study’s transparency and quality, we included a CheckList for EvaluAtion of Radiomics Research (CLEAR) [14] in Figure S1.
Radiomics feature selection
In order to identify the most pertinent features for subsequent analysis, multi-stage feature selection process was implemented. Initially, all numerical features were normalized using Z-score standardization. In the next step, the Boruta dimension reduction and feature elimination algorithm, was applied [15]. Furthermore, a correlation analysis was conducted to identify clusters of highly correlated features, defined by a Pearson’s correlation coefficient (r ≥ 0.60). From each cluster, one feature with the highest Gini index was selected for further analysis.
Construction of the radiomics model
A gradient-boosted tree model was trained on the selected radiomic features to discriminate between pulmonary embolism and healthy patients using a training dataset comprising 79 patients. The model was subsequently assessed using a distinct test dataset comprising 52 patients. This test dataset had not been incorporated into the algorithm’s training phase previously.
Statistical analysis
Statistical analysis was conducted with the use of R statistical software (R Foundation for Statistical Computing, Vienna, Austria; Version 2023.06.0 + 421) and MedCalc (MedCalc Software Ltd., Ostend, Belgium; Version 20.123). The normality of the data distribution was assessed through visual methods such as histograms and the Wilk-Shapiro test. Normally distributed values were presented as mean ± standard deviation (SD), while non-normally distributed values were expressed as median and interquartile range (IQR). The t-test was employed for data with a continuous distribution, whereas the Man-Whitney test or Spearman rank correlation coefficient was applied to non-normally distributed data. A significance level of less than 0.05 was considered statistically significant.
The diagnostic accuracy of the optimal predictive parameters was evaluated using the area under the curve (AUC) derived from receiver operating characteristic (ROC) analyses. Subsequently, diagnostic sensitivity and specificity were computed.
Cox proportional hazards models were used to identify independent factors among clinical markers, imaging markers, and radiomics features. Multivariate Cox proportional hazards models were adjusted for significant univariate prognostic parameters and clinically relevant confounders, with hazard ratios and their corresponding 95% confidence intervals (CI) reported.
Results
Patient characteristics
A total of 181 patients were initially considered for study inclusion. After applying exclusion criteria, the ultimate study population encompassed a cohort of 131 patients aged 64 ± 15 years. Among these, 88 individuals (67%) received a final diagnosis of central PE, while 16 patients (12%) were diagnosed with peripheral PE. The comparative group comprised 27 patients who did not have PE at discharge. Table 1 provides an overview of the socio-demographic and clinical characteristics.
Wells Scores were found to be higher in the central PE patient group (p = 0.002), whereas no significant differences were observed for PESI (p = 0.7163). Significantly varying factors between central and peripheral PE patients included heart rate (p < 0.001), D-dimers (p < 0.0001), and CT-derived measurements, such as RV/LV-ratio (p < 0.0004) and truncus pulmonalis diameter (p < 0.0006).
Selection of radiomics features
Following the implementation of the Boruta algorithm, 46 features were retained for subsequent analysis. In the subsequent step, a correlation matrix was generated. Figure 3 provides a visual representation of the correlation matrix, highlighting the crucial features for distinguishing between pulmonary embolism and normal pulmonary artery configurations. After feature reduction, 26 features remained for further analysis. The most significant features identified through the multi-step feature selection process are presented in Table 2.
Performance of the radiomics model in detecting pulmonary embolism
In the next step, a gradient-boosted tree model was trained on these selected radiomic features to differentiate between pulmonary embolism and normal pulmonary artery configurations. To assess the model’s classification performance and its ability to generalize, we applied the trained model to an independent test dataset, encompassing 52 patients who were not part of the initial analysis. The classification accuracy in the validation dataset was 0.94 with an AUC of 0.91. Table 3 provides a comprehensive overview of the classification accuracy, sensitivity, specificity, and AUC.
Radiomics features performance in predicting the prognosis of pulmonary embolism
The Cox regression analysis unveiled that the selected radiomic features provided independent prognostic insights into mortality with a c-index of 0.991 (95% CI, 0.979-1.0, p = 0.0001). Following the adjustment of the unadjusted radiomics model by incorporating imaging parameters and clinical parameters showed that the prognostic accuracy decreased. Table 4 provides a summary of the performance of various Cox regression models in predicting the outcomes of patients with PE incorporating radiomic features, imaging parameters and clinical parameters.
Discussion
In our study, we explored the potential of DECT-derived radiomics to enhance the diagnosis and prognostication of PE. A precise and reliable diagnosis of PE plays a pivotal role in the management of patients experiencing an acute event. Identifying individuals at high risk for adverse outcomes is crucial for prompt interventions and the application of suitable treatment strategies, effectively minimizing the risk of complications and potential harm [2, 9, 16].
There is a growing interest in integrating machine learning (ML) into various medical contexts, with the potential to alleviate physician workload and expedite the diagnostic process [9, 17]. ML possesses an advantage in managing extensive, intricate and diverse data types, enabling it to automatically handle large datasets. Moreover, it has the capability to provide data-driven insights and contribute to decision-making in a broad range of medical conditions, delivering advantages that go beyond saving time and resources. Over the past decade, a growing body of evidence has indicated the value of incorporating quantitative imaging biomarkers into established clinical decision-making models, enabling automated extraction of valuable imaging features from to diagnose a wide range of medical conditions or to gain tissue information [5, 18].
While past research has explored the capacity of radiomic features to differentiate between malignant and benign masses and predict outcomes across diverse cancer types [4, 19,20,21], there is limited information on techniques for forecasting outcomes in patients with PE. A previous study investigated the performance of distinct ML models based on clinical parameters to assess the diagnostic performance for the detection of PE [9]. The ML models based on clinical markers achieved superior diagnostic performance in diagnosing PE than classical clinical assessment models like the Wells score combined with D-dimer, the revised Geneva score with D-dimer, or the Years algorithm [9].
Another recent study has showcased the capability of models utilizing radiomic features extracted from DECT to classify patients with pulmonary embolism according to established clinical risk scores [8]. Our machine learning classifier attained a classification accuracy of 94% in identifying patients with acute PE, with an area under the receiver operating characteristic curve of 0.91.Furthermore, our model indicates that radiomic features could play a significant role in predicting outcomes for patients with PE. The inclusion of radiomic features demonstrated strong prognostic capabilities in predicting survival (c-index, 0.991 [0.979–1.00], p = 0.0001). Notably, the addition of clinical or DECT parameters did not enhance the predictive power. Even the PESI score, a well-established clinical scoring system for prognostic assessment, failed to augment the predictive strength in our study cohort [2, 22]. One reason for this could be that radiomics can more effectively characterize the various thrombus components that are not visible to the naked eye. Therefore, further investigations are necessary to evaluate the robustness and applicability of this approach, with a focus on its integration into clinical practice.
However, our study acknowledges several limitations that deserve consideration. First, this study was conducted at a single center. This decision was deliberate to minimize variabilities associated with scanners from different manufacturers or scanner generations. Secondly, due to the retrospective study design, inherent limitations may affect the reliability and generalizability of the findings. Lastly, our study yielded an acceptable radiomics quality score [14]. This score provides an overview of the transparency of the study and facilitates the repeatability and reproducibility of radiomics research. While comparable to previous radiomics studies, a higher radiomics quality score could have further enhanced the power of our findings.
Our study offers valuable insights into this innovative approach using radiomics for acute PE diagnosing and prognostication. Nevertheless, these limitations underscore the need for future investigations involving larger and more diverse patient cohorts. Further validation and seamless integration of this approach with established clinical tools have the potential to enhance patient outcomes and refine the delivery of patient care, ultimately advancing the field of risk assessment and patient management for individuals with acute PE. This, in turn, can lead to improved prognostication and better-tailored interventions, optimizing the overall management of patients with acute PE.
In conclusion, our study underscores the promising potential of employing radiomics on DECT imaging for the accurate detection and outcome prediction of patients with acute PE. This innovative approach has the capacity to significantly impact patient care by providing more accurate risk assessment and tailored management strategies.
Data availability
All data and information used are included in this manuscript.
Abbreviations
- PE:
-
Pulmonary embolism
- CT:
-
Computed tomography
- DECT:
-
Dual energy computed tomography
- AUC:
-
Area under the curve
- ROC:
-
Receiver operating characteristic
- GLDM:
-
Gray-Level Dependence Matrix
- GLCM:
-
Gray-Level Co-Occurrence Matrix
- GLRLM:
-
Gray-Level Run Length Matrix
- GLSZM:
-
Gray-Level Size Zone Matrix
- NGTDM:
-
Neighboring Gray Tone Difference Matrix
- ML:
-
Machine learning
- PESI:
-
Pulmonary Embolism Severity Index
References
Goldhaber SZ, Bounameaux H (Mai 2012) „Pulmonary embolism and deep vein thrombosis. Lancet 379:1835–1846. https://doi.org/10.1016/S0140-6736(11)61904-1
S. V. Konstantinides u. a, „2019 ESC Guidelines for the diagnosis and management of acute pulmonary embolism developed in collaboration with the European Respiratory Society (ERS), European Heart Journal, Bd. 41, Nr. 4, S. 543–603, (2020) https://doi.org/10.1093/eurheartj/ehz405
C. B. Monti u. a, „Dual-energy CT performance in acute pulmonary embolism: a meta-analysis. Eur Radiol, Bd. 31, Nr. 8, S. 6248–6258, (2021) https://doi.org/10.1007/s00330-020-07633-8
S. Bernatz u. a, „CT-radiomics and clinical risk scores for response and overall survival prognostication in TACE HCC patients, Sci Rep, Bd. 13, Nr. 1, S. 533, (2023) https://doi.org/10.1038/s41598-023-27714-0
M. P. Than u. a, „Machine Learning to Predict the Likelihood of Acute Myocardial Infarction, Circulation, Bd. 140, Nr. 11, S. 899–909, (2019) https://doi.org/10.1161/CIRCULATIONAHA.119.041980
Islam NU, Zhou Z, Gehlot S, Gotway MB (2023) und J. Liang, „Seeking an optimal approach for Computer-aided Diagnosis of Pulmonary Embolism, Med Image Anal, Bd. 91, S. 102988, Okt. https://doi.org/10.1016/j.media.2023.102988
Xia W, Yu H, Chen W, Chen B, Huang Y (Aug. 2022) „A Radiological Nomogram to predict 30-day mortality in patients with Acute Pulmonary Embolism. Acad Radiol 29:1169–1177. https://doi.org/10.1016/j.acra.2021.11.021
J. Gotta u. a, „Imaging-based risk stratification of patients with pulmonary embolism based on dual-energy CT-derived radiomics. Eur J Clin Invest, S. e14139, Dez. (2023) https://doi.org/10.1111/eci.14139
u. a. Xi L et al (Okt. 2023) A machine learning model for diagnosing acute pulmonary embolism and comparison with Wells score, revised Geneva score, and years algorithm. Chin Med J. https://doi.org/10.1097/CM9.0000000000002837
E. R. Velazquez u. a, „Volumetric CT-based segmentation of NSCLC using 3D-Slicer, Sci Rep, Bd. 3, Nr. 1, S. 3529, Dez. (2013) https://doi.org/10.1038/srep03529
J. Egger u. a, „GBM Volumetry using the 3D Slicer Medical Image Computing Platform. Sci Rep, Bd. 3, Nr. 1, S. 1364, (2013) https://doi.org/10.1038/srep01364
Gonzalo Domínguez M, Hernández C, Ruisoto P, Juanes JA, Prats A (Juni 2016) Hernández, „Morphological and volumetric Assessment of cerebral ventricular system with 3D Slicer Software. J Med Syst 40:154 Nr. 6. https://doi.org/10.1007/s10916-016-0510-9
Fedorov A et al (2012) „3D Slicer as an image computing platform for the Quantitative Imaging Network, Magnetic Resonance Imaging, Bd. 30, Nr. 9, S. 1323–1341, Nov. https://doi.org/10.1016/j.mri.2012.05.001
Kocak B (Mai 2023) „CheckList for EvaluAtion of Radiomics research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Imaging 14(1):75. https://doi.org/10.1186/s13244-023-01415-8
Kursa MB, Rudnicki WR (2010) „Feature selection with the Boruta Package. J Stat Soft 36(11). https://doi.org/10.18637/jss.v036.i11
G. Mansella u. a, „Delayed Diagnosis in Pulmonary Embolism: Frequency, Patient Characteristics, and Outcome, Respiration, Bd. 99, Nr. 7, S. 589–597, (2020) https://doi.org/10.1159/000508396
Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H (2018) „eDoctor: machine learning and the future of medicine. J Intern Med 284(6):603–619. https://doi.org/10.1111/joim.12822
van Timmeren JE, Cester D, Tanadini-Lang S, Alkadhi H, Baessler B (2020) „Radiomics in medical imaging—how-to guide and critical reflection, Insights into Imaging, Bd. 11, Nr. 1, S. 91, Aug. https://doi.org/10.1186/s13244-020-00887-2
Bera K, Braman N, Gupta A, Velcheti V, Madabhushi A (Feb. 2022) „Predicting cancer outcomes with radiomics and artificial intelligence in radiology. Nat Rev Clin Oncol 19:132–146. https://doi.org/10.1038/s41571-021-00560-7
Hebei Medical University Fourth Affiliated Hospital and Hebei Provincial Tumor Hospital, Shijiazhuang, China u. a., „Application of radiomics in predicting the preoperative risk stratification of gastric stromal tumors, Diagn Interv Radiol, Bd. 28, Nr. 6, S. 532–539, Dez et al (2022) https://doi.org/10.5152/dir.2022.21033
H. Mao u. a, „MRI-Based Radiomics Models for Predicting Risk Classification of Gastrointestinal Stromal Tumors. Front Oncol, Bd. 11, S. 631927, (2021) https://doi.org/10.3389/fonc.2021.631927
Surov A, Thormann M, Bär C, Wienke A, Borggrefe J (2023) „Validation of clinical-radiological scores for prognosis of mortality in acute pulmonary embolism, Respir Res, Bd. 24, Nr. 1, S. 195, Aug. https://doi.org/10.1186/s12931-023-02489-0
Acknowledgements
None.
Funding
Open Access funding enabled and organized by Projekt DEAL. This research project did not receive any funding.
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
All authors have contributed significantly to this work, participating in the conception, design, analysis, interpretation of data, and writing/editing of this manuscript. J.G. and V.K. wrote the manuscript and designed the research. The other authors were involved in the analysis of the data and performed research. All authors revised the manuscript.
Corresponding author
Ethics declarations
Ethical approval
Institutional Review Board approval was obtained.
Conflict of interest
C.B. received speaking fees from Siemens Healthineers. The other authors have no potential conflict of interest to disclose.
Informed consent
Not applicable.
Statistics and biometry
No complex statistical methods were necessary for this paper.
Patient consent statement
Not applicable.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gotta, J., Gruenewald, L.D., Martin, S.S. et al. From pixels to prognosis: Imaging biomarkers for discrimination and outcome prediction of pulmonary embolism. Emerg Radiol (2024). https://doi.org/10.1007/s10140-024-02216-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10140-024-02216-2