Introduction

Pulmonary embolism (PE) represents a life-threatening condition caused by the sudden blockage of pulmonary arteries due to blood clots [1]. Given its diverse clinical presentations, accurate diagnosis and prognosis of PE remain sometimes challenging in clinical practices. With a high global prevalence, the complex nature of PE necessitates a nuanced approach to diagnosis and management. Traditional diagnostic methods, such as angiography and D-dimer assays, have demonstrated limitations in terms of sensitivity and specificity [2]. In this context, the integration of imaging biomarkers appears as a promising perspective to not only enhance diagnostic accuracy but also provide insights into individualized treatment approaches.

In recent years, imaging has emerged as a crucial approach for investigating PE with computed tomography pulmonary angiography (CTPA) serving as the established gold standard for diagnosis [3]. Advanced technologies offer the potential to extract specific biomarker-based information [4,5,6]. This development has heightened interest in the identification and validation of imaging biomarkers that not only contribute to the discrimination of PE but also aid in predicting treatment outcomes and prognostic parameters [7,8,9].

This study focuses on investigating the recent developments and advancements in dual-energy computed tomography (DECT) and radiomics for their utility in differentiating pulmonary embolism and predicting disease progression. Through the exploration of these modern technologies, our objective is to enhance the diagnosis and prognosis of PE, ultimately contributing to more effective patient management and care.

Methods

The study was approved by the local ethical committee and was carried out accordingly, which exempted the need for obtaining written informed consent. All analyses were conducted in compliance with local data protection regulations.

Study population

In this study, we compiled clinical data and CT datasets from a total of 131 patients who had undergone CTPA for suspicion of acute PE. The CT scans were performed using a third-generation dual-source dual-energy CT machine at the University Hospital Frankfurt (Frankfurt am Main, Germany). Data from this cohort have previously been reported [8]. The data collection period encompassed January 2015 to March 2022.

All patients were divided into two distinct groups for analysis. One group consisted of patients with central PE and peripheral PE, while the other group who had no PE served as the control group. Inclusion criteria were patients over 18 years with a confirmed PE and the presence of a dedicated CTPA examination depicting the complete lung vasculature. Exclusion criteria in both cohorts encompassed imaging artifacts in the pulmonary artery region (n = 11), inadequate visual delineation of the embolus (n = 7), incomplete examination protocols (n = 4) and insufficient data on disease progression (n = 28). Eligible participants were identified from the picture archiving and communication system (Centricity™ RIS-i 7.0‘ [Version 7.0.3.5, 11/2021] und ‚Centricity™ Universal Viewer‘ [Version 7.0, 08/2021], General Electric Healthcare, Chicago, Illinois, USA) by conducting searches using specific terms such as ‘pulmonary embolism’, ‘pulmonal artery embolism’, ‘central pulmonary embolism’ and ‘peripheral pulmonary embolism’. The process of selecting participants for the study is illustrated in Fig. 1.

Fig. 1
figure 1

Flowchart of patient inclusion. Abbreviations PE, pulmonary embolism

Clinical and laboratory data

We obtained additional patient data and parameters by extracting information from medical reports and the hospital information system. Laboratory chemistry data were retrieved from recorded laboratory findings. The Wells and Pulmonary embolism severity index (PESI) scores were also calculated, further classifying patients into risk classes ranging from 1 to 5 based on the PESI score [2].

Additionally, a retrospective stratification of the patients into the four risk categories according the European Society of Cardiology (ESC) score was performed (low-risk PE, intermediate-low-risk PE, intermediate-high-risk PE, and high-risk PE) [2].

Overall survival was characterized as the duration from the DECT scan to the occurrence of death or the most recent documented follow-up.

Dual-energy CT protocol

The CT scans were conducted using a third generation DECT scanner equipped with a Somatom Force unit manufactured by Siemens Healthineers (Forchheim, Bavaria, Germany).

The examination parameters were as follows: tube A, 90 kVp and 190 mAs; tube B, Sn150 kVp and 95mAs. In tube B, an additional tin filter (Selective Photon Shield II, Siemens Healthineers) was used to reduce radiation exposure. Acquisition parameters were 0.25 s (s) rotation time, 1–2 s acquisition time, 192 × 0.6-millimeter (mm) collimation, and 2.5 pitch value. Scanning was performed in the craniocaudal direction in bolus-triggered arterial and venous phases with 80–120 milliliters (ml) of nonionized contrast agent (Imeron 400, Bracco, Milan, Italy) at an injection rate of 5–6 ml/s, a threshold of 120 HU and a delay of 7s.

Image segmentation and analysis

After anonymization, the CT datasets from all patients were extracted as Digital Imaging and Communications in Medicine (DICOM) datasets and uploaded into 3D Slicer (www.slicer.org, Version 5.0.2, Harvard University, Cambridge, USA).

The segmentation of the embolus and pulmonary trunk was performed in each patient using the interactive segmentation algorithm GrowCut [10,11,12]. An example of the segmentation is shown in Fig. 2. The segmentation in each case was evaluated by two experienced radiologist (V.K. and S.S.M.,  board-certified radiologists with four and seven years of experience in experimental imaging, respectively).

Fig. 2
figure 2

Exemplary segmentation of the pulmonary arteries and the embolus. A, Unsegmented axial slice; B, Segmented axial slice; C, Unsegmented coronal slice; D, Segmented coronal slice; E, Unsegmented sagittal slice; F, Segmented sagittal slice; G, Final segmentation of the embolus. Abbreviations PE, pulmonary embolism

In case of any disagreement with the initial segmentation, the process was repeated, and necessary adjustments were made. All radiologists participating in the assessment were kept entirely unaware of the clinical data pertaining to the patients.

Extraction of radiomics features

Radiomic features were extracted through the utilization of the PyRadiomics extension package incorporated into the 3D Slicer software (Version 5.1.0–2022-05–20). This extraction process generated a total of 107 features for each segmentation (Table S1) [13]. All extracted features were subsequently classified into seven distinct groups, as follows: Gray-Level Dependence Matrix (GLDM), Gray-Level Co-Occurrence Matrix (GLCM), Gray-Level Run Length Matrix (GLRLM), Gray-Level Size Zone Matrix (GLSZM), Neighboring Gray Tone Difference Matrix (NGTDM), Shape, and First Order [13].

To provide a comprehensive assessment of the study’s transparency and quality, we included a CheckList for EvaluAtion of Radiomics Research (CLEAR) [14] in Figure S1.

Radiomics feature selection

In order to identify the most pertinent features for subsequent analysis, multi-stage feature selection process was implemented. Initially, all numerical features were normalized using Z-score standardization. In the next step, the Boruta dimension reduction and feature elimination algorithm, was applied [15]. Furthermore, a correlation analysis was conducted to identify clusters of highly correlated features, defined by a Pearson’s correlation coefficient (r ≥ 0.60). From each cluster, one feature with the highest Gini index was selected for further analysis.

Construction of the radiomics model

A gradient-boosted tree model was trained on the selected radiomic features to discriminate between pulmonary embolism and healthy patients using a training dataset comprising 79 patients. The model was subsequently assessed using a distinct test dataset comprising 52 patients. This test dataset had not been incorporated into the algorithm’s training phase previously.

Statistical analysis

Statistical analysis was conducted with the use of R statistical software (R Foundation for Statistical Computing, Vienna, Austria; Version 2023.06.0 + 421) and MedCalc (MedCalc Software Ltd., Ostend, Belgium; Version 20.123). The normality of the data distribution was assessed through visual methods such as histograms and the Wilk-Shapiro test. Normally distributed values were presented as mean ± standard deviation (SD), while non-normally distributed values were expressed as median and interquartile range (IQR). The t-test was employed for data with a continuous distribution, whereas the Man-Whitney test or Spearman rank correlation coefficient was applied to non-normally distributed data. A significance level of less than 0.05 was considered statistically significant.

The diagnostic accuracy of the optimal predictive parameters was evaluated using the area under the curve (AUC) derived from receiver operating characteristic (ROC) analyses. Subsequently, diagnostic sensitivity and specificity were computed.

Cox proportional hazards models were used to identify independent factors among clinical markers, imaging markers, and radiomics features. Multivariate Cox proportional hazards models were adjusted for significant univariate prognostic parameters and clinically relevant confounders, with hazard ratios and their corresponding 95% confidence intervals (CI) reported.

Results

Patient characteristics

A total of 181 patients were initially considered for study inclusion. After applying exclusion criteria, the ultimate study population encompassed a cohort of 131 patients aged 64 ± 15 years. Among these, 88 individuals (67%) received a final diagnosis of central PE, while 16 patients (12%) were diagnosed with peripheral PE. The comparative group comprised 27 patients who did not have PE at discharge. Table 1 provides an overview of the socio-demographic and clinical characteristics.

Table 1 Baseline characteristics of the study population

Wells Scores were found to be higher in the central PE patient group (p = 0.002), whereas no significant differences were observed for PESI (p = 0.7163). Significantly varying factors between central and peripheral PE patients included heart rate (p < 0.001), D-dimers (p < 0.0001), and CT-derived measurements, such as RV/LV-ratio (p < 0.0004) and truncus pulmonalis diameter (p < 0.0006).

Selection of radiomics features

Following the implementation of the Boruta algorithm, 46 features were retained for subsequent analysis. In the subsequent step, a correlation matrix was generated. Figure 3 provides a visual representation of the correlation matrix, highlighting the crucial features for distinguishing between pulmonary embolism and normal pulmonary artery configurations. After feature reduction, 26 features remained for further analysis. The most significant features identified through the multi-step feature selection process are presented in Table 2.

Fig. 3
figure 3

Correlation matrix. Correlation matrix of the most important features. Abbreviations PE, pulmonary embolism; GLCM, gray-level co-occurrence matrix; GLRLM, gray-level run length matrix; GLSZM, gray-level size zone matrix; NGTDM, neighboring gray tone difference matrix; GLDM, gray-level dependence matrix; MCC, maximal correlation coefficient

Table 2 The most important radiomics features

Performance of the radiomics model in detecting pulmonary embolism

In the next step, a gradient-boosted tree model was trained on these selected radiomic features to differentiate between pulmonary embolism and normal pulmonary artery configurations. To assess the model’s classification performance and its ability to generalize, we applied the trained model to an independent test dataset, encompassing 52 patients who were not part of the initial analysis. The classification accuracy in the validation dataset was 0.94 with an AUC of 0.91. Table 3 provides a comprehensive overview of the classification accuracy, sensitivity, specificity, and AUC.

Table 3 Performance of the radiomics model for the differentiation between patients with and without acute PE

Radiomics features performance in predicting the prognosis of pulmonary embolism

The Cox regression analysis unveiled that the selected radiomic features provided independent prognostic insights into mortality with a c-index of 0.991 (95% CI, 0.979-1.0, p = 0.0001). Following the adjustment of the unadjusted radiomics model by incorporating imaging parameters and clinical parameters showed that the prognostic accuracy decreased. Table 4 provides a summary of the performance of various Cox regression models in predicting the outcomes of patients with PE incorporating radiomic features, imaging parameters and clinical parameters.

Table 4 Performance of different Cox-regression models to predict outcome using radiomics features and clinical parameters.

Discussion

In our study, we explored the potential of DECT-derived radiomics to enhance the diagnosis and prognostication of PE. A precise and reliable diagnosis of PE plays a pivotal role in the management of patients experiencing an acute event. Identifying individuals at high risk for adverse outcomes is crucial for prompt interventions and the application of suitable treatment strategies, effectively minimizing the risk of complications and potential harm [2, 9, 16].

There is a growing interest in integrating machine learning (ML) into various medical contexts, with the potential to alleviate physician workload and expedite the diagnostic process [9, 17]. ML possesses an advantage in managing extensive, intricate and diverse data types, enabling it to automatically handle large datasets. Moreover, it has the capability to provide data-driven insights and contribute to decision-making in a broad range of medical conditions, delivering advantages that go beyond saving time and resources. Over the past decade, a growing body of evidence has indicated the value of incorporating quantitative imaging biomarkers into established clinical decision-making models, enabling automated extraction of valuable imaging features from to diagnose a wide range of medical conditions or to gain tissue information [5, 18].

While past research has explored the capacity of radiomic features to differentiate between malignant and benign masses and predict outcomes across diverse cancer types [4, 19,20,21], there is limited information on techniques for forecasting outcomes in patients with PE. A previous study investigated the performance of distinct ML models based on clinical parameters to assess the diagnostic performance for the detection of PE [9]. The ML models based on clinical markers achieved superior diagnostic performance in diagnosing PE than classical clinical assessment models like the Wells score combined with D-dimer, the revised Geneva score with D-dimer, or the Years algorithm [9].

Another recent study has showcased the capability of models utilizing radiomic features extracted from DECT to classify patients with pulmonary embolism according to established clinical risk scores [8]. Our machine learning classifier attained a classification accuracy of 94% in identifying patients with acute PE, with an area under the receiver operating characteristic curve of 0.91.Furthermore, our model indicates that radiomic features could play a significant role in predicting outcomes for patients with PE. The inclusion of radiomic features demonstrated strong prognostic capabilities in predicting survival (c-index, 0.991 [0.979–1.00], p = 0.0001). Notably, the addition of clinical or DECT parameters did not enhance the predictive power. Even the PESI score, a well-established clinical scoring system for prognostic assessment, failed to augment the predictive strength in our study cohort [2, 22]. One reason for this could be that radiomics can more effectively characterize the various thrombus components that are not visible to the naked eye. Therefore, further investigations are necessary to evaluate the robustness and applicability of this approach, with a focus on its integration into clinical practice.

However, our study acknowledges several limitations that deserve consideration. First, this study was conducted at a single center. This decision was deliberate to minimize variabilities associated with scanners from different manufacturers or scanner generations. Secondly, due to the retrospective study design, inherent limitations may affect the reliability and generalizability of the findings. Lastly, our study yielded an acceptable radiomics quality score [14]. This score provides an overview of the transparency of the study and facilitates the repeatability and reproducibility of radiomics research. While comparable to previous radiomics studies, a higher radiomics quality score could have further enhanced the power of our findings.

Our study offers valuable insights into this innovative approach using radiomics for acute PE diagnosing and prognostication. Nevertheless, these limitations underscore the need for future investigations involving larger and more diverse patient cohorts. Further validation and seamless integration of this approach with established clinical tools have the potential to enhance patient outcomes and refine the delivery of patient care, ultimately advancing the field of risk assessment and patient management for individuals with acute PE. This, in turn, can lead to improved prognostication and better-tailored interventions, optimizing the overall management of patients with acute PE.

In conclusion, our study underscores the promising potential of employing radiomics on DECT imaging for the accurate detection and outcome prediction of patients with acute PE. This innovative approach has the capacity to significantly impact patient care by providing more accurate risk assessment and tailored management strategies.