Left Ventricular Myocardial Dysfunction Evaluation in Thalassemia Patients Using Echocardiographic Radiomic Features and Machine Learning Algorithms

Taleie, Haniyeh; Hajianfar, Ghasem; Sabouri, Maziar; Parsaee, Mozhgan; Houshmand, Golnaz; Bitarafan-Rajabi, Ahmad; Zaidi, Habib; Shiri, Isaac

doi:10.1007/s10278-023-00891-0

Left Ventricular Myocardial Dysfunction Evaluation in Thalassemia Patients Using Echocardiographic Radiomic Features and Machine Learning Algorithms

Open access
Published: 21 September 2023

Volume 36, pages 2494–2506, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Digital Imaging Aims and scope Submit manuscript

Left Ventricular Myocardial Dysfunction Evaluation in Thalassemia Patients Using Echocardiographic Radiomic Features and Machine Learning Algorithms

Download PDF

Haniyeh Taleie¹,
Ghasem Hajianfar²,
Maziar Sabouri^1,3,
Mozhgan Parsaee⁴,
Golnaz Houshmand³,
Ahmad Bitarafan-Rajabi^4,5,
Habib Zaidi^2,6,7,8 &
…
Isaac Shiri ORCID: orcid.org/0000-0002-5735-0736^2,9

1802 Accesses
2 Citations
2 Altmetric
Explore all metrics

Abstract

Heart failure caused by iron deposits in the myocardium is the primary cause of mortality in beta-thalassemia major patients. Cardiac magnetic resonance imaging (CMRI) T2* is the primary screening technique used to detect myocardial iron overload, but inherently bears some limitations. In this study, we aimed to differentiate beta-thalassemia major patients with myocardial iron overload from those without myocardial iron overload (detected by T2*CMRI) based on radiomic features extracted from echocardiography images and machine learning (ML) in patients with normal left ventricular ejection fraction (LVEF > 55%) in echocardiography. Out of 91 cases, 44 patients with thalassemia major with normal LVEF (> 55%) and T2* ≤ 20 ms and 47 people with LVEF > 55% and T2* > 20 ms as the control group were included in the study. Radiomic features were extracted for each end-systolic (ES) and end-diastolic (ED) image. Then, three feature selection (FS) methods and six different classifiers were used. The models were evaluated using various metrics, including the area under the ROC curve (AUC), accuracy (ACC), sensitivity (SEN), and specificity (SPE). Maximum relevance-minimum redundancy-eXtreme gradient boosting (MRMR-XGB) (AUC = 0.73, ACC = 0.73, SPE = 0.73, SEN = 0.73), ANOVA-MLP (AUC = 0.69, ACC = 0.69, SPE = 0.56, SEN = 0.83), and recursive feature elimination-K-nearest neighbors (RFE-KNN) (AUC = 0.65, ACC = 0.65, SPE = 0.64, SEN = 0.65) were the best models in ED, ES, and ED&ES datasets. Using radiomic features extracted from echocardiographic images and ML, it is feasible to predict cardiac problems caused by iron overload.

Left ventricular global function index is associated with myocardial iron overload and heart failure in thalassemia major patients

Article 13 January 2023

How does iron deposition modify the myocardium? A feature-tracking cardiac magnetic resonance study

Article 08 June 2021

Four‑Dimensional Echocardiographic Evaluation of Cardiac Iron Overload in Patients with Beta-Thalassemia Major

Article 18 December 2023

Introduction

One of the common forms of inherited anemia caused by a malfunction with hemoglobin synthesis is thalassemia [1]. Approximately 1.5% of people worldwide, according to a World Health Organization (WHO) estimate, may be thalassemia carriers [1]. Defects in the synthesis of hemoglobin chains occur in one of the forms alpha-thalassemia (reduction or lack of synthesis of alpha globin chains) and beta-thalassemia (decrease or absence of synthesis of beta-globin chains) [2]. Beta-thalassemia is a genetic disorder that leads to the incomplete synthesis of beta-globin chains and, eventually, hemolytic anemia [3]. Beta-thalassemia is a common genetic disorder worldwide which roughly 9% of thalassemia patients suffer from it [4]. Beta-thalassemia is seen in one of the forms encompassing beta-thalassemia major, beta-thalassemia intermedia, and beta-thalassemia minor [5].

Patients with thalassemia major require frequent blood transfusions due to severe anemia, and regular blood transfusions cause iron overload in these patients [5]. Iron overload can lead to heart problems (cardiomyopathy), liver and endocrine gland involvement [5, 6], osteoporosis, splenomegaly, chronic hepatitis, and delayed growth and sexual maturity in children [5]. Among the stated complications, heart failure caused by iron deposition in the myocardium is the leading cause of mortality in 71% of beta-thalassemia major patients [7]. Cardiomyopathy evoked by cardiac siderosis (iron overload) is the most common and life-threatening issue in B-thalassemia major patients [5, 8]. Moreover, the life span of these patients is limited [9]. Hence, early recognition of myocardial dysfunction can lead to initiating the iron chelation treatment in time [4] to reverse cardiomyopathy caused by iron deposition [10]. However, the initial diagnosis of heart failure patients is difficult, as the left ventricular function is preserved until the later stages of the disease in these patients [9].

Identifying and measuring iron deposits, especially in the heart, in patients with thalassemia major with frequent injections and patients receiving chelation treatments should be considered. There are several methods to evaluate and control iron overload. Serum ferritin level, as the broadest tool with the lowest cost in assessing the body’s iron concentration [11], may change under the influence of several conditions, including inflammation, infection, liver damage, and chelation treatments. Therefore, it cannot accurately measure iron overload [12, 13]. Another approach to testing iron overload is liver biopsy, an invasive procedure with risks and complications [10] that cannot analyze and identify cardiac iron deposits [14]. In addition, due to the non-uniform distribution of iron in the liver, the results obtained from the biopsy may not have the necessary accuracy [12, 15], such that in a situation where a significant liver iron overload is seen, cardiac iron overload may not be evident [15]. On the other hand, a myocardial biopsy lacks the sensitivity required to identify cardiac iron deposits due to [14] the fragmentation of iron deposits [16].

Cardiac magnetic resonance imaging (CMRI) is a non-invasive method to evaluate cardiac iron deposits. Moreover, T2*CMRI is an outstanding and non-invasive technique in evaluating myocardial iron deposits [17,18,19,20]. T2* relaxation time is a parameter in MRI that shows the speed of signal decay in tissues with an iron overload [21]. As cardiac iron deposition increases, T2* value in MRI decreases [4, 6, 8, 14, 21, 22] because iron disrupts the magnetic field’s uniformity and speeds up signal decay [14, 16, 23]. Despite the advantages of this technique, CMRI is costly and is not generally available in all medical centers [8,9,10, 18]. In addition, some patients are claustrophobic or cannot undertake MRI due to having metal implants or devices such as pacemaker [18].

Echocardiography is a method to identify cardiac dysfunction caused by iron overload in patients with thalassemia major. In studies conducted by Abtahi et al. [4] and Khaled et al.[13], no significant relationship was observed between LVEF from echocardiography and cardiac T2*. One of the most used imaging procedures in cardiology is echocardiography; however, the correct interpretation of its findings relies on the user’s experience and knowledge [24]. Artificial intelligence, machine learning (ML), and radiomic analysis could be potentially efficient in this era. Radiomics involves the extraction with high throughput of quantitative features from digital images to construct predictive or diagnostic models [25,26,27,28]. Radiomics plays an important role in medical image analysis, especially in cancer and cardiac imaging [29,30,31,32,33,34]. Radiomics extracts quantitative characteristics from medical images and probed to hold significant promise in predicting factors, such as lesion malignancy, prognosis, and treatment response prediction (e.g., tumor response and recurrence risks) [35]. Radiomic biomarkers can be associated with clinical observations [35] and can be utilized to estimate survival rates [36].

In a study conducted by Baessler et al. [37], the ability of texture analysis of CMRI images to distinguish between myocardial problems and a healthy heart was investigated in non-contrast cine images. Also, Cetin et al. [38] demonstrated the ability of radiomics of CMRI images to detect myocardial structural changes in cases with hypertension but apparently healthy. Additionally, the use of artificial intelligence in cardiovascular imaging is expanding quickly [24]. Artificial intelligence has the ability to reduce human errors and facilitate the identification and prediction of diseases and the decision-making process [24]. Numerous studies highlighted the effectiveness of radiomics in identifying changes in myocardial tissue in patients with hypertrophic cardiomyopathy and detecting cardiovascular changes in hypertensive patients with healthy hearts using CMRI [38, 39]. However, echocardiography has certain advantages over CMRI, including its availability, outpatient nature, non-invasiveness, and portability [24]. Therefore, it is crucial to empower echocardiography in identifying cardiovascular problems using radiomic features, as it is capable of offering the abovementioned benefits.

The main objective of this study is to utilize echocardiography radiomic features and ML algorithms to identify individuals at risk of developing cardiac issues in the near future due to excessive iron accumulation in their hearts. These individuals have been categorized based on CMRI T2* results.

Materials and Methods

Study Design and Dataset

The framework of the study is shown in Fig. 1. In the following, each part of the workflow will be elaborated.

Ninety-one patients with an age range of 15 to 50 years (31 ± 7.56 years) were enrolled in this study. Forty-four patients with thalassemia major (age: 30.40 ± 7.05 years) whose heart condition was examined and followed up at a maximum interval of 6 months by echocardiography and CMRI were included in the study. Patients had normal echoes (LVEF > 55%) and T2* ≤ 20 ms in CMRI. In addition, 47 people (age: 33.23 ± 7.75 years) and conditions as thalassemia patients who had an echo and CMRI tests conducted at a maximum interval of 6 months and those with LVEF > 55% and T2* > 20 ms were entered into the study as the control group. Table 1 shows the characteristics of the population investigated, and in Fig. 1S, the CMRI image of two cases, including one T2* > 20 and one T2* ≤ 20, is presented. Patients with valvular heart problems, congenital heart diseases, infectious diseases, hypertension, liver failure, diabetes, and kidney diseases, as well as patients who used drugs that changed myocardial function, were excluded from the study.

Table 1 Demographic data of the studied patient population

Full size table

Image Acquisition

Two-dimensional M-mode and Doppler echocardiography (pulsed-wave Doppler, continuous-wave Doppler, colored Doppler) and tissue Doppler echocardiography were performed using the echocardiography system by the transthoracic method in supine. Left lateral decubitus positions for all subjects were performed to obtain a 4-chamber view. Within less than 6 months, patients with thalassemia major and those selected as the control group were subjected to CMRI examinations. MRI examinations were performed using a 1.5 Tesla scanner (Avanto-Siemens). To evaluate cardiac T2*, a black-blood multi-gradient echo sequence was obtained in short axis view with 8 echo times (TE) and 10 mm slice thickness. In addition, fast spin echo sequences were routinely performed to examine the morphology of the heart. Myocardial function was evaluated using cine CMRI protocols such as steady-state free precession (SSFP) with a slice thickness of 8 mm and a gap of 2 mm in the short and long axis (2 chambers, 3 chambers, and 4 chambers).

Preprocessing and Segmentation

After obtaining the echo images in the video, the frames of the end-systolic (ES) and end-diastolic (ED), according to the patient’s electrocardiogram, were extracted from all the frames related to the 4-chamber view. Furthermore, filtration was also done on them due to the low quality and high Speckle noise. The linear statistical filter DsFlsmv, which had the best performance and had the most negligible impact on the radiomic analysis [40], was applied to the images to remove the noise. Finally, the filtered images (two frames of ES and ED) were used for segmentation. The ventricular septum in filtered echo images was segmented manually by an experienced echocardiographer and edited/verified by a specialist using LIFEx v7.2.0 [41] software.

Feature Extraction

In order to extract radiomic features in LIFEx v7.2.0 software [41], which is compliant with the image biomarker standardization initiative (IBSI) were used [42, 43]. All echo images, including ED and ES images, were resampled (in two-dimensional space with 1 mm intervals) and intensities were quantized to 64 fixed bin number discretized gray levels. Because image intensities range between 0 and 255 values and are constant in all images, this value equals 4 fixed bin size gray levels, which results in 64 ray levels. Finally, 54 features were extracted from each ES and ED image using LIFEx v7.2.0 software. The features included different types of first order (n = 21), shape (n = 1), and texture ((n = 32), GLCM = 7, GLRLM = 11, NGLDM = 3, GLZLM = 11). After extracting the features and classifying the data into two classes (class 1 representing the patients and class 0 representing the control group), the data were divided into two categories: the train (75%, 36 class 0 and 33 class 1) and the test (25%, 11 class 0 and 11 class 1) using stratification.

Feature Selection

Before FS and the implementation of ML algorithms, some preprocessing on the train dataset was implemented. The radiomic features with zero variance were removed, and correlation between features was investigated using Spearman’s statistical analysis. One of the two features that had an absolute correlation coefficient > 0.90 with each other was removed. Then, feature standardization was done using Z-score. In more detail, the training dataset was standardized, and the derived mean and standard deviation (SD) were applied to the test dataset. Analysis of variance (ANOVA), maximum relevance-minimum redundancy (MRMR), and recursive feature elimination (RFE) feature selection methods were used to select the features. Random forest is used as the core of the RFE feature selection approach because it is flexible, resistant to overfitting, computationally efficient, and produces feature importance scores [44].

Machine Learning

After selecting a certain number of features as input to the models, various types of models such as K-nearest neighbors (KNN), logistic regression (LR), multi-layer perceptron (MLP), random forest (RF), support vector machine (SVM), and eXtreme gradient boosting (XGB) were applied to the data. The architecture of MLP can be found in the supplementary under the heading “The Multi-Layer Perceptron (MLP) architecture”. The data were divided into three categories to feed the ML algorithms: first, only the radiomic features extracted from ED images; second, the radiomic features extracted from ES images; and finally, the combination of the features of these images (ED and ES). Considering 3 separate datasets, 3 FS methods, and 6 different classifiers, a total of 54 models were implemented in this study. GridSearch was used to optimize hyperparameters in the training dataset with 10-fold cross-validation. The best hyperparameters were chosen for the trained model, which was then applied to the test dataset using 1000 bootstraps.

Model’s Evaluation

Various metrics including the area under the receiver operating characteristic (ROC) curve (AUC), accuracy, specificity, and sensitivity were used to evaluate the performance of the models. In addition, all model’s AUCs were compared using the DeLong test [45]. The DeLong test is designed to compare the ROC curves of two models. It is used to examine whether the difference in AUC between two models is statistically significant, which shows one is more accurate or dependable in a certain situation. The test considers the data’s paired nature and produces a P value to estimate the significance of the observed difference [45]. P values under 0.05 were regarded as statistically significant. Furthermore, feature maps of four of the most selected features with the highest scores across the three different FS methods were designed for two cases, including a case from the control group (T2* > 20 ms) and a case from the group of patients prone to develop iron overload (T2* ≤ 20 ms). The Pyradiomics version 3.0.1 [42] was used to generate the feature maps, with a kernel radius of 2, the initial value of the feature maps of 0. Moreover, the convolution operation was executed on batches of 10,000 voxels, utilizing the voxelBatch parameter. FS, classification, and statistical test were performed in R version 4.0 [46] using “mlr” [47], “ggplot2” [48], “caret” [49], and “praznik” [50] libraries.

Results

Selected Features

Three feature selection methods including ANOVA, MRMR, and RFE were applied on the three datasets (ED, ES, ED + ES), and 10 features were selected in all the datasets for the ANOVA and MRMR methods. Moreover, for RFE, 12, 11, and 8 features were selected in ED, ES, and ED&ES datasets, respectively. Selected features with their scores for ANOVA and MRMR, the linear diagram, and selected features related to the RFE are presented in Figs. 2, 3, 2S, and Table 2, respectively. Also, the distribution of selected features in different modes is plotted as a pie chart in Fig. 4. According to the pie charts, the first-order features had the largest share in different modes, and the shape and NGLDM had the least share.

Table 2 The selected features using RFE in different datasets

Full size table

Models’ Performance

The results of all the models used using six classifiers (KNN, LR, MLP, RF, SVM, and XGB) and three FS (ANOVA, MRMR, and RFE) on three separate datasets (ED, ES, and ED&ES) are shown in Fig. 5. In addition, the performance of the models is evaluated using different criteria of AUC, ACC, SEN, and SPE. The best models in ED, ES, and ED&ES datasets were MRMR-XGB (AUC = 0.73, ACC = 0.73, SPE = 0.73, SEN = 0.73), ANOVA-MLP (AUC = 0.69, ACC = 0.69, SPE = 0.56, SEN = 0.83), and RFE-KNN (AUC = 0.65, ACC = 0.65, SPE = 0.64, SEN = 0.65), respectively. Models with the best performance are listed in Table 3. In addition, the adjusted range for the hyperparameters of the different models used in this study is given in Table 1S in the supplementary section. At last, a feature map comparing a normal and thalassemia case in ED and ES datasets in four features among the best and most selected features according to the FS methods evaluated in this study was generated (Fig. 6).

Table 3 The best machine learning models

Full size table

DeLong Test

Figure 7 presents the DeLong test findings. A total of 54 different models were implemented in this study, and the AUC of all these models was compared with each other using the DeLong test. The outcomes were categorized as non-significant and statistically significant (substantially lower or higher). ED-MRMR-XGB, ES-ANOVA-MLP, and ES-RFE-KNN models were among the best models of this study. They had 24, 21, and 16 statistically higher q values compared to other models, respectively.

Discussion

Despite recent developments, heart failure resulting from iron deposition in patients with thalassemia major is still the most serious and the leading cause of death [18]. Furthermore, patients with thalassemia major may not have any symptoms, which can delay the early diagnosis of myocardial dysfunction and put successful reverse disease conditions at risk [10]. Iron overload identification methods such as serum ferritin and liver and heart biopsy have limitations and cannot be used as a reliable and accurate method to evaluate myocardial iron concentration [15]. T2*CMRI is an outstanding and non-invasive diagnostic technique in identifying cardiac iron content [10], although obstacles such as being expensive and time-consuming, not being generally available in all medical centers, and the presence of contraindications for MRI prevent its widespread use [18]. Echocardiography can also evaluate heart failure resulting from iron deposition. Availability, outpatient, and portable are the advantages of echo; nevertheless, the interpretation of this method highly depends on the user’s knowledge and experience [24]. In this study, the CMRI findings are utilized to categorize participants into two groups: normal and prone to thalassemia. It should be emphasized that while echo results were comparable (LEFV > 55%) in all patients, those with T2* ≤ 20 ms were more likely to experience future cardiac issues. In order to categorize patients and determine who is most likely to experience cardiac difficulties owing to iron overload in the future, radiomic features of echo images were used. As was already noted, early identification of these individuals can have a crucial role in the treatment process and reducing the mortality rate. To the best of our knowledge, this study is the first attempt to identify cardiac problems caused by iron overload using radiomic features extracted from echo images and ML based on T2* values obtained from MRI. In previous studies [6, 8,9,10, 17, 18, 51], statistical tests and software determined the correlation between T2* and echo parameters.

According to Fig. 4, first-order, GLRLM, and GLZLM were the most frequent features in three FS methods. In detail, in the ANOVA method, first-order (50%) and GLZLM (20%) features; in the MRMR method, the first-order (47%) and GLRLM (23%) features; and in the RFE method, the first-order (36%) and GLRLM (23%) features were the most frequent. Shape features were not selected in any FS methods because these features have no relationship with the amount of iron deposition and the amount of T2* in the septal area. Among the features selected by the ANOVA method, the highest scores in ED, ES, and ED&ES datasets belonged to the first quartile of discretized (DISCRETIZED_Q1), GLRLM_High Gray-Level Run Emphasis (HGRE), and GLCM_Energy features, respectively (Fig. 2). Joint Energy as one of the GLCM features evaluates the homogeneity patterns in the myocardium [31] and HGRE as GLRLM feature determines the distribution of the higher gray level values [43]. An effective substitute for the coefficient of variance is the quartile coefficient of dispersion. The first quartile (Q1) also evaluates the distribution of gray level values [43]. Then, among the features selected by the MRMR method, the highest scores in ED, ES, and ED&ES datasets belonged to the GLCM_Correlation, GLCM_Dissimilarity, and GLCM_Dissimilarity features, respectively (Fig. 3). Correlation as GLCM feature measures the linear dependency of gray levels and dissimilarity shows the local intensity variation [42]. Although, as in CMRI images, iron deposits cause the myocardium containing iron overload to have a lower signal and intensity compared to normal tissue [11], in echocardiography images, iron accumulation leads to the heterogeneous distribution of the intensity of gray levels. These radiomic features will help identify patients without any heart failure.

Barzin et al. [18] stated that all diastolic functional indicators, except for early (E) and late (A) transmitral peak flow velocity ratio (E⁄A), exhibit a notable relationship with T2*. In our research, the radiomic features showed that diastolic indices are related to the T2* parameter. Meanwhile, in the study of Aypar et al. [10], diastolic dysfunction was seen locally in the septal wall in patients with thalassemia major. In our study, the radiomics from diastolic were obtained from the segmentation area (septum) and had the highest score and importance in the ANOVA method.

Model explainability seeks to identify a distinctive set of biomarkers, known as a signature, to potentially predict a clinical outcome, such as a diagnosis, prognosis, or response to treatment. In the realm of radiomics, intriguing research has been carried out recently. However, there is a lack of emphasis on developing explainable models. The essence of explainable models lies in their ability to gain approval and trust from physicians in clinical setting. When a model is developed, it becomes crucial to demonstrate to physicians that it is not just a black-box computerized system. By providing explanations for the model’s outcomes, it can foster confidence and encourage the utilization of these models in practice [39].

The important issue is that many of the developed models are not easily interpretable. Physicians and clinicians cannot easily understand and subsequently trust them because of their black-box nature [39]. In Fig. 6, the four features that had the highest scores and the most selections among different FS methods were visualized by voxel-wise feature extraction for different classes. Entropy, from first-order features, provides randomness of the intensity distribution in the region of interest (ROI). A lower entropy value denotes a more uniform distribution, while a greater value reflects a more heterogeneous intensity [43, 52]. Dissimilarity, a GLCM-derived feature, measures the difference between adjacent pixel intensities, revealing changes in intensity values and indicating texture edges or sharp transitions. Higher dissimilarity values indicate greater contrast and variation, while lower values suggest more uniformity [43, 52]. HGRE, a GLRLM-derived feature, shows the image frequency and length of runs of consecutive pixel values. It measures the importance or weighting of the image’s longer runs with higher gray-level values and emphasizes the dominance or prevalence of runs with high values for the gray level [43, 52]. Small Zone High Gray-Level Emphasis (SZHGE), a GLZLM-derived feature, evaluates the significance or prioritization given to smaller zones containing higher gray-level values within the image. It offers insights into these small zones’ frequency and dominance, characterized by elevated gray-level values [43, 52]. The mean value of entropy and GLCM dissimilarity was higher, and the mean value of GLRLM_HGRE and GLZLM_SZHGE was lower in the control group both in ES and ED datasets. Our hypothesis regarding the former two features is that the formation of iron overload may have reduced the amount of dissimilarity and randomness of the intensity, which caused these values to be lower in patients compared to the control group. In terms of the latter two features, it can be hypothetically related to the points where iron overload is developing.

In the ED dataset, the MRMR-XGB model achieved the best result. In the ES dataset, the top models were MLP using ANOVA and KNN using RFE. In the ED&ES dataset, RFE-KNN had the best result. The results of the models in the ED dataset are superior to those of each set of features. The explanation is that the motion of the heart is the lowest in the mid-to-end-diastolic phase; this probably causes the distortion of the features to occur less and get a better result.

According to our findings, using radiomics extracted from echo images, it is possible to classify individuals which are labeled according to CMRI T2*. Meanwhile, the subjects examined in this study had normal results in terms of LVEF, and no dysfunction was evident. In other words, based on image analysis, echo radiomic features are related to the T2* value. While in conventional echocardiography studies, Moussavi et al. [21] found no remarkable association between T2*MRI and echocardiographic results. Vogel et al. [9] stated that the sensitivity of tissue doppler echocardiography in detecting abnormal iron load is 88%, and its specificity is 65%. In contrast, 73% of sensitivity and 73% of specificity in the MRMR-XGB model and 83% sensitivity and 56% specificity in the ANOVA-MLP model were achieved in our study. Aypar et al. [10] also concluded that when the mid-septal Sm ≤ 5.7 cm⁄s, the tissue Doppler echocardiography sensitivity is 63%, and the specificity is 83%, and when the mid-septal Em ≤ 12.1 cm⁄s, the sensitivity is 75%, and the specificity is 75%. Djer et al. [17] claimed no significant relationship exists between T2* and left ventricular systolic indices. While in our study ANOVA-MLP among the models applied on the ES dataset (AUC: 0.69, SPE: 0.56, SEN: 0.83, and ACC: 0.69) had the best performance in diagnosis of cardiac problems caused by iron overload. ANOVA-MLP is considered among the top three models. Since systolic dysfunction occurs late in the disease process, this finding can be significant.

Our study emphasizes the high ability of radiomics in the early detection of cardiomyopathy resulting from iron deposition in conditions where LVEF is preserved. Therefore, the presented findings could potentially help physicians make decisions regarding heart failure caused by iron deposition using echo images. In such a way, physicians can successfully reverse the condition of cardiomyopathy and prevent the progression of the disease with early diagnosis. Furthermore, since echocardiography has a lower cost than a method like MRI and is available in most centers, this method is cost-effective in evaluating heart failure in patients with thalassemia. In addition, echocardiography is non-invasive as well as portable.

This study had some limitations. First, we have a small sample size as we select patients with echo and CMRI studies in short time intervals with a max 6-month duration; a larger sample in future studies would be of more value. In this study, data were collected from one center. To ensure the models’ generalizability, collecting data from different centers and evaluating model performance across different centers is necessary. As RFE only benefited from the RF model, it is possible that the features selected may not be the most optimal choice for other classifiers.

Conclusion

The ED-MRMR-XGB has presented promising and acceptable results among ML algorithms. According to the results of the echo images, the individuals in the study had the same conditions (LVEF > 55%), but they were different based on the CMRI results, which were labeled accordingly, and then using the radiomic features extracted from the echo images and ML approaches were classified. Although the results of LVEF from echo were similar, by using the radiomic features extracted from these images, our models obtained promising results, which indicate that with non-invasive, inexpensive, portable, and highly accessible echocardiography, it is possible to identify who is prone to suffer from heart problems caused by iron overload in the near future and this can lead to a proper treatment to prevent cardiac problems and death of these patients. Therefore, early diagnosis of heart failure, even before the appearance of symptoms, has been made possible by radiomics of echo images and ML.

Availability of Data and Material

Not applicable.

Code Availability

LIFEx v7.2.0 and Python libraries used in this study.

Abbreviations

CMRI:: Cardiac magnetic resonance imaging
ML:: Machine learning
LVEF:: Left ventricular ejection fraction
ES:: End-systolic
ED:: End-diastolic
FS:: Feature selection
AUC:: Area under the ROC curve
SEN:: Sensitivity
SPE:: Specificity
ACC:: Accuracy
SD:: Standard deviation
SSFP:: Steady-state free precession
IBSI:: Image biomarker standardization initiative
ANOVA:: Analysis of variance
MRMR:: Maximum relevance-minimum redundancy
RFE:: Recursive feature elimination
KNN:: K-nearest neighbors
LR:: Logistic regression
MLP:: Multi-layer perceptron
RF:: Random forest
SVM:: Support vector machine
XGB:: eXtreme gradient boosting

References

Aydinok Y: Thalassemia. Hematol 17:s28-s31, 2012
CAS Google Scholar
Muncie Jr HL, Campbell JS: Alpha and beta thalassemia. Am Fam Physician 80:339-344, 2009
PubMed Google Scholar
Said Othman KM, Elshazly SA, Heiba NM: Role of non-invasive assessment in prediction of preclinical cardiac affection in multi-transfused thalassaemia major patients. Hematol 19:380-387, 2014
CAS Google Scholar
Abtahi F, Abdi A, Jamshidi S, Karimi M, Babaei-Beigi MA, Attar A: Global longitudinal strain as an Indicator of cardiac Iron overload in thalassemia patients. J Cardiovasc Ultrasound 17:24, 2019
Google Scholar
Galanello R, Origa R: Beta-thalassemia. Orphanet J Rare Dis 5:1-15, 2010
Google Scholar
Liguori C, et al.: Relationship between myocardial T2* values and cardiac volumetric and functional parameters in β-thalassemia patients evaluated by cardiac magnetic resonance in association with serum ferritin levels. Eur J Radiol 82:e441-e447, 2013
PubMed Google Scholar
BORGNA‐PIGNATTI C, et al.: Survival and complications in thalassemia. Ann N Y Acad Sci 1054:40-47, 2005
CAS PubMed Google Scholar
Saravi M, Tamadoni A, Jalalian R, Mahmoodi–Nesheli H, Hojati M, Ramezani S: Evaluation of tissue doppler echocardiography and T2* magnetic resonance imaging in iron load of patients with thalassemia major. Caspian J Intern Med 4:692, 2013
PubMed PubMed Central Google Scholar
Vogel M, Anderson L, Holden S, Deanfield J, Pennell D, Walker J: Tissue Doppler echocardiography in patients with thalassaemia detects early myocardial dysfunction related to myocardial iron overload. Eur Heart J 24:113-119, 2003
CAS PubMed Google Scholar
Aypar E, Alehan D, Hazırolan T, Gümrük F: The efficacy of tissue Doppler imaging in predicting myocardial iron load in patients with beta-thalassemia major: correlation with T2* cardiovascular magnetic resonance. Int J Cardiovasc Imaging 26:413-421, 2010
PubMed Google Scholar
Fernandes JL: MRI for iron overload in thalassemia. Hematol Oncol Clin North Am 32:277-295, 2018
Wahidiyat PA, Liauw F, Sekarsari D, Putriasih SA, Berdoukas V, Pennell DJ: Evaluation of cardiac and hepatic iron overload in thalassemia major patients with T2* magnetic resonance imaging. Hematol 22:501-507, 2017
CAS Google Scholar
Khaled A, Ezzat DA, Salem HA, Seif HM, Rabee H: Effective method of evaluating myocardial iron concentration in pediatric patients with thalassemia major. J Blood Med 10:227, 2019
CAS PubMed PubMed Central Google Scholar
Ouederni M, et al.: Myocardial and liver iron overload, assessed using T2* magnetic resonance imaging with an excel spreadsheet for post processing in Tunisian thalassemia major patients. Ann Hematol 96:133-139, 2017
CAS PubMed Google Scholar
Farhangi H, Badiei Z, Moghaddam HM, Keramati MR: Assessment of heart and liver iron overload in thalassemia major patients using T2* magnetic resonance imaging. Indian J Hematol Blood Transfus 33:228-234, 2017
PubMed Google Scholar
Chaosuwannakit N, Makarawate P: The value of magnetic resonance imaging in evaluation of myocardial and liver iron overload in a thalassaemia endemic population: a report from Northeastern Thailand. Pol J Radiol 84:e262, 2019
PubMed PubMed Central Google Scholar
Djer MM, Anggriawan SL, Gatot D, Amalia P, Sastroasmoro S, Widjaja P: Correlation between T2* cardiovascular magnetic resonance with left ventricular function and mass in adolescent and adult major thalassemia patients with iron overload. Acta Med Indones 45:295-301, 2013
PubMed Google Scholar
Barzin M, Kowsarian M, Akhlaghpoor S, Jalalian R, Taremi M: Correlation of cardiac MRI T2* with echocardiography in thalassemia major. Eur Rev Med Pharmacol Sci 16:254-260, 2012
CAS PubMed Google Scholar
Wood JC, et al.: Cardiac iron determines cardiac T2*, T2, and T1 in the gerbil model of iron cardiomyopathy. Circ 112:535-543, 2005
CAS Google Scholar
Kirk P, et al.: Cardiac T2* magnetic resonance for prediction of cardiac complications in thalassemia major. Circ 120:1961-1968, 2009
CAS Google Scholar
Moussavi F, et al.: Optimal method for early detection of cardiac disorders in thalassemia major patients: magnetic resonance imaging or echocardiography? Blood Res 49:182-186, 2014
PubMed PubMed Central Google Scholar
Wood JC, Noetzli L: Cardiovascular MRI in thalassemia major. Ann N Y Acad Sci 1202:173-179, 2010
PubMed Google Scholar
Shehata SM, Amin MI, El Sayed HZ: MRI evaluation of hepatic and cardiac iron burden in pediatric thalassemia major patients: spectrum of findings by T2. Egypt J Radiol Nucl Med 50:1-9, 2019
Google Scholar
Lim LJ, Tison GH, Delling FN: Artificial intelligence in cardiovascular imaging. Methodist Debakey Cardiovasc J 16:138, 2020
PubMed PubMed Central Google Scholar
Kumar V, et al.: Radiomics: the process and the challenges. Magn Reson Imaging 30:1234-1248, 2012
PubMed PubMed Central Google Scholar
Shiri I, et al.: Impact of feature harmonization on radiogenomics analysis: Prediction of EGFR and KRAS mutations from non-small cell lung cancer PET/CT images. Comput Biol Med 142:105230, 2022
CAS PubMed Google Scholar
Manafi-Farid R, et al.: [(18)F]FDG-PET/CT radiomics and artificial intelligence in lung cancer: technical aspects and potential clinical applications. Semin Nucl Med 52:759-780, 2022
PubMed Google Scholar
Shiri I, et al.: High-dimensional multinomial multiclass severity scoring of COVID-19 pneumonia using CT radiomics features and machine learning algorithms. Sci Rep 12:14817, 2022
CAS PubMed PubMed Central Google Scholar
Lambin P, et al.: Radiomics: the bridge between medical imaging and personalized medicine. Rev Clin Oncol 14:749-762, 2017
Google Scholar
Yu F, Huang H, Yu Q, Ma Y, Zhang Q, Zhang B: Artificial intelligence-based myocardial texture analysis in etiological differentiation of left ventricular hypertrophy. Ann Transl Med 9:108, 2021
Avard E, et al.: Non-contrast cine cardiac magnetic resonance image radiomics features and machine learning algorithms for myocardial infarction detection. Comput Biol Med 141:105145, 2022
PubMed Google Scholar
Sabouri M, et al.: Myocardial perfusion SPECT imaging radiomic features and machine learning algorithms for cardiac contractile pattern recognition. J Digit Imaging. 36:1–13, 2022
Arian F, et al.: Myocardial function prediction after coronary artery bypass grafting using mri radiomic features and machine learning algorithms. J Digit Imaging 35:1708-1718, 2022
PubMed PubMed Central Google Scholar
Mohebi M, et al.: Post-revascularization ejection fraction prediction for patients undergoing percutaneous coronary intervention based on myocardial perfusion SPECT imaging radiomics: a preliminary machine learning study. J Digit Imaging 1–16, 2023
Militello C, et al.: 3D DCE-MRI radiomic analysis for malignant lesion prediction in breast cancer patients. Acad Radiol 29:830-840, 2022
PubMed Google Scholar
Park H, et al.: Radiomics Signature on Magnetic Resonance Imaging: Association with Disease-Free Survival in Patients with Invasive Breast Cancer. Clin Cancer Res 24:4705-4714, 2018
PubMed Google Scholar
Baeßler B, Mannil M, Maintz D, Alkadhi H, Manka R: Texture analysis and machine learning of non-contrast T1-weighted MR images in patients with hypertrophic cardiomyopathy—preliminary results. Eur J Radiol 102:61-67, 2018
PubMed Google Scholar
Cetin I, Petersen SE, Napel S, Camara O, Ballester MAG, Lekadir K: A radiomics approach to analyze cardiac alterations in hypertension. Proc. 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019). pp. 640-643. IEEE, 2019.
Militello C, Prinzi F, Sollami G, Rundo L, La Grutta L, Vitabile S: CT radiomic features and clinical biomarkers for predicting coronary artery disease. Cognit Comput. 15:238-253, 2023
Google Scholar
Loizou CP, Pattichis CS: Despeckle Filtering Algorithms. In: Despeckle Filtering Algorithms and Software for Ultrasound Imaging. Synthesis Lectures on Algorithms and Software in Engineering. Cham: Springer International Publishing, 2008
Google Scholar
Nioche C, et al.: LIFEx: a freeware for radiomic feature calculation in multimodality imaging to accelerate advances in the characterization of tumor heterogeneity. Cancer Res 78:4786-4789, 2018
CAS PubMed Google Scholar
Van Griethuysen JJ, et al.: Computational radiomics system to decode the radiographic phenotype. Cancer Res 77:e104-e107, 2017
PubMed PubMed Central Google Scholar
Zwanenburg A, et al.: The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295:328-338, 2020
PubMed Google Scholar
Kuhn M, Johnson K: Feature engineering and selection: a practical approach for predictive models: Chapman and Hall/CRC, 2019
DeLong ER, DeLong DM, Clarke-Pearson DL: Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44:837–845, 1988
Team RDC: R: a language and environment for statistical computing. (No Title), 2010
Bischl B, et al.: mlr: Machine learning in R. J Mach Learn Res 17:5938-5942, 2016
Google Scholar
Wickham H, Wickham H: Data analysis: Springer, 2016
Kuhn M: Building predictive models in R using the caret package. J Stat Softw 28:1-26, 2008
Google Scholar
Kursa MB: Praznik: High performance information-based feature selection. SoftwareX 16:100819, 2021
Google Scholar
Liguori C, et al.: Magnetic resonance comparison of left-right heart volumetric and functional parameters in thalassemia major and thalassemia intermedia patients. Biomed Res Int 2015:857642, 2015
Depeursinge A, et al.: Standardised convolutional filtering for radiomics. arXiv preprint arXiv:200605470, 2020

Download references

Funding

Open access funding provided by University of Geneva. This work was supported by Iran University of Medical Sciences and Rajaie Cardiovascular Medical and Research Center under grant number IR.IUMS.FMD.REC.1400.139.

Author information

Authors and Affiliations

Department of Medical Physics, Iran University of Medical Sciences, Tehran, Iran
Haniyeh Taleie & Maziar Sabouri
Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH‑1211, Geneva 4, Switzerland
Ghasem Hajianfar, Habib Zaidi & Isaac Shiri
Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
Maziar Sabouri & Golnaz Houshmand
Echocardiography Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
Mozhgan Parsaee & Ahmad Bitarafan-Rajabi
Cardiovascular Interventional Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
Ahmad Bitarafan-Rajabi
Geneva University Neurocenter, University of Geneva, Geneva, Switzerland
Habib Zaidi
Department of Nuclear Medicine and Molecular Imaging, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
Habib Zaidi
Department of Nuclear Medicine, University of Southern Denmark, Odense, Denmark
Habib Zaidi
Department of Cardiology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
Isaac Shiri

Authors

Haniyeh Taleie
View author publications
You can also search for this author in PubMed Google Scholar
Ghasem Hajianfar
View author publications
You can also search for this author in PubMed Google Scholar
Maziar Sabouri
View author publications
You can also search for this author in PubMed Google Scholar
Mozhgan Parsaee
View author publications
You can also search for this author in PubMed Google Scholar
Golnaz Houshmand
View author publications
You can also search for this author in PubMed Google Scholar
Ahmad Bitarafan-Rajabi
View author publications
You can also search for this author in PubMed Google Scholar
Habib Zaidi
View author publications
You can also search for this author in PubMed Google Scholar
Isaac Shiri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ahmad Bitarafan-Rajabi, Habib Zaidi or Isaac Shiri.

Ethics declarations

Ethics Approval

This retrospective study was approved by the ethics committee of Iran University of Medical Sciences (IR.IUMS.FMD.REC.1400.139).

Consent to Participate

Informed consent was waived by ethics groups.

Consent for Publication

Informed consent was waived by ethics groups.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 2040 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Taleie, H., Hajianfar, G., Sabouri, M. et al. Left Ventricular Myocardial Dysfunction Evaluation in Thalassemia Patients Using Echocardiographic Radiomic Features and Machine Learning Algorithms. J Digit Imaging 36, 2494–2506 (2023). https://doi.org/10.1007/s10278-023-00891-0

Download citation

Received: 27 February 2023
Revised: 23 July 2023
Accepted: 25 July 2023
Published: 21 September 2023
Issue Date: December 2023
DOI: https://doi.org/10.1007/s10278-023-00891-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Left Ventricular Myocardial Dysfunction Evaluation in Thalassemia Patients Using Echocardiographic Radiomic Features and Machine Learning Algorithms

Abstract

Similar content being viewed by others

Left ventricular global function index is associated with myocardial iron overload and heart failure in thalassemia major patients

How does iron deposition modify the myocardium? A feature-tracking cardiac magnetic resonance study

Four‑Dimensional Echocardiographic Evaluation of Cardiac Iron Overload in Patients with Beta-Thalassemia Major

Introduction

Materials and Methods

Study Design and Dataset

Image Acquisition

Preprocessing and Segmentation

Feature Extraction

Feature Selection

Machine Learning

Model’s Evaluation

Results

Selected Features

Models’ Performance

DeLong Test

Discussion

Conclusion

Availability of Data and Material

Code Availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Ethics Approval

Consent to Participate

Consent for Publication

Conflict of Interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 2040 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation