Next Article in Journal
A Sequalae of Lineage Divergence in Staphylococcus aureus from Community-Acquired Patterns in Youth to Hospital-Associated Profiles in Seniors Implied Age-Specific Host-Selection from a Common Ancestor
Next Article in Special Issue
Functional PET Neuroimaging in Consciousness Evaluation: Study Protocol
Previous Article in Journal
Assessment of the Diagnostic Methods of Mizaj in Persian Medicine: A Systematic Review
Previous Article in Special Issue
Accuracy of Dose-Saving Artificial-Intelligence-Based 3D Angiography (3DA) for Grading of Intracranial Artery Stenoses: Preliminary Findings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deep Learning-Based Auto-Segmentation of Spinal Cord Internal Structure of Diffusion Tensor Imaging in Cervical Spondylotic Myelopathy

1
Spinal Division, Orthopedic and Traumatology Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang 524013, China
2
Department of Orthopaedics and Traumatology, The University of Hong Kong, Hong Kong SAR, China
3
Department of Orthopaedics and Traumatology, The University of Hong Kong-Shenzhen Hospital, Shenzhen 518009, China
*
Author to whom correspondence should be addressed.
Diagnostics 2023, 13(5), 817; https://doi.org/10.3390/diagnostics13050817
Submission received: 8 December 2022 / Revised: 14 February 2023 / Accepted: 19 February 2023 / Published: 21 February 2023
(This article belongs to the Special Issue Advancements in Neuroimaging)

Abstract

:
Cervical spondylotic myelopathy (CSM) is a chronic disorder of the spinal cord. ROI-based features on diffusion tensor imaging (DTI) provide additional information about spinal cord status, which would benefit the diagnosis and prognosis of CSM. However, the manual extraction of the DTI-related features on multiple ROIs is time-consuming and laborious. In total, 1159 slices at cervical levels from 89 CSM patients were analyzed, and corresponding fractional anisotropy (FA) maps were calculated. Eight ROIs were drawn, covering both sides of lateral, dorsal, ventral, and gray matter. The UNet model was trained with the proposed heatmap distance loss for auto-segmentation. Mean Dice coefficients on the test dataset for dorsal, lateral, and ventral column and gray matter were 0.69, 0.67, 0.57, 0.54 on the left side and 0.68, 0.67, 0.59, 0.55 on the right side. The ROI-based mean FA value based on segmentation model strongly correlated with the value based on manual drawing. The percentages of the mean absolute error between the two values of multiple ROIs were 0.07, 0.07, 0.11, and 0.08 on the left side and 0.07, 0.1, 0.1, 0.11, and 0.07 on the right side. The proposed segmentation model has the potential to offer a more detailed spinal cord segmentation and would be beneficial for quantifying a more detailed status of the cervical spinal cord.

1. Introduction

Cervical spondylotic myelopathy (CSM) is characterized by chronic spinal degeneration causing structural modifications to the intervertebral discs, ligaments, etc. [1,2]. Magnetic resonance imaging (MRI) is the gold standard for diagnosing cervical spinal cord dysfunction [3]. Conventional MRIs, including T1-weighted and T2-weighted MRIs, are commonly used to obtain morphological information about the spinal cord, such as intramedullary or extramedullary abnormalities, spinal cord compression, disk herniation, etc., for identification of spinal cord injury [4,5,6,7]. Based on signal abnormalities on T2-weighted MRI, the Brain and Spinal Injury Center (BASIC) score is used to classify acute traumatic spinal cord injuries [8]. However, conventional MRI findings and clinical presentation in CSM appear disconnected, and conventional MRI cannot highlight the microstructural spinal cord abnormalities associated with CSM [2,9,10]. Diffusion tensor imaging (DTI) is a type of multi-parametric MRI and is considered a promising imaging technique for studying the ultrastructure of the spinal cord. The DTI parameter, fractional anisotropy (FA), has been used in several studies to investigate the diagnosis and prognosis of cervical spondylotic myelopathy [1,11,12]. To detect detailed neurological deficits in the spinal cord, regions of interest (ROI) can be segmented on the cord to measure the corresponding features in different tracts from the FA map [11,13,14]. A greater number of ROIs within a spinal cord leads to precise information about myelopathy. Numerous studies have demonstrated the feasibility and effectiveness of ROI-based features for providing details on the internal state of the spinal cord [11,13]. Due to its manual voxel selection and user-dependent nature, ROI delineation involves the user drawing ROIs based on perceptions of the location of underlying anatomical structures, such as the gray matter or the corticospinal tract [15]. Hand animation ROIs are prone to inter-rater and intra-rater variability and bias and are not easily transferable to large-scale studies [15]. Consequently, the manual drawing of describing ROI has serious limitations.
Some methods have been proposed for segmenting the entire spinal cord. EI Mendili et al. used T2-weighted MR images as input and employed a dual-threshold-based approach for spinal cord segmentation [16]. De Leener et al. proposed a PropSeg method based on multi-resolution propagation of tubular deformation models [17]. Chen et al. proposed a method to segment the spinal cord using deformable atlases and topological constraints that are robust to noise and artifacts [18]. In all of the above techniques, manual intervention is required, or a large database needs to be established to create the spinal cord segmentation map, and T1-weighted or T2-weighted images are the primary input. Several research studies proposed further segmenting the internal structures, gray matter (GM) and white matter (WM). Antal et al. use variational formulation to automatically detect cerebrospinal fluid, WM, and GM and combine them with shape prior to segment GM and WM [19]. Ferran proposed a two-stage pipeline using the Optimized Patch Match Label fusion (OPAL) method for segmentation of the whole spinal cord and the Similarity and Truth Estimation for Propagated Segmentations (STEPS) for the further extraction of GM and WM [20]. However, these methods can be applied to specific spinal cord levels, limiting their application to other segments. In DTI image analysis, there are some applications of segmentation methods. Marek used the semiautomatic algorithm provided by ITK-SNAP to segment the cervical spine GM and WM [21]. This method takes a T2-weighted image as input and registers the segmentation results with the DTI image. Because of the deviations between T2-weighted images and DTI images, a manual check is still necessary to guarantee the registration result. Richu applied six commonly used automatic thresholding algorithms for segmentation and vote to obtain the final segmentation results of GM and WM [14]. Although this method uses DTI images as input, the segmentation of GM and WM given by this method cannot reveal the details of each white matter column. Therefore, it should be pointed out that the existing methods of spinal cord segmentation based on DTI images provide limited ROIs within the spinal cord.
In recent years, great attention has been paid to deep learning in the medical field. One of the most widely used deep learning networks in medical image segmentation is UNet, which is based on an encoder–decoder architecture [22]. Several deep learning-based models applied UNet or UNet-like structures on spinal cord segmentation. For instance, Xiaoran et al. proposed a UNet-based, fully automatic method on 2D axial-view MRI slices for the whole spinal cord segmentation of patients with CSM [23]. Alhanouf et al. used a pre-trained MobileNet-V3 CNN model as the backbone for feature extraction, which was augmented by a set of up-sampling layers and employing skip connection similar to the UNet architecture used for spinal cord GM segmentation [24]. The deep-learning model achieved better performance on the spinal cord gray matter segmentation challenge dataset compared to Spinal Cord Toolbox (SCT), the Variational Bayesian Expectation Maximization (VBEM) method, and the Gray Matter Segmentation Based on Maximum Entropy (GSBME) method.
This paper proposes a deep learning model based on the UNet architecture for segmenting multiple ROIs within the spinal cord in DTI images. Segmentation results can be obtained simultaneously for both sides’ ventral, lateral, dorsal columns, and gray matter. Specifically, heatmap-distance loss (HDL) is proposed to train with the UNet model to make the model have better performance on the small area of column-based ROI and gray matter. This study hopes to provide a more detailed segmentation than the gray and white matter segmentation of the spinal cord and more details of the internal status of the spinal cord.

2. Materials and Methods

2.1. Study Population

An overall sample size of 89 patients with CSM was recruited for this study. All of the CSM patients are symptomatic. Clinical examination by the Japanese Orthopedic Association (JOA) produced scores of 9.78 ± 3.48, while the healthy JOA score is 17 with motor function of upper and lower limb (8 scores), sensory (6 scores) and sphincter function (3 scores). Inclusion criteria were a clinical diagnosis of CSM without a history of spinal surgery. Those with neurological disorders or prior neurological trauma were excluded. Every participant in this study completed a written informed consent form approved by the institutional review board.
DTI scanning protocol was performed with a Philips 3T Achieva scanner (Philips Medical System, Best, Netherlands). Specifically, the one-shot echoplanar imaging sequence was used. We carried out diffusion encoding in 15 nonlinear and noncoplanar directions with a b value of 600 s/mm2. The scanning parameters are listed as follows: field of view = 80 × 80 mm2, thickness of slices = 7 mm, gap between slices = 2.2 mm, fold-over direction = anteroposterior, reconstruction resolution = 0.63 × 0.63 × 7 mm3, and voxel resolution = 1.0 × 1.26 × 7 mm3, and TTE/TR = 60 ms/5 heartbeats. For the purpose of sup-pressing the fold-over effect, spatial saturation was inverted and spectral presaturation was applied. To minimize the effects of cerebrospinal fluid on cardiac vector cardiograms, cardiac vector cardiograms were triggered.

2.2. Manual Segmentation

The manual delineation of the ROI is an essential step for feature extraction to provide more information about the inside of the spinal cord. FA is one of the most popular parameters in DTI studies to evaluate the microstructural abnormality of the spinal cord [13,14]. Each CSM patient extracted 12 axial slices from three stacks covering the vertebrae between C2 and C7/T1. Spinal Cord Toolbox, Version 2.3 [25] was used to preprocess the DTI images, extract the B0 images, and calculate the FA map. B0 and FA images were used as input for spinal cord ROI auto-segmentation in the following experiments. The auto-segmentation ROIs and manual segmentation ROIs were superimposed on the FA images to extract the mean FA value for each ROI.
An experienced researcher in the cervical spinal cord manually drew ROIs on B0 images supplemented with FA images. The ROIs have covered more specific columns within the entire cord region, as showed in Figure 1. A total of eight regions were identified for each image to obtain detailed information about the spinal cord: lateral column (LC), dorsal column (DC), ventral column (VC), and gray matter (GM) on both sides of the spinal cord. We acquired 96 segmentation masks for each subject (8 ROIs × 12 slices). A total of 8472 segmentation masks have been created for all patients with CSM (8 ROIs × 1059 slices) except for images that are difficult to segment manually. The manual delineated ROIs were defined as the ground truth. A second expert manually segmented these ROIs to enhance confidence in identifying these eight ROIs. Whenever there was a discrepancy, a third expert was consulted until a consensus was reached.

2.3. Segmentation Models

The UNet [22] was introduced for its popularity in medical image segmentation. The model structure is similar to a fully convolutional network, which consists of two parts. The first part consists of layers of convolution and max pooling (known as the encoder). B0 images will be used as input to extract multi-scale features. The second part is mirror-symmetric to the first part (also known as the decoder), consisting of transposed convolution and up-sampling layers for feature expansion of a feature vector of an image of sizes corresponding to the original medical image. Several skip connections between the encoder and decoder were added to further utilize the multi-scale feature of the encoder for feature expansion. Figure 2 demonstrates the detailed parameters of the UNet model.
Generally, the feature extracted from the encoder highly affects the segmentation performance. Two different depth general network structures, VGG-16 [26] and ResNet50 [27], were used as the backbone for feature extraction. Two segmentation tasks corresponded to two models with different backbones for feature extraction, including vgg_9classes for nine classes (plus background) segmentation using VGG16 as the backbone and resnet_9classes for nine classes (plus background) segmentation using ResNet50 as the backbone.
A challenge that needs to be noticed in this study is the small object of manual segmentation. In our dataset, all ROIs are small areas, and the largest ROI only takes about 1/1000 of the whole DTI images from Figure 3. The challenge for model training comes from small-shape annotations that contribute less to the loss than background annotations, which makes the model prone to classify a pixel as background because image segmentation is a pixel-wise classification. Wentao et al. [28] use the Dice loss combined with the focal loss to segment small-volume organs in the Organs-At-Risk delineation problem and achieve good segmentation accuracy, while the Dice loss learns the imbalanced class distribution, and the focal loss trains the model to learn hard segmented annotations for these small-volume organs. Hence, this hybrid loss was selected for the training of two models. The total loss can be formulated as follows:
T P ( i ) = n = 1 N p n ( i ) g n ( i )
F N ( i ) = n = 1 N ( 1 p n ( i ) ) g n ( i )
F P ( i ) = n = 1 N p n ( i ) ( 1 g n ( i ) )
L t o t a l = L D I C E + L F o c a l = C i = 1 C 1 T P ( i ) T P ( i ) + F N ( i ) + F P ( i ) 1 N c = 1 C 1 n = 1 N g n ( i ) ( 1 p n ( i ) ) 2 log ( p n ( i ) )
where TP(i), FN(i), and FP(i) are the true positives rate, false negatives rate, and false positives rate for the classes. p n ( i ) is the predicted output value for the pixel n class i, g n ( i ) is the ground truth label for pixel n class i, and C is the total number of anatomies included in the background. N is the total number of pixels in one slice of a B0 slice image.
However, the performance of the segmentation model using B0 images as input on the test dataset was not satisfactory. Although we tried to use the FA images as input, the segmentation performance was also unsatisfactory. Except for the small object, there is another challenge in this dataset: the difference in pixel values between different classes is minimal, making it difficult to distinguish the boundaries of different ROIs. Based on the above challenges and performance on the test dataset, we believe the Dice loss combined with focal loss is unsuitable for this research.
Inspired by the landmark localization based on two-dimensional (2D) heatmaps [29] and the boundary-weighted map from Qikui’s research [30], we proposed the heatmap-distance loss (HDL). The loss can be formulated as follows:
H D L = i = 1 9 H i ( X ) H i ^ ( X ) 2
H i ^ ( x ) = { 1 ,     x X l a n d m a r k exp ( max ( x X l a n d m a r k ) 2 2 2 σ 2 ) , x X l a n d m a r k
where X denotes the set of pixels in 2D space from any class ground truth mask, and x represents any pixels from X. Let X l a n d m a r k denote the set of landmark points whose pixel value is equal to one. The notation ‖⋅‖2 denotes the L2 or Euclidean distance. H i ^ ( x ) refers to the ground truth heatmap for the ith class of ground truth masks, while H i ( X ) is the predicted heatmap for the ith class. The pixel value of the heatmap H i ^ ( x ) is created based on formulation (6). When the point from X has the same coordinate with the point in X l a n d m a r k , the value of H i ^ ( x ) is equal to one. Otherwise, each point will calculate the L2 or Euclidean distance between the point and each landmark point. The max distance between each pixel and the set of landmark points will be used as input to a Gaussian function with variance σ 2 to generate the value of H i ^ ( x ) . The rationale behind the design is that the small area of segmentation can be expanded to a large area, and the pixels created by Gaussian based on distance could make the small area contribute to the training process. Based on the test dataset performance and the model structure’s complexity, a better encoder between VGG16 and ResNet50 would be used as a backbone trained with the proposed loss.

2.4. Training and Evaluation

We use 81% (857 images) of our dataset for training, 9% (96 images) of our dataset for validation, and 10% (106 images) as a test dataset. The UNet model is trained on the training dataset and validated on the validation dataset. The final performance is exclusively reported on the test set.
The experiments were performed in the Pytorch environment, and we trained the segmentation model on NVIDIA RTX3080Ti. The batch size was set to 16 based on the memory of the GPU and computer. We used the Adam optimizer with a momentum of 0.9 and a cosine learning rate strategy with a learning rate range from 1 × 10−4 to 1 × 10−6; the number of epochs is 500. The Dice loss combined with focal loss and the HDL loss was trained under the same hyperparameters. The threshold definition is an essential step that transforms the model output into segmentation results. The threshold values for different segmentation classes are selected from 0 to 1 with intervals of 0.001 on validation datasets, and the threshold value of the best performance was to evaluate the final performance on the test dataset.
For evaluation of the proposed method, three metrics for each class are used, including the Recall (also known as True Positive Rate), which represents a method’s ability to segment pixels as a proportion of all correctly labeled pixels, the Precision, which measures the ratio of correctly segmented pixels and the Dice coefficient, which is a similarity index between two masks. The following equations define these three metrics: GT refers to ground truth, and PM refers to predict mask. Symbols FN, FP, and TP are explained as follows: true positive (TP) if it was a pixel in GT mask and it was segmented in PM; false positive (FP) if it was not a pixel in GT mask and it was segmented in PM; and finally, false negative (FN) if it was a pixel in GT mask and it was not segmented in PM:
D I C E = 2 | G T   P M | | G T | + | P M |
R e c a l l = T P T P + F N  
P r e c i s i o n = T P T P + F P
To further evaluate the accuracy of the mean FA value from the segmentation model, a metric F A e r r o r i , the percentage of absolute error of FA was utilized to compare the mean value of the ground truth mask superimposing on the DTI metric map (FA map) and the prediction segmentation mask superimposing on the DTI metric map (FA map), which is formulated as follows:
F A e r r o r i = | F A G T i F A P M i | F A G T i
where F A G T i refers to the FA mean value from the ground truth mask of the ith segmentation class and F A P M i refers to the FA mean value from the predicted mask of the ith segmentation class.
The performance of the algorithm was evaluated using basic statistics, such as the mean and standard deviation of F A e r r o r i . Outliers were defined as slices of DTI images that did not obtain the segmentation results from the model. After removing these outliers, the performance was calculated. The author performed an outlier assessment by calculating the percentage of outliers within this dataset.

3. Results

3.1. Encoder Comparison

Figure 4 demonstrates the performance of the UNet with different backbones on the test dataset (B0 as input images). We used the VGG16 and ResNet50 as the encoder for UNet training with a hybrid loss of Dice loss and focal loss. The UNet with VGG16 as the backbone achieves the highest Dice value of 0.65 on the right dorsal column and reaches the lowest Dice value of 0.48 on the left ventral column. Figure 4 demonstrates that the UNet model with VGG16 as the backbone performs better than using ResNet50 as the backbone on all segmentation classes. Therefore, VGG16 will be used as a backbone for UNet trained with the proposed loss function.

3.2. Model Performance

Figure 5 shows the segmentation result of several samples using the proposed model. In these pictures, the line in blue color is by GT and the line in red color is by PM.
Figure 6 demonstrates the performance on test datasets (FA as input images) of UNet training with hybrid loss and HDL loss. The UNet with VGG16 as the backbone achieves the highest Dice value of 0.69 on the left dorsal column and the lowest Dice value of 0.54 on the left gray matter. Figure 6 demonstrates that the UNet model trained with the proposed HDL loss performs better than the hybrid loss on all segmentation classes. Table 1 presents the overall performance of UNet using the proposed HDL loss.
To eliminate the effect of different kinds of images, Figure 7 demonstrates the performance of the UNet model training under different loss functions and using different types of DTI images. UNet model training with the proposed HDL loss using FA as input images achieves the highest performance.
There is a situation in which several images could not obtain the segmentation result from the model defined as the outlier slices. Table 2 illustrates the statistics of performance removing these outlier slices. The Dice coefficients of the left lateral column, ventral column, and gray matter were improved by 1%, 4%, and 3%, respectively. The Dice coefficients of the right lateral column, ventral column, and gray matter were improved by 2%, 4%, and 6%, respectively. The “Outliers” column demonstrated the percentage of DTI slices that cannot obtain output from segmentation.

3.3. Accuracy of FA Prediction

Furthermore, segmentation aims to extract the mean FA value within ROIs from the spinal cord. Therefore, we calculate the mean FA values from auto-segmented and manual-segmented ROIs. A series of higher intra-class correlation coefficients were found in Figure 8, indicating excellent agreement between the FA from segmented ROIs and the FA from ground truth ROIs. Table 3 demonstrates the percentage of the absolute error between the mean FA value from the ground truth mask and the mean FA value from the prediction mask. The percentage of the mean absolute error for lateral, dorsal, ventral, and gray matter was 0.07, 0.07, 0.11 and 0.08 on the left side as well as 0.07, 0.1, 0.1 and 0.07 on the right side, respectively.

4. Discussion

Microstructural spinal cord abnormalities are vital for CSM patients. DTI can reveal the microstructural change in the spinal cord compared to conventional MRI. However, the DTI analysis based on the whole cord region cannot reveal the details of the spinal cord. The subtle abnormality of DTI parameters caused by minor microstructure impairment in a small ROI may be missed due to the whole cord analysis dilution effect. ROI-based DTI analysis could avoid the drawback and enable the quantitative evaluation of specific regions with the spinal cord. This study combines UNet with the proposed heatmap distance loss to segment multiple ROIs within the spinal cord on DTI-related images from CSM patients. Many previous studies in spinal cord segmentation mainly focus on T1-weighted [31,32,33] and T2-weighted images [23,34,35] instead of DTI-related images. The related studies in spinal cord segmentation approaches on DTI images are semi-automatic [14,21] or diffusion-tensor-tracking-based [36] instead of fully automatic. The segmentation region of associated studies is the whole cord [23,31,32,33,35,36,37] or gray matter/white matter [14,19,20,24,38,39,40,41] instead of multiple ROIs within the spinal cord. To the authors’ knowledge, this is the first study using the deep learning method to fully automate and segment the multiple ROIs within the spinal cord on DTI images from CSM patients. We focused on developing an approach to help clinicians provide a more detailed description of spinal status of CSM patients by providing ROI-based features to facilitate ROI-specific analysis of DTI images.
We used two backbone networks to extract the feature within DTI-related images. VGG16 [26] and ResNet50 [27] are well-known convolutional neural networks in many natural-images-related tasks, and these networks achieve good performance on feature extraction. We hypothesize that the higher-level feature would be beneficial for segmentation. Figure 4 illustrates that the hypothesis is wrong. Figure 3 demonstrates that the percentages of pixels for all classes are small than 0.0011. Hence, the advanced features would not be suitable for this task because the convolutional process from a deeper layer would eliminate the efficient information for the training of the segmentation model.
The obstacle to fulfilling the segmentation in this dataset is the ROIs within the spinal cord. Figure 5 demonstrates several samples with the auto-segmentation result. Because the spinal medulla is misshaped in CSM patients, the ROIs contain variable sizes. To deal with the above issue, several loss functions have been proposed. The Dice loss and focal loss are commonly used for segmentation. Wentao [28] used focal loss combined with Dice loss for organ-at-risk (OAR) segmentation, especially for small-volume organ segmentation, achieving better segmentation accuracy. In this research, we use the focal loss combined with Dice loss to train the UNet to fulfill the segmentation of ROIs within the spinal cord. Figure 4 demonstrates that the segmentation performance of sub-ROIs with the spinal cord is not satisfied. The possible reason is that the region of sub-ROIs from the spinal cord is smaller than the volume of OARs in Wentao’s research, and the boundary of these ROIs in DTI-related images is not as clear as in computed tomography (CT) images. We proposed a new heatmap distance loss function inspired by landmark localization. Figure 6 demonstrates that the UNet with the proposed HDL achieves the highest segmentation performance for all classes. The basic principle of HDL is that we manually expand the ground truth area by designing the pseudo-probability, which makes the small area significantly contribute to the model training.
However, the higher performance in Figure 6 was based on the FA images, while the lower performance in Figure 6 was based on the B0 images. Regardless of the image types, Figure 7 illustrates that the UNet model training using HDL performs better than hybrid loss. The B0 images have a wide range, as well as the CT image in Wentao’s research, while the value range of FA images is from 0 to 1. A wide range of values would be beneficial for the convolutional process to extract the high-level feature and the hybrid loss, which could be the reason that VGG_B0_hybird_loss has better performance than VGG_FA_hybird_loss. Figure 7 also illustrates that the pixel value of images has less effect on the model training using proposed HDL loss.
Figure 8 demonstrates the correlation between the mean FA of auto-segmented ROIs and the mean FA of ground truth. The high intra-class correlation on the diagonal position of the correlation matrix demonstrated the high agreement between the mean FA value from auto-segmented ROIs and ground truth ROIs. Several values on the correlation matrix need to be explored. It should be noted that the correlations from several pairs are high, including {L_DC, R_DC}, {R_DC, L_DC}, {L_VC, R_VC}, {R_VC, L_VC}, {L_GM, R_GM}, and {R_GM, L_GM}. This result illustrated that the ROI of Pred_L_DC or Pred_R_VC intersects with GT_L_DC or GT_R_VC, and there is a high agreement between the left and right of DC, VC, and GM, which could be the factor that affects the segmentation performance. Table 3 demonstrates the absolute error percentage between the FA of auto-segmented ROIs and ground truth ROIs on the test dataset. The mean distance between the mean FA value obtained from the segmentation model and the mean FA value obtained from manually segmented ROIs is small, demonstrating that the mean FA value extracted from the segmentation model has the potential for detailed spinal cord status evaluation, such as the mean FA value extracted from the ground truth. The spinal medulla misshaped in CSM patients still significantly affects segmentation performance. However, most of the segmentation result is close to the manual segmentation, and most of the auto-segmentation results are on the related region, which could be why the distance between mean FA values from the auto-segmentation and mean FA values from manual segmentation are small.
Clinically, DTI has three applications. Firstly, it has the ability to identify spinal cord structures and injuries, localization, and diagnosis of CSM. Cui et al. [10]’s research demonstrated that the orientation entropy from DTI analysis is a valuable tool for identifying the pathological level in multilevel CSM patients. Shu-Qiang Wang [42] combined the eigenvalues from DTI and machine learning methods to achieve satisfactory performance in recognizing the levels of CSM. According to Monika Skotarczak [43], the DTI metrics can be used as a biomarker to illustrate the microstructural disorder of the spinal cord not visible on conventional MRI. Secondly, there is the indication of surgery. As demonstrated by Karsten Schöller et al. [44], DTI metrics achieve higher sensitivity in identifying levels requiring decompression surgery than increased MRI signals. Thirdly, there is dynamic spinal function examination and prognosis assessment. For instance, Ellingson et al. [45]’s study demonstrated that DTI has the potential to monitor symptomatic patients and asymptomatic patients, and Chun Yi Wen et al. [46]’s research revealed that FA is a biomarker for surgical outcomes. The auto-segmentation of ROIs with the spinal cord DTI has the potential to reduce inter-rater and intra-rater variability in manually drawn ROIs and help the entire DTI analysis workflow be automated.
There are several limitations. Firstly, there is space for improvement in segmentation performance. Based on the findings from Figure 5, the deep learning model with fewer layers will be explored, which will be regarded as part of the future extension of this research. Secondly, the intersection between the ROI of Pred_L_DC and the ROI of GT_R_DC, the ROI of Pred_R_DC and the ROI of GT_L_DC, the ROI of Pred_L_VC and the ROI of GT_R_VC or the ROI of Pred_R_VC and the ROI of GT_L_VC is a problem that could be regarded as one reason to improve the segmentation performance of DC(L & R) and VC(L&R). A specified preprocessing or postprocessing method needs to be explored in the future. Thirdly, the diffusion tensor for calculation of the DTI metric is reconstructed based on a series of diffusion-weighted images and the b-matrix that integrates the parameters of diffusion-sensitizing gradients. However, many factors influence the accuracy or reliability of the diffusion tensor evaluation, including image noise [47], eddy currents [48], diffusion gradient nonlinearity [47], and others. For example, in the circumstance of low anisotropy, imaging noise could lead to the wrong estimation of eigenvalues, therefore causing an overestimation of DTI metrics (such as FA) due to the incorrect order of eigenvalues [49], and diffusion gradient inhomogeneities would cause distortion of the diffusion tensor’s eigenvalues as well as rotation of the eigenvectors [50]. The diffusion gradient inhomogeneity is an important source of systematic error, and several methods have been proposed to correct the error due to diffusion gradient inhomogeneity [51,52,53,54]. They are either based on a calibration that uses an anisotropic phantom as a reference for estimation of the diffusion tensor, mapping the actual magnetic field, or using the coil system’s manufacturer-provided specifications. The b-matrix spatial distribution in DTI (BSI-DTI) [55] is a frequently employed technique for correcting this kind of inaccuracy and has the capacity to minimize the impact on the assessment of DTI metrics. To reduce the effects of inhomogeneous magnetic field gradients and make the data as accurate as possible, we intend to adopt the BSI-DTI approach in the future. Finally, different types of input images potentially affect the performance of segmentation, and many factors of MRI sequences can affect the DTI images. The MRI scanner involves the measurement of DTI metrics (such as FA) [56]. Except for the MRI scanner, several studies demonstrated that the parameters of MRI sequence, such as the b-value [57], echo time [58], the number of DTI directions [59], and the number of signal acquisitions [60], could influence the diffusion quantification. In order to expand the scope of application of the model, it is necessary to extend the diversity of the DTI image dataset: for instance, acquiring the DTI images of CSM patients with different b-value and different numbers of DTI directions.
In future work, we will consider the continuation path for this research in the following steps. Firstly, we will continue to expand the size of our dataset, as the sample size in this research is insufficient for deep learning training. Secondly, weak-supervised training will be used on the new dataset. A new and helpful segmentation model will be developed using the manual segmentation result in this research. Thirdly, DTI images of CSM patients will be classified into several categories based on the types of CSM compression or the classification result from unsupervised learning. The segmentation model will be trained on the specified cases to verify the algorithm’s effectiveness in the specific category of CSM cases.

5. Conclusions

This paper proposed using UNet and the heatmap distance loss to automatically segment the sub-ROIs within the spinal cord on DTI images from CSM patients. The performance of the segmentation model and the agreement between the mean FA value of auto-segmented ROIs and the mean FA value of manually segmented ROIs demonstrates that the deep learning model has the potential to provide more details of the internal status of the spinal cord.

Author Contributions

Conceptualization, Y.H. and G.L.; methodology, N.F. and G.L.; software, N.F.; validation, J.L., X.W. and X.H.; formal analysis, N.F. and G.L.; investigation, N.F. and Y.H.; resources, Y.H. and G.L.; data curation, N.F.; writing—original draft preparation, N.F. and Y.H.; writing—review and editing, Y.H.; visualization, N.F.; supervision, Y.H.; funding acquisition, Y.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (82072507).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by Institutional Review Board of the University of Hong Kong/Hospital Authority Hong Kong West Cluster ([HKU/HA HKW IRB UW 18-394]).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Because the limits of ethic approval, the original DTI dataset is not opened to public. The post-process data can be asked from the corresponding author by request, which cannot be used for commercial purpose.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wang, K.; Chen, Z.; Zhang, F.; Song, Q.-x.; Hou, C.-l.; Tang, Y.-x.; Wang, J.; Chen, S.; Bian, Y.; Hao, Q.; et al. Evaluation of DTI Parameter Ratios and Diffusion Tensor Tractography Grading in the Diagnosis and Prognosis Prediction of Cervical Spondylotic Myelopathy. Spine 2017, 42, E202–E210. [Google Scholar] [CrossRef] [PubMed]
  2. Shabani, S.; Kaushal, M.; Budde, M.D.; Wang, M.C.; Kurpad, S. Diffusion tensor imaging in cervical spondylotic myelopathy: A review. J. Neurosurg. Spine 2020, 1–8. [Google Scholar] [CrossRef]
  3. Quencer, R.M. Cervical myelopathy: Imaging/clinical correlations and the application of evolving magnetic resonance imaging techniques in cervical spinal cord abnormalities. Spine (Phila Pa 1976) 2001, 26, 2411–2413. [Google Scholar] [CrossRef] [PubMed]
  4. Fehlings, M.G.; Martin, A.R.; Tetreault, L.A.; Aarabi, B.; Anderson, P.; Arnold, P.M.; Brodke, D.; Burns, A.S.; Chiba, K.; Dettori, J.R.; et al. A Clinical Practice Guideline for the Management of Patients With Acute Spinal Cord Injury: Recommendations on the Role of Baseline Magnetic Resonance Imaging in Clinical Decision Making and Outcome Prediction. Glob. Spine J. 2017, 7, 221S–230S. [Google Scholar] [CrossRef] [Green Version]
  5. Fehlings, M.G.; Tetreault, L.A.; Riew, K.D.; Middleton, J.W.; Aarabi, B.; Arnold, P.M.; Brodke, D.S.; Burns, A.S.; Carette, S.; Chen, R.; et al. A Clinical Practice Guideline for the Management of Patients With Degenerative Cervical Myelopathy: Recommendations for Patients With Mild, Moderate, and Severe Disease and Nonmyelopathic Patients With Evidence of Cord Compression. Glob. Spine J. 2017, 7, 70S–83S. [Google Scholar] [CrossRef] [Green Version]
  6. Kurpad, S.; Martin, A.R.; Tetreault, L.A.; Fischer, D.J.; Skelly, A.C.; Mikulis, D.; Flanders, A.; Aarabi, B.; Mroz, T.E.; Tsai, E.C.; et al. Impact of Baseline Magnetic Resonance Imaging on Neurologic, Functional, and Safety Outcomes in Patients With Acute Traumatic Spinal Cord Injury. Glob. Spine J. 2017, 7, 151S–174S. [Google Scholar] [CrossRef] [Green Version]
  7. Ahuja, C.S.; Wilson, J.R.; Nori, S.; Kotter, M.R.N.; Druschel, C.; Curt, A.; Fehlings, M.G. Traumatic spinal cord injury. Nat. Rev. Dis. Prim. 2017, 3, 21. [Google Scholar] [CrossRef] [Green Version]
  8. Talbott, J.F.; Whetstone, W.D.; Readdy, W.J.; Ferguson, A.R.; Bresnahan, J.C.; Saigal, R.; Hawryluk, G.W.J.; Beattie, M.S.; Mabray, M.C.; Pan, J.Z.; et al. The Brain and Spinal Injury Center score: A novel, simple, and reproducible method for assessing the severity of acute cervical spinal cord injury with axial T2-weighted MRI findings. J. Neurosurg.-Spine 2015, 23, 495–504. [Google Scholar] [CrossRef] [Green Version]
  9. Lee, E.; Lee, J.W.; Bae, Y.J.; Kim, H.J.; Kang, Y.; Ahn, J.M. Reliability of pre-operative diffusion tensor imaging parameter measurements of the cervical spine in patients with cervical spondylotic myelopathy. Sci. Rep. 2020, 10, 17410. [Google Scholar] [CrossRef]
  10. Cui, J.L.; Cui, J.L.; Wen, C.Y.; Wen, C.Y.; Hu, Y.; Hu, Y.; Mak, K.C.; Mak, K.C.; Mak, K.H.H.; Mak, K.H.H.; et al. Orientation entropy analysis of diffusion tensor in healthy and myelopathic spinal cord. Neuroimage 2011, 58, 1028. [Google Scholar] [CrossRef]
  11. Jin, R.; Luk, K.D.; Cheung, J.P.Y.; Hu, Y. Prognosis of cervical myelopathy based on diffusion tensor imaging with artificial intelligence methods. NMR Biomed. 2019, 32, e4114. [Google Scholar] [CrossRef] [PubMed]
  12. Li, X.; Cui, J.-L.; Mak, K.-C.; Luk, K.D.-K.; Hu, Y. Potential Use of Diffusion Tensor Imaging in Level Diagnosis of Multilevel Cervical Spondylotic Myelopathy. Spine 2014, 39, E615–E622. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Chan, T.Y.; Li, X.; Mak, K.C.; Cheung, J.P.; Luk, K.D.; Hu, Y. Normal values of cervical spinal cord diffusion tensor in young and middle-aged healthy Chinese. Eur. Spine J. 2015, 24, 2991–2998. [Google Scholar] [CrossRef] [PubMed]
  14. Jin, R.C.; Hu, Y. Effect of segmentation from different diffusive metric maps on diffusion tensor imaging analysis of the cervical spinal cord. Quant. Imaging Med. Surg. 2019, 9, 292–303. [Google Scholar] [CrossRef]
  15. Fonov, V.S.; Troter, A.L.; Taso, M.; Leener, B.D.; Lévêque, G.; Benhamou, M.; Sdika, M.; Benali, H.; Pradat, P.; Collins, D.L.; et al. Framework for integrated MRI average of the spinal cord white and gray matter: The MNI–Poly–AMU template. Neuroimage 2014, 102, 817–827. [Google Scholar] [CrossRef]
  16. El Mendili, M.-M.; Chen, R.; Tiret, B.; Villard, N.; Trunet, S.; Pélégrini-Issac, M.; Lehéricy, S.; Pradat, P.-F.; Benali, H. Fast and Accurate Semi-Automated Segmentation Method of Spinal Cord MR Images at 3T Applied to the Construction of a Cervical Spinal Cord Template. PLoS ONE 2015, 10, e0122224. [Google Scholar] [CrossRef]
  17. Leener, B.D.; Cohen-Adad, J.; Kadoury, S. Automatic Segmentation of the Spinal Cord and Spinal Canal Coupled With Vertebral Labeling. IEEE Trans. Med. Imaging 2015, 34, 1705–1718. [Google Scholar] [CrossRef]
  18. Chen, M.; Carass, A.; Oh, J.; Nair, G.; Pham, D.L.; Reich, D.S.; Prince, J.L. Automatic magnetic resonance spinal cord segmentation with topology constraints for variable fields of view. Neuroimage 2013, 83, 1051–1062. [Google Scholar] [CrossRef] [Green Version]
  19. Horváth, A.; Pezold, S.; Weigel, M.; Parmar, K.; Bieri, O.; Cattin, P.C. Variational Segmentation of the White and Gray Matter in the Spinal Cord Using a Shape Prior. In Computational Methods and Clinical Applications for Spine Imaging; Springer: Cham, Switzerland, 2016. [Google Scholar]
  20. Prados, F.; Cardoso, M.J.; Yiannakas, M.C.; Hoy, L.R.; Tebaldi, E.; Kearney, H.; Liechti, M.D.; Miller, D.H.; Ciccarelli, O.; Wheeler-Kingshott, C.A.M.; et al. Fully automated grey and white matter spinal cord segmentation. Sci. Rep. 2016, 6, 36151. [Google Scholar] [CrossRef] [Green Version]
  21. Dostál, M.; Keřkovský, M.; Korit Áková, E.; Němcová, E.; Stulík, J.; Staňková, M.; Bernard, V. Analysis of diffusion tensor measurements of the human cervical spinal cord based on semiautomatic segmentation of the white and gray matter. J. Magn. Reson. Imaging JMRI 2018, 48, 1217–1227. [Google Scholar] [CrossRef]
  22. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
  23. Zhang, X.; Li, Y.; Liu, Y.; Tang, S.X.; Liu, X.; Punithakumar, K.; Shi, D. Automatic spinal cord segmentation from axial-view MRI slices using CNN with grayscale regularized active contour propagation. Comput. Biol. Med. 2021, 132, 104345. [Google Scholar] [CrossRef] [PubMed]
  24. Alsenan, A.; Youssef, B.B.; Alhichri, H. A Deep Learning Model based on MobileNetV3 and UNet for Spinal Cord Gray Matter Segmentation. In Proceedings of the 2021 44th International Conference on Telecommunications and Signal Processing (TSP), Virtual, 26–28 July 2021; pp. 244–248. [Google Scholar]
  25. De Leener, B.; Lévy, S.; Dupont, S.M.; Fonov, V.S.; Stikov, N.; Louis Collins, D.; Callot, V.; Cohen-Adad, J. SCT: Spinal Cord Toolbox, an open-source software for processing spinal cord MRI data. Neuroimage 2017, 145, 24–43. [Google Scholar] [CrossRef]
  26. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
  27. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; pp. 770–778. [Google Scholar]
  28. Zhu, W.; Huang, Y.; Zeng, L.; Chen, X.; Liu, Y.; Qian, Z.; Du, N.; Fan, W.; Xie, X. AnatomyNet: Deep learning for fast and fully automated whole-volume segmentation of head and neck anatomy. Med. Phys. 2019, 46, 576–589. [Google Scholar] [CrossRef] [Green Version]
  29. Tsai, A. Anatomical landmark localization via convolutional neural networks for limb-length discrepancy measurements. Pediatr. Radiol. 2021, 51, 1431–1447. [Google Scholar] [CrossRef] [PubMed]
  30. Zhu, Q.K.; Du, B.; Yan, P.K. Boundary-Weighted Domain Adaptive Neural Network for Prostate MR Image Segmentation. IEEE Trans. Med. Imaging 2020, 39, 753–763. [Google Scholar] [CrossRef] [PubMed]
  31. Leener, B.D.; Kadoury, S.; Cohen-Adad, J. Robust, accurate and fast automatic segmentation of the spinal cord. Neuroimage 2014, 98, 528–536. [Google Scholar] [CrossRef]
  32. Yiannakas, M.C.; Mustafa, A.M.; De Leener, B.; Kearney, H.; Tur, C.; Altmann, D.R.; De Angelis, F.; Plantone, D.; Ciccarelli, O.; Miller, D.H.; et al. Fully automated segmentation of the cervical cord from T1-weighted MRI using PropSeg: Application to multiple sclerosis. NeuroImage. Clin. 2016, 10, 71–77. [Google Scholar] [CrossRef] [Green Version]
  33. Fouladivanda, M.; Kazemi, K.; Helfroush, M.S.; Shakibafard, A. Morphological active contour driven by local and global intensity fitting for spinal cord segmentation from MR images. J. Neurosci. Methods 2018, 308, 116–128. [Google Scholar] [CrossRef]
  34. Sabaghian, S.; Dehghani, H.; Batouli, S.A.H.; Khatibi, A.; Oghabian, M.A. Fully automatic 3D segmentation of the thoracolumbar spinal cord and the vertebral canal from T2-weighted MRI using K-means clustering algorithm. Spinal Cord 2020, 58, 811–820. [Google Scholar] [CrossRef]
  35. Cadotte, A.; Cadotte, D.W.; Livne, M.; Cohen-Adad, J.; Fleet, D.; Mikulis, D.J.; Fehlings, M.G. Spinal Cord Segmentation by One Dimensional Normalized Template Matching: A Novel, Quantitative Technique to Analyze Advanced Magnetic Resonance Imaging Data. PLoS ONE 2015, 10, e0139323. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. van Hecke, W.; Leemans, A.; Sijbers, J.; Vandervliet, E.; Van Goethem, J.W.; Parizel, P.M. A tracking-based diffusion tensor imaging segmentation method for the detection of diffusion-related changes of the cervical spinal cord with aging. J. Magn. Reson. Imaging 2008, 27, 978–991. [Google Scholar] [CrossRef]
  37. Gros, C.; Leener, B.D.; Badji, A.; Maranzano, J.; Eden, D.; Dupont, S.M.; Talbott, J.F.; Zhuoquiong, R.; Liu, Y.; Granberg, T.; et al. Automatic segmentation of the spinal cord and intramedullary multiple sclerosis lesions with convolutional neural networks. Neuroimage 2019, 184, 901–915. [Google Scholar] [CrossRef] [Green Version]
  38. Datta, E.; Papinutto, N.; Schlaeger, R.; Zhu, A.H.; Carballido-Gamio, J.; Henry, R.G. Gray matter segmentation of the spinal cord with active contours in MR images. Neuroimage 2017, 147, 788–799. [Google Scholar] [CrossRef] [PubMed]
  39. Dupont, S.M.; Leener, B.D.; Taso, M.; Troter, A.L.; Nadeau, S.; Stikov, N.; Callot, V.; Cohen-Adad, J. Fully-integrated framework for the segmentation and registration of the spinal cord white and gray matter. Neuroimage 2017, 150, 358–372. [Google Scholar] [CrossRef]
  40. Prados, F.; Ashburner, J.; Blaiotta, C.; Brosch, T.; Carballido-Gamio, J.; Cardoso, M.J.; Conrad, B.N.; Datta, E.; Dávid, G.; Leener, B.D.; et al. Spinal cord grey matter segmentation challenge. Neuroimage 2017, 152, 312–329. [Google Scholar] [CrossRef] [Green Version]
  41. Perone, C.S.; Calabrese, E.; Cohen-Adad, J. Spinal cord gray matter segmentation using deep dilated convolutions. Sci. Rep. 2018, 8, 5966. [Google Scholar] [CrossRef] [PubMed]
  42. Wang, S.Q.; Li, X.; Cui, J.L.; Li, H.X.; Luk, K.D.K.; Hu, Y. Prediction of myelopathic level in cervical spondylotic myelopathy using diffusion tensor imaging. J. Magn. Reson. Imaging 2015, 41, 1682–1688. [Google Scholar] [CrossRef] [Green Version]
  43. Skotarczak, M.; Dzierzanowski, J.; Kaszubowski, M.; Winklewski, P.J.; Romanowski, A.; Szurowska, E.; Szarmach, A. Diagnostic value of diffusion tensor imaging in patients with clinical signs of cervical spondylotic myelopathy. Neurol. Neurochir. Pol. 2022, 56, 341–348. [Google Scholar] [CrossRef]
  44. Scholler, K.; Siller, S.; Brem, C.; Lutz, J.; Zausinger, S. Diffusion Tensor Imaging for Surgical Planning in Patients with Cervical Spondylotic Myelopathy. J. Neurol. Surg. Part A 2020, 81, 1–9. [Google Scholar] [CrossRef]
  45. Ellingson, B.M.; Salamon, N.; Woodworth, D.C.; Yokota, H.; Holly, L.T. Reproducibility, temporal stability, and functional correlation of diffusion MR measurements within the spinal cord in patients with asymptomatic cervical stenosis or cervical myelopathy. J. Neurosurg.-Spine 2018, 28, 472–480. [Google Scholar] [CrossRef] [Green Version]
  46. Wen, C.Y.; Cui, J.L.; Liu, H.S.; Mak, K.C.; Cheung, W.Y.; Luk, K.D.K.; Hu, Y. Is Diffusion Anisotropy a Biomarker for Disease Severity and Surgical Prognosis of Cervical Spondylotic Myelopathy? Radiology 2014, 270, 197–204. [Google Scholar] [CrossRef]
  47. Borkowski, K.; Krzyzak, A.T. Assessment of the systematic errors caused by diffusion gradient inhomogeneity in DTI-computer simulations. NMR Biomed. 2019, 32, e4130. [Google Scholar] [CrossRef] [PubMed]
  48. Ahn, C.B.; Cho, Z.H. ANALYSIS OF THE EDDY-CURRENT INDUCED ARTIFACTS AND THE TEMPORAL COMPENSATION IN NUCLEAR-MAGNETIC-RESONANCE IMAGING. IEEE Trans. Med. Imaging 1991, 10, 47–52. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Pierpaoli, C.; Basser, P.J. Toward a quantitative assessment of diffusion anisotropy. Magn. Reson. Med. 1996, 36, 893–906. [Google Scholar] [CrossRef]
  50. Borkowski, K.; Krzyżak, A.T. Analysis and correction of errors in DTI-based tractography due to diffusion gradient inhomogeneity. J. Magn. Reson. 2018, 296, 5–11. [Google Scholar] [CrossRef]
  51. Bammer, R.; Markl, M.; Barnett, A.; Acar, B.; Alley, M.T.; Pelc, N.J.; Glover, G.H.; Moseley, M.E. Analysis and generalized correction of the effect of spatial gradient field distortions in diffusion-weighted imaging. Magn. Reson. Med. 2003, 50, 560–569. [Google Scholar] [CrossRef] [PubMed]
  52. Tan, E.T.; Marinelli, L.; Slavens, Z.W.; King, K.F.; Hardy, C.J. Improved correction for gradient nonlinearity effects in diffusion-weighted imaging. J. Magn. Reson. Imaging 2013, 38, 448–453. [Google Scholar] [CrossRef]
  53. Krzyżak, A.T.; Olejniczak, Z. Improving the accuracy of PGSE DTI experiments using the spatial distribution of b matrix. Magn. Reson. Imaging 2015, 33, 286–295. [Google Scholar] [CrossRef]
  54. Rogers, B.P.; Blaber, J.; Welch, E.B.; Ding, Z.; Anderson, A.W.; Landman, B.A. Stability of Gradient Field Corrections for Quantitative Diffusion MRI. In Proceedings of the SPIE Medical Imaging, Orlando, FL, USA, 11–16 February 2017; Volume 10132. [Google Scholar] [CrossRef] [Green Version]
  55. Kłodowski, K.; Krzyżak, A.T. Innovative anisotropic phantoms for calibration of diffusion tensor imaging sequences. Magn. Reson. Imaging 2016, 34, 404–409. [Google Scholar] [CrossRef]
  56. Zhu, T.; Hu, R.; Qiu, X.; Taylor, M.; Tso, Y.; Yiannoutsos, C.; Navia, B.; Mori, S.; Ekholm, S.; Schifitto, G.; et al. Quantification of accuracy and precision of multi-center DTI measurements: A diffusion phantom and human brain study. Neuroimage 2011, 56, 1398–1411. [Google Scholar] [CrossRef] [Green Version]
  57. Melhem, E.R.; Itoh, R.; Jones, L.; Barker, P.B. Diffusion tensor MR imaging of the brain: Effect of diffusion weighting on trace and anisotropy measurements. Am. J. Neuroradiol. 2000, 21, 1813–1820. [Google Scholar] [PubMed]
  58. Qin, W.; Yu, C.S.; Zhang, F.; Du, X.Y.; Jiang, H.; Yan, Y.X.; Li, K.C. Effects of Echo Time on Diffusion Quantification of Brain White Matter at 1.5T and 3.0T. Magn. Reson. Med. 2009, 61, 755–760. [Google Scholar] [CrossRef] [PubMed]
  59. Ni, H.; Kavcic, V.; Zhu, T.; Ekholm, S.; Zhong, J. Effects of number of diffusion gradient directions on derived diffusion tensor imaging indices in human brain. Am. J. Neuroradiol. 2006, 27, 1776–1781. [Google Scholar] [PubMed]
  60. Widjaja, E.; Mahmoodabadi, S.Z.; Rea, D.; Moineddin, R.; Vidarsson, L.; Nilsson, D. Effects of Gradient Encoding and Number of Signal Averages on Fractional Anisotropy and Fiber Density Index In Vivo at 1.5 Tesla. Acta Radiol. 2009, 50, 106–113. [Google Scholar] [CrossRef] [PubMed]
Figure 1. ROIs covering the whole spinal cord.
Figure 1. ROIs covering the whole spinal cord.
Diagnostics 13 00817 g001
Figure 2. Structure of the segmentation network.
Figure 2. Structure of the segmentation network.
Diagnostics 13 00817 g002
Figure 3. The percentage of pixels for each class on DTI images.
Figure 3. The percentage of pixels for each class on DTI images.
Diagnostics 13 00817 g003
Figure 4. The comparison between different encoders.
Figure 4. The comparison between different encoders.
Diagnostics 13 00817 g004
Figure 5. Several segmentation results with the proposed model.
Figure 5. Several segmentation results with the proposed model.
Diagnostics 13 00817 g005
Figure 6. The performance comparison of UNet with different losses.
Figure 6. The performance comparison of UNet with different losses.
Diagnostics 13 00817 g006
Figure 7. The performance comparison of UNet with different losses using different DTI images.
Figure 7. The performance comparison of UNet with different losses using different DTI images.
Diagnostics 13 00817 g007
Figure 8. The correlation between FA of predicted segmentation ROIs and FA of ground truth ROIs. Pred_: the ROIs from the proposed segmentation model; GT_: the ROIs from the manual segmentation.
Figure 8. The correlation between FA of predicted segmentation ROIs and FA of ground truth ROIs. Pred_: the ROIs from the proposed segmentation model; GT_: the ROIs from the manual segmentation.
Diagnostics 13 00817 g008
Table 1. The performance of UNet with proposed HDL loss.
Table 1. The performance of UNet with proposed HDL loss.
ClassDiceRecallPrecision
Left dorsal column0.69 ± 0.190.78 ± 0.230.64 ± 0.2
Left lateral column0.67 ± 0.240.74 ± 0.290.63 ± 0.23
Left ventral column0.57 ± 0.290.62 ± 0.330.55 ± 0.28
Left gray matter0.54 ± 0.280.59 ± 0.330.53 ± 0.28
Right dorsal column0.68 ± 0.210.74 ± 0.250.65 ± 0.2
Right lateral column0.67 ± 0.220.74 ± 0.260.65 ± 0.22
Right ventral column0.59 ± 0.260.63 ± 0.290.58 ± 0.27
Right gray matter0.55 ± 0.310.56 ± 0.340.57 ± 0.32
Table 2. The performance of UNet with proposed loss removed outlier slices.
Table 2. The performance of UNet with proposed loss removed outlier slices.
ClassDiceRecallPrecisionOutliers
Left dorsal column0.69 ± 0.190.78 ± 0.230.64 ± 0.20
Left lateral column0.68 ± 0.230.76 ± 0.270.64 ± 0.212%
Left ventral column0.61 ± 0.250.67 ± 0.30.59 ± 0.257%
Left gray matter0.57 ± 0.250.63 ± 0.30.56 ± 0.257%
Right dorsal column0.68 ± 0.20.74 ± 0.240.66 ± 0.21%
Right lateral column0.69 ± 0.180.76 ± 0.230.67 ± 0.193%
Right ventral column0.63 ± 0.230.67 ± 0.260.61 ± 0.246%
Right gray matter0.61 ± 0.270.63 ± 0.30.63 ± 0.2710%
Table 3. The percentage of absolute error between two types of FA value.
Table 3. The percentage of absolute error between two types of FA value.
Class F A e r r o r i
Left dorsal column0.07 ± 0.07
Left lateral column0.07 ± 0.11
Left ventral column0.11 ± 0.17
Left gray matter0.08 ± 0.14
Right dorsal column0.07 ± 0.08
Right lateral column0.1 ± 0.11
Right ventral column0.1 ± 0.23
Right gray matter0.07 ± 0.09
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fei, N.; Li, G.; Wang, X.; Li, J.; Hu, X.; Hu, Y. Deep Learning-Based Auto-Segmentation of Spinal Cord Internal Structure of Diffusion Tensor Imaging in Cervical Spondylotic Myelopathy. Diagnostics 2023, 13, 817. https://doi.org/10.3390/diagnostics13050817

AMA Style

Fei N, Li G, Wang X, Li J, Hu X, Hu Y. Deep Learning-Based Auto-Segmentation of Spinal Cord Internal Structure of Diffusion Tensor Imaging in Cervical Spondylotic Myelopathy. Diagnostics. 2023; 13(5):817. https://doi.org/10.3390/diagnostics13050817

Chicago/Turabian Style

Fei, Ningbo, Guangsheng Li, Xuxiang Wang, Junpeng Li, Xiaosong Hu, and Yong Hu. 2023. "Deep Learning-Based Auto-Segmentation of Spinal Cord Internal Structure of Diffusion Tensor Imaging in Cervical Spondylotic Myelopathy" Diagnostics 13, no. 5: 817. https://doi.org/10.3390/diagnostics13050817

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop