Deep Learning Convolutional Neural Networks for the Automatic Quantification of Muscle Fat Infiltration Following Whiplash Injury

Weber, Kenneth A.; Smith, Andrew C.; Wasielewski, Marie; Eghtesad, Kamran; Upadhyayula, Pranav A.; Wintermark, Max; Hastie, Trevor J.; Parrish, Todd B.; Mackey, Sean; Elliott, James M.

doi:10.1038/s41598-019-44416-8

Download PDF

Article
Open access
Published: 28 May 2019

Deep Learning Convolutional Neural Networks for the Automatic Quantification of Muscle Fat Infiltration Following Whiplash Injury

Kenneth A. Weber ORCID: orcid.org/0000-0002-0916-9174¹,
Andrew C. Smith²,
Marie Wasielewski³,
Kamran Eghtesad¹,
Pranav A. Upadhyayula¹,
Max Wintermark⁴,
Trevor J. Hastie⁵,
Todd B. Parrish⁶,
Sean Mackey¹ &
…
James M. Elliott^3,7,8

Scientific Reports volume 9, Article number: 7973 (2019) Cite this article

5255 Accesses
42 Citations
29 Altmetric
Metrics details

Subjects

Abstract

Muscle fat infiltration (MFI) of the deep cervical spine extensors has been observed in cervical spine conditions using time-consuming and rater-dependent manual techniques. Deep learning convolutional neural network (CNN) models have demonstrated state-of-the-art performance in segmentation tasks. Here, we train and test a CNN for muscle segmentation and automatic MFI calculation using high-resolution fat-water images from 39 participants (26 female, average = 31.7 ± 9.3 years) 3 months post whiplash injury. First, we demonstrate high test reliability and accuracy of the CNN compared to manual segmentation. Then we explore the relationships between CNN muscle volume, CNN MFI, and clinical measures of pain and neck-related disability. Across all participants, we demonstrate that CNN muscle volume was negatively correlated to pain (R = −0.415, p = 0.006) and disability (R = −0.286, p = 0.045), while CNN MFI tended to be positively correlated to disability (R = 0.214, p = 0.105). Additionally, CNN MFI was higher in participants with persisting pain and disability (p = 0.049). Overall, CNN’s may improve the efficiency and objectivity of muscle measures allowing for the quantitative monitoring of muscle properties in disorders of and beyond the cervical spine.

Multi-muscle deep learning segmentation to automate the quantification of muscle fat infiltration in cervical spine conditions

Article Open access 16 August 2021

Kenneth A. Weber II, Rebecca Abbott, … James M. Elliott

Convolutional neural networks for the automatic segmentation of lumbar paraspinal muscles in people with low back pain

Article Open access 05 August 2022

E. O. Wesselink, J. M. Elliott, … K. A. Weber II

Precise individual muscle segmentation in whole thigh CT scans for sarcopenia assessment using U-net transformer

Article Open access 08 February 2024

Hyeon Su Kim, Hyunbin Kim, … Jun-Il Yoo

Introduction

Muscle fat infiltration (MFI) has been described by conventional (T₁- and T₂-weighted) and advanced (Dixon and proton density fat fraction) magnetic resonance imaging (MRI) in cervical spine conditions, such as degenerative cervical myelopathy (DCM)^1,2, spinal cord injury (SCI)^3,4, and whiplash from a motor vehicle collision (MVC)^5,6,7,8. While the mechanisms underlying these conditions greatly differ, the patterns of MFI appear consistent with the greatest magnitude occurring in the deepest, and most architecturally complex, muscular layer of the cervical extensors (i.e., multifidus and semispinalis cervicis)^5,6,7.

While plausible to hypothesize that larger amounts of paraspinal MFI negatively impact function, recent studies do not provide confirmation^9,10. This could be due to the varied methods used to measure MFI and function¹¹, as others have shown MFI to be associated with lower physical function^12,13,14, and that better surgical outcomes are achieved in those with larger muscle size and better quality^15,16. Preliminary evidence suggests MFI may be reversible and associated with a concomitant improvement in chronic whiplash-related disability¹⁷.

Muscular degeneration (as the larger magnitude of MFI might indicate) may have clinical implications for management and rates of recovery from persistent spinal disorders that currently feature high as the world’s most disabling diseases: low back pain (first) and neck pain (fourth)¹⁸. Despite ever-increasing options for ‘treatment’ of these conditions, explanations for their persistence have become urgently needed¹⁹. MFI may be one neurobiological explanation^11,20,21. However, manual segmentation methods for MFI do not permit for time-efficient monitoring of these muscles in clinical practice.

The recent application of deep learning methods (i.e., convolutional neural networks (CNN’s)) in medical imaging analysis has demonstrated impressive gains in image segmentation, with accuracy nearing human-level performance^22,23,24. Accordingly, CNN’s may permit time-efficient quantification of the characteristic MFI observed in disorders of the spine (e.g., DCM, SCI, whiplash, and low back pain) and other musculoskeletal/neuromuscular conditions (e.g., rotator cuff pathology, osteoarthritis, diabetes, and laminopathies)²⁵.

In this study, we trained and tested a CNN for segmentation of the deep cervical spine extensor muscles using high-resolution fat-water Dixon images from participants with whiplash following an MVC. We leveraged a previously developed CNN for the segmentation of medical images, V-Net, and the newly released deep learning platform, NiftyNet^26,27. Then, we assessed the association of the automated CNN measures to clinical measures of pain and neck-related disability.

Results

CNN accuracy and reliability

Training the V-Net model was completed in 25,000 iterations. The trained CNN segmented every axial slice from the C3 to C7 vertebrae in less than 60 s per image. Accuracy of the trained CNN was evaluated on the testing dataset (n = 14). The average muscle volume ± 1 standard deviation (SD) for the CNN on the testing dataset was 34.9 ± 6.2 ml and 34.1 ± 6.1 ml for the left and right muscles, respectively, and the average MFI ± 1 SD was 20.7 ± 3.5% and 20.2 ± 4.3% for the left and right muscles, respectively.

Overall, we report high accuracy of the CNN on the testing dataset (Fig. 1). The average DICE ± 1 SD for the left and right deep cervical extensors was 0.862 ± 0.017 and 0.871 ± 0.016, respectively. The CNN model had high sensitivity (average true positive rate = 0.904 ± 0.021 and 0.908 ± 0.017 for the left and right muscles, respectively) and high precision (average positive predictive value = 0.829 ± 0.031 and 0.843 ± 0.032 for the left and right muscles, respectively). CNN segmentation performance was similar at the C3–C4 and C5–C6 vertebral levels where the deep cervical extensor muscle composition and morphometry differs. The average DICE was 0.876 ± 0.024 and 0.883 ± 0.026 at C3–C4 and 0.865 ± 0.020 and 0.866 ± 0.025 at C5–C6 for the left and right deep cervical extensors, respectively. The CNN performance metrics of the testing dataset are summarized in Table 1. Each rater’s masks were used as the ground truth (GT) to evaluate the performance of the CNN, and the average measures across the three raters were reported.

Table 1 Segmentation Performance Metrics of the CNN on the Testing Dataset (n = 14).

Full size table

Interrater reliability between the three raters was excellent with the intraclass correlation coefficients (ICC_2,1) for the left volume, right volume, left MFI, and right MFI equal to 0.85, 0.83, 0.90, and 0.92, respectively. Using the average measures across the raters, the interrater reliability between the raters and the CNN model was also excellent with the ICC_2,1 for the left volume, right volume, left MFI, and right MFI equal to 0.94, 0.95, 0.92, and 0.88, respectively. The average difference in volume (CNN − GT) was 2.5 ml (95% confidence interval (CI) 1.5–3.5) and 2.2 ml (1.4–3.1) for the left and right muscles, respectively, indicating that the CNN overestimated (i.e., bias) the muscle volume compared to the GT. Similarly, the average difference in MFI (CNN − GT) was 1.4% (95% CI 0.6–2.1) and 1.7% (0.6–2.8) for the left and right muscles, respectively, indicating that the CNN also demonstrated a bias towards higher MFI compared to the GT (Fig. 2).

Association with clinical measures of pain and neck-related disability

Muscle volume was significantly negatively correlated to pain (R = −0.415, p = 0.006) and neck-related disability (R = −0.286, p = 0.045). MFI tended to be positively correlated to neck-related disability (R = 0.214, p = 0.105) but not pain (R = 0.075, p = 0.331) (Fig. 3A). Average pain (t = −3.356, df = 37, p < 0.001) and neck-related disability (t = −6.060, df = 37, p < 0.001) were significantly higher in the persistent group (neck disability index (NDI) > 28, n = 20, 16 female, average age = 33.3 ± 9.5 years, body mass index (BMI) = 23.8 ± 3.2 kg/m²) compared to the recovered group (NDI ≤ 28, n = 19, 10 female, average age = 29.9 ± 9.1 years, BMI = 25.8 ± 4.0 kg/m²). As hypothesized, MFI (t = 1.696, df = 37, p = 0.049) was significantly higher in the persistent group compared to those in the recovered group. While muscle volume was lower in the persistent group compared to those in the recovered group, the difference was not significant (t = −1.036, df = 37, p = 0.154) (Fig. 3B). Pain, neck-related disability, muscle volume, and MFI for each group are summarized in Table 2.

Table 2 CNN Muscle and Clinical Measures for Recovered and Persisting Whiplash.

Full size table

Discussion

In this study, we trained and tested a CNN for segmentation of the deep cervical spine extensor muscles from high-resolution fat-water Dixon MRI datasets of participants with whiplash injury using V-Net and the NiftyNet deep learning platform. Overall, we demonstrate the feasibility of training a previous CNN model for a novel segmentation task. The trained CNN was highly efficient (< 60 s compared to ≈ 20 minutes for manual segmentation) in processing an image, and we report high accuracy and reliability of the CNN compared to manual segmentation for both the muscle volume and MFI measures. The association of the automated CNN measures to clinical measures of pain intensity and neck-related disability was also recognized. Lower muscle volume was associated with higher pain and higher neck-related disability, and higher MFI was present in participants with persisting whiplash versus those that recovered, supporting the validity and clinical utility of these measures.

Our findings are consistent with previous work from three different countries with different insurance schemes (Australia, United States, and Sweden), all demonstrating larger magnitudes of cervical spine extensor MFI in those with more severe levels of whiplash-related disability^5,6,7,28,29. Participants nominating full recovery or mild symptoms do not present with the same magnitude of MFI. As such, the expression of MFI may embody one neurobiological basis underlying the transition to chronicity in a discrete, but not insignificant, number of these patients with persistent whiplash. It is noteworthy that the findings of MFI are not unique to whiplash injury, as similar MFI profiles have been observed, and reported, in DCM^1,2, SCI³, but not idiopathic neck pain^8,30, suggesting degenerative, and potentially traumatic, factors play a role in their development. More mechanistic work for understanding why and how MFI develops and its influence on recovery from trauma and other common degenerative processes is warranted and underway.

The CNN demonstrated a bias towards higher muscle volumes and MFI compared to manual segmentation. Continued training of the model, training on a larger and more diverse dataset, or the adjustment of the model hyperparameters may have reduced the bias and improved the accuracy. The increased muscle volume could also be due to an intrinsic property of the V-Net architecture, leading to dilation of the output, possibly as the information is compressed and decompressed through the convolutional and deconvolutional layers. The higher segmentation volume likely led to the inclusion of extramuscular fat located adjacent to the deep cervical extensors, which would contribute to the higher MFI measure. However, the average bias of the CNN was small and did not preclude us from identifying associations with the clinical measures. Because no specific clinical cutoffs or normative comparative datasets yet exist for these measures, we are not able to assess the clinical significance of the bias. More testing is necessary to fully understand the properties and behavior of the network to improve the accuracy and relationships to the clinical measures.

The NiftyNet platform supports multi-modal CNN models and 3D convolutional layers. In the present study, 2D convolutional layers were employed using the water-only axial images as features. The inclusion of the fat-only images and use of 3D convolutional layers may provide additional information to more accurately segment the muscles, but at the trade-off of a greater number of features, larger network size, higher model complexity, and increased computational costs. The reason for our choice was that our group mainly uses the axial water-only images for segmentation of the deep cervical extensors, and intuitively, we feel that they contain the most information³¹. We are actively exploring the inclusion of multi-modal features and different network architectures. The use of dilated convolutional layers in the multi-modal models may help reduce the feature space while maintaining spatial coverage and accuracy²³.

The use of images from the same site, scanner, sequence, and imaging parameters reduces the generalizability of the trained model and is a recognized limitation. A major barrier in developing deep learning models for medical imaging tasks is the availability of large, diverse annotated datasets for model training. Fortunately, several collaborative efforts are currently in progress to pool clinically- and research-based imaging data towards the development of large multi-site (and multi-cultural) annotated imaging databases for research purposes. The use of multi-site imaging data will have its own inherit challenges that include, but are not limited to, developing/establishing analysis pipelines and models that can generalize to images of varying resolution, field-of-view, and orientation while also accounting for variability in image signal and contrast due to differences in the imaging parameters and equipment (TR, TE, imaging field strength, and manufacturer).

CNN models have potential to provide an efficient, accurate, and objective measure of muscle volume and MFI. Future directions will aim to refine CNN hyperparameters, compare different CNN models, explore the use of multi-modal imaging, obtain larger multi-site annotated imaging datasets to increase performance and generalizability, and establish a global resource of normative reference values where clinical comparisons can be informed on a patient-by-patient basis.

Methods

Participants

MRI datasets from 39 participants (26 female, average age ± 1 SD = 31.7 ± 9.3 years) were obtained from a prospective observational longitudinal study exploring recovery from whiplash (ClinicalTrials.gov Identifier: NCT02157038). Datasets from the third time point at 3-months post MVC were used in the present study. Inclusion criteria included age 18 to 65 years, Quebec Task Force whiplash grades of II to III, and < 1 week post MVC with a primary complaint of neck pain³². Exclusion criteria included spinal fracture from the MVC, history of a previous MVC, previous spinal surgery, previous diagnosis of cervical or lumbar radiculopathy, history of neurological or metabolic disorders, and standard contraindications to MRI. The study was approved by Northwestern University’s Institutional Review Board. All applicable institutional and governmental regulations concerning the ethical use of human volunteers were followed during the course of this research according to the Declaration of Helsinki, and written informed consent was obtained from every participant. Prior to working with the datasets, all personal identifying information was removed.

Image acquisition and processing

Imaging was performed on a 3.0 T Siemens (Munich, Germany) Prisma scanner equipped with a 64-channel head/neck coil. High-resolution 3D fat-water images of the cervical and upper thoracic spine were acquired using a dual-echo gradient-echo FLASH sequence (2-point Dixon, TR = 7.05 ms, TE₁ = 2.46 ms, TE₂ = 3.69 ms, flip angle = 12°, bandwidth = 510 Hz/pixel, FOV = 190 × 320 mm², slab oversampling of 20% with 40 partitions to prevent aliasing in the anterior-posterior direction, in-plane resolution = 0.7 × 0.7 mm², slice thickness = 3.0 mm, number of averages = 6, acquisition time = 4 min 5 s)³³. Fat and water have slightly different chemical structures and precessional frequencies that differentially influence the local magnetic field. Images can be acquired when the fat and water signals are in-phase (IP = W + F) and out-of-phase (OOP = W − F). The in-phase and out-of-phase images can be combined to create images with fat-only signal (F = (IP − OOP)/2) and water-only signal (W = (IP + OOP)/2). As the images are acquired simultaneously in the same sequence and space, the images require no registration. Three blinded, independent raters, each doctoral-level health professionals, with extensive training in the cervical spine anatomy and musculoskeletal imaging, manually segmented the left and right deep cervical extensor muscles (i.e., multifidus and semispinalis cervicis) from the water-only images using methods previously described (2018)³¹. The segmentation masks contained the background, left muscle group, and right muscle group, labeled as 0, 1, and 2, respectively.

Data augmentation

Data augmentation is a step commonly used to supplement the size of the training dataset. The images were randomly split into training (n = 25) and testing (n = 14) datasets, and 5,100 augmented datasets were generated by applying a series of random affine spatial transformations (scaling, shearing, rotation, translation, and reflection) and adding varying degrees of Gaussian noise to a training image. For each augmented dataset, the same spatial transformations were applied to the segmentation mask from one randomly selected rater for use as the GT. A similar approach using each rater as the GT was used by Perone et al. (2018) for model training, forcing the model to learn the optimal weights for segmentation despite the interrater variability²³. The augmented datasets were then split into final training (n = 5,000) and validation (n = 100) datasets.

V-Net

V-Net is a CNN designed for segmentation tasks. The network consists of several stages having one or more convolutional layers (5 × 5 kernels with stride 1 and padding) followed by a PReLu activation function to extract features. The last step of each stage is a convolutional or de-convolutional layer (2 × 2 kernels with stride 2) to decrease or increase the resolution, respectively. The first half of the network contracts the resolution, while the second half expands the resolution back to the input dimensions. At each stage, a residual learning framework is implemented by adding the input of each stage to the output of its last convolutional layer. Fine feature information from each convolutional stage is also forwarded to the corresponding de-convolutional stage to improve the contour prediction. To limit bias towards predicting the image background, a loss function based on the Sørensen-Dice index (DICE) was employed and minimized. The output after soft-max transformation is probabilistic segmentation masks for the left and right muscles with the same dimensions as the input volume²⁶.

Training

V-Net was trained on the water-only images from the augmented training dataset using NiftyNet (Version 0.2.2, spatial window = 256 × 256, window orientation = axial, padding = 128 × 128, learning rate = 0.001, optimizer = Adam, loss function = DICE, decay = 0.0001, samples per volume = 30, window sampling = uniform, batch size = 30). NiftyNet is an open-source CNN platform built on TensorFlow (Version 1.7) in Python (Version 2.7) and designed specifically for medical imaging analysis²⁷. Prior to training, histogram standardization was performed, the images were resampled to 0.5 × 0.5 × 0.5 mm³, mean-centered (i.e., mean subtracted from each image), and normalized by their standard deviation. The V-Net model was initialized with random weights, and training was completed once the average DICE plateaued on the validation dataset.

Performance

Performance of the CNN was assessed using DICE, Jaccard index, conformity coefficient, true positive rate, true negative rate, positive predictive value, and volume ratio (Table 3)³⁴. Muscle volume and MFI were measured for the left and right deep cervical extensors using the segmentation masks from each rater and the CNN model. MFI was calculated as the average fat-only signal divided by the sum of the average fat-only signal and the average water-only signal multiplied by 100. Reliability between the raters and the CNN was assessed using intraclass correlation coefficients (ICC_2,1), Pearson correlations, and Bland-Altman plots.

Table 3 Segmentation Performance Metrics.

Full size table

Clinical measures

To investigate the association of the CNN muscle volume and MFI and the clinical measures, the complete dataset (n = 39) was then input into the trained CNN, and segmentation masks were generated. The left and right muscle volume and MFI of the deep cervical extensors were then calculated and averaged. For pain, the 11-point numerical pain rating scale (NPRS) with anchors of “no pain” (0) and “extreme pain” (10) was used to assess neck pain³⁵. Neck-related function was assessed with the NDI³⁶. The NDI is a 10-item scaled questionnaire that assesses disability and functioning specific to the neck. The scores range from 0 to 100, and higher values indicate more neck-related disability in daily activities.

Relationships between muscle volume and MFI, and the clinical measures of pain and neck-related disability, were assessed with partial Pearson correlations correcting for age, gender, and BMI. We hypothesized that lower muscle volume and higher MFI would be correlated with higher pain and disability. Next, we divided the dataset into groups of recovered (NDI ≤ 28) versus persisting whiplash (NDI > 28) using the NDI at 3 months post MVC. Differences in average muscle volume and MFI between the recovered and persistent groups were assessed with independent samples t-tests after correcting for age, gender, and BMI. We hypothesized that the group reporting persistent disability and higher pain intensity would have lower muscle volume and increased MFI compared to the recovered group. As the hypotheses were directional, one-tailed tests were performed with an α < 0.05 considered statistically significant. Statistical analyses were performed using IBM SPSS Statistics (Version 21, Armonk, NY, USA).

Data Availability

The de-identified datasets used in this study are available from the corresponding author upon reasonable request.

References

Cloney, M. et al. Fatty infiltration of the cervical multifidus musculature and their clinical correlates in spondylotic myelopathy. Journal of clinical neuroscience: official journal of the Neurosurgical Society of Australasia 57, 208–213, https://doi.org/10.1016/j.jocn.2018.03.028 (2018).
Article Google Scholar
Fortin, M. et al. Relationship between cervical muscle morphology evaluated by MRI, cervical muscle strength and functional outcomes in patients with degenerative cervical myelopathy. Musculoskeletal science & practice 38, 1–7, https://doi.org/10.1016/j.msksp.2018.07.003 (2018).
Article Google Scholar
Smith, A. C. et al. Potential associations between chronic whiplash and incomplete spinal cord injury. Spinal cord series and cases 1, https://doi.org/10.1038/scsandc.2015.24 (2015).
Smith, A. C. et al. Ambulatory function in motor incomplete spinal cord injury: a magnetic resonance imaging study of spinal cord edema and lower extremity muscle morphometry. Spinal cord 55, 672–678, https://doi.org/10.1038/sc.2017.18 (2017).
Article CAS PubMed PubMed Central Google Scholar
Abbott, R. et al. The geography of fatty infiltrates within the cervical multifidus and semispinalis cervicis in individuals with chronic whiplash-associated disorders. The Journal of orthopaedic and sports physical therapy 45, 281–288, https://doi.org/10.2519/jospt.2015.5719 (2015).
Article PubMed Google Scholar
Karlsson, A. et al. An Investigation of Fat Infiltration of the Multifidus Muscle in Patients With Severe Neck Symptoms Associated With Chronic Whiplash-Associated Disorder. The Journal of orthopaedic and sports physical therapy 46, 886–893, https://doi.org/10.2519/jospt.2016.6553 (2016).
Article PubMed Google Scholar
Elliott, J. et al. The temporal development of fatty infiltrates in the neck muscles following whiplash injury: an association with pain and posttraumatic stress. PloS one 6, e21194, https://doi.org/10.1371/journal.pone.0021194 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Elliott, J. M. et al. Differential changes in muscle composition exist in traumatic and nontraumatic neck pain. Spine 39, 39–47, https://doi.org/10.1097/brs.0000000000000033 (2014).
Article PubMed Google Scholar
Dahlqvist, J. R., Vissing, C. R., Hedermann, G., Thomsen, C. & Vissing, J. Fat Replacement of Paraspinal Muscles with Aging in Healthy Adults. Medicine and science in sports and exercise 49, 595–601, https://doi.org/10.1249/mss.0000000000001119 (2017).
Article PubMed Google Scholar
Goubert, D. et al. Lumbar muscle structure and function in chronic versus recurrent low back pain: a cross-sectional study. The spine journal: official journal of the North American Spine Society 17, 1285–1296, https://doi.org/10.1016/j.spinee.2017.04.025 (2017).
Article Google Scholar
Elliott, J. M., Hancock, M. J., Crawford, R. J., Smith, A. C. & Walton, D. M. Advancing imaging technologies for patients with spinal pain: with a focus on whiplash injury. The spine journal: official journal of the North American Spine Society 18, 1489–1497, https://doi.org/10.1016/j.spinee.2017.06.015 (2018).
Article Google Scholar
Fortin, M., Lazary, A., Varga, P. P. & Battie, M. C. Association between paraspinal muscle morphology, clinical symptoms and functional status in patients with lumbar spinal stenosis. European spine journal: official publication of the European Spine Society, the European Spinal Deformity Society, and the European Section of the Cervical Spine Research Society 26, 2543–2551, https://doi.org/10.1007/s00586-017-5228-y (2017).
Article Google Scholar
Fortin, M., Omidyeganeh, M., Battie, M. C., Ahmad, O. & Rivaz, H. Evaluation of an automated thresholding algorithm for the quantification of paraspinal muscle composition from MRI images. Biomedical engineering online 16, 61, https://doi.org/10.1186/s12938-017-0350-y (2017).
Article PubMed PubMed Central Google Scholar
Sions, J. M., Coyle, P. C., Velasco, T. O., Elliott, J. M. & Hicks, G. E. Multifidi Muscle Characteristics and Physical Function Among Older Adults With and Without Chronic Low Back Pain. Archives of physical medicine and rehabilitation 98, 51–57, https://doi.org/10.1016/j.apmr.2016.07.027 (2017).
Article PubMed Google Scholar
Khan, A. B., Weiss, E. H., Khan, A. W., Omeis, I. & Verla, T. Back Muscle Morphometry: Effects on Outcomes of Spine Surgery. World neurosurgery 103, 174–179, https://doi.org/10.1016/j.wneu.2017.03.097 (2017).
Article PubMed Google Scholar
Storheim, K. et al. Fat in the lumbar multifidus muscles - predictive value and change following disc prosthesis surgery and multidisciplinary rehabilitation in patients with chronic low back pain and degenerative disc: 2-year follow-up of a randomized trial. BMC musculoskeletal disorders 18, 145, https://doi.org/10.1186/s12891-017-1505-5 (2017).
Article PubMed PubMed Central Google Scholar
O’Leary, S., Jull, G., Van Wyk, L., Pedler, A. & Elliott, J. Morphological changes in the cervical muscles of women with chronic whiplash can be modified with exercise-A pilot study. Muscle & nerve 52, 772–779, https://doi.org/10.1002/mus.24612 (2015).
Article Google Scholar
Global, regional, and national age-sex specific all-cause and cause-specific mortality for 240 causes of death, 1990-2013: a systematic analysis for the Global Burden of Disease Study 2013. Lancet (London, England) 385, 117–171, https://doi.org/10.1016/s0140-6736(14)61682-2 (2015).
Ivanova, J. I. et al. Real-world practice patterns, health-care utilization, and costs in patients with low back pain: the long road to guideline-concordant care. The spine journal: official journal of the North American Spine Society 11, 622–632, https://doi.org/10.1016/j.spinee.2011.03.017 (2011).
Article Google Scholar
Foster, N. E. et al. Prevention and treatment of low back pain: evidence, challenges, and promising directions. Lancet (London, England) 391, 2368–2383, https://doi.org/10.1016/s0140-6736(18)30489-6 (2018).
Article Google Scholar
Elliott, J. M. et al. Advancements in Imaging Technology: Do They (or Will They) Equate to Advancements in Our Knowledge of Recovery in Whiplash? The Journal of orthopaedic and sports physical therapy 46, 862–873, https://doi.org/10.2519/jospt.2016.6735 (2016).
Article PubMed Google Scholar
Gros, C. et al. Automatic segmentation of the spinal cord and intramedullary multiple sclerosis lesions with convolutional neural networks. NeuroImage 184, 901–915, https://doi.org/10.1016/j.neuroimage.2018.09.081 (2018).
Article PubMed Google Scholar
Perone, C. S., Calabrese, E. & Cohen-Adad, J. Spinal cord gray matter segmentation using deep dilated convolutions. Scientific reports 8, 5966, https://doi.org/10.1038/s41598-018-24304-3 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, G., Li, W., Ourselin, S. & Vercauteren, T. Automatic Brain Tumor Segmentation using Cascaded Anisotropic Convolutional Neural Networks. Preprint at, https://arxiv.org/abs/1709.00382 (2017).
Gomez-Andres, D. et al. Muscle imaging in laminopathies: synthesis study identifies meaningful muscles for follow-up. Muscle & nerve, https://doi.org/10.1002/mus.26312 (2018).
Milletari, F., Navab, N. & Ahmadi, S. -A. V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation. Preprint at, https://arxiv.org/abs/1606.04797 (2016).
Gibson, E. et al. NiftyNet: a deep-learning platform for medical imaging. Computer methods and programs in biomedicine 158, 113–122, https://doi.org/10.1016/j.cmpb.2018.01.025 (2018).
Article PubMed PubMed Central Google Scholar
Elliott, J. M. et al. The Rapid and Progressive Degeneration of the Cervical Multifidus in Whiplash: An MRI Study of Fatty Infiltration. Spine 40, E694–700, https://doi.org/10.1097/brs.0000000000000891 (2015).
Article PubMed PubMed Central Google Scholar
Abbott, R. et al. The qualitative grading of muscle fat infiltration in whiplash using fat and water magnetic resonance imaging. The spine journal: official journal of the North American Spine Society 18, 717–725, https://doi.org/10.1016/j.spinee.2017.08.233 (2018).
Article Google Scholar
Elliott, J. et al. Fatty infiltrate in the cervical extensor muscles is not a feature of chronic, insidious-onset neck pain. Clinical radiology 63, 681–687, https://doi.org/10.1016/j.crad.2007.11.011 (2008).
Article CAS PubMed Google Scholar
Elliott, J. M., Cornwall, J., Kennedy, E., Abbott, R. & Crawford, R. J. Towards defining muscular regions of interest from axial magnetic resonance imaging with anatomical cross-reference: part II - cervical spine musculature. BMC musculoskeletal disorders 19, 171, https://doi.org/10.1186/s12891-018-2074-y (2018).
Article PubMed PubMed Central Google Scholar
Spitzer, W. O. et al. Scientific monograph of the Quebec Task Force on Whiplash-Associated Disorders: redefining “whiplash” and its management. Spine 20, 1s–73s (1995).
Article CAS PubMed Google Scholar
Dixon, W. T. Simple proton spectroscopic imaging. Radiology 153, 189–194, https://doi.org/10.1148/radiology.153.1.6089263 (1984).
Article CAS PubMed Google Scholar
Prados, F. et al. Spinal cord grey matter segmentation challenge. NeuroImage 152, 312–329, https://doi.org/10.1016/j.neuroimage.2017.03.010 (2017).
Article PubMed PubMed Central Google Scholar
Walton, D. M., Elliott, J. M., Salim, S. & Al-Nasri, I. A reconceptualization of the pain numeric rating scale: Anchors and clinically important differences. Journal of hand therapy: official journal of the American Society of Hand Therapists 31, 179–183, https://doi.org/10.1016/j.jht.2017.12.008 (2018).
Article Google Scholar
Vernon, H. & Mior, S. The Neck Disability Index: a study of reliability and validity. Journal of manipulative and physiological therapeutics 14, 409–415 (1991).
CAS PubMed Google Scholar

Download references

Acknowledgements

Research reported in this publication was supported by the National Institute of Child Health and Human Development/National Center for Medical Rehabilitation Research under award numbers R01HD079076 and R03HD094577, the National Institute of Drug Abuse under award numbers T32DA035165 and K24DA029262, and the National Institute of Neurological Disorders and Stroke under award number K23NS104211. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

Systems Neuroscience and Pain Lab, Department of Anesthesiology, Perioperative and Pain Medicine, Stanford University, Palo Alto, CA, USA
Kenneth A. Weber, Kamran Eghtesad, Pranav A. Upadhyayula & Sean Mackey
School of Physical Therapy, Regis University, Denver, CO, USA
Andrew C. Smith
Department of Physical Therapy and Human Movement Sciences, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
Marie Wasielewski & James M. Elliott
Department of Radiology, Neuroradiology Section, Stanford University, Palo Alto, CA, USA
Max Wintermark
Statistics Department, Stanford University, Palo Alto, CA, USA
Trevor J. Hastie
Department of Radiology, Northwestern University, Chicago, IL, USA
Todd B. Parrish
Honorary Senior Fellow, School of Health and Rehabilitation Sciences, The University of Queensland, Queensland, Australia
James M. Elliott
Northern Sydney Local Health District, The Kolling Research Institute and The Faculty of Health Sciences, The University of Sydney, St. Leonards, NSW, Australia
James M. Elliott

Authors

Kenneth A. Weber
View author publications
You can also search for this author in PubMed Google Scholar
Andrew C. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Marie Wasielewski
View author publications
You can also search for this author in PubMed Google Scholar
Kamran Eghtesad
View author publications
You can also search for this author in PubMed Google Scholar
Pranav A. Upadhyayula
View author publications
You can also search for this author in PubMed Google Scholar
Max Wintermark
View author publications
You can also search for this author in PubMed Google Scholar
Trevor J. Hastie
View author publications
You can also search for this author in PubMed Google Scholar
Todd B. Parrish
View author publications
You can also search for this author in PubMed Google Scholar
Sean Mackey
View author publications
You can also search for this author in PubMed Google Scholar
James M. Elliott
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.A.W., A.C.S., T.B.P. and J.M.E. designed the study. A.C.S., M.W., T.B.P. and J.M.E. acquired the data. K.A.W., A.C.S., K.E., T.J.H. and P.A.U. analyzed the data. K.A.W. prepared all figures. K.A.W., A.C.S., T.J.H., M.W., S.M. and J.M.E. interpreted the results. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Kenneth A. Weber.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Weber, K.A., Smith, A.C., Wasielewski, M. et al. Deep Learning Convolutional Neural Networks for the Automatic Quantification of Muscle Fat Infiltration Following Whiplash Injury. Sci Rep 9, 7973 (2019). https://doi.org/10.1038/s41598-019-44416-8

Download citation

Received: 15 January 2019
Accepted: 15 May 2019
Published: 28 May 2019
DOI: https://doi.org/10.1038/s41598-019-44416-8

This article is cited by

Towards defining muscular regions of interest from axial magnetic resonance imaging with anatomical cross-reference: a scoping review of lateral hip musculature
- Zuzana Perraton
- Peter Lawrenson
- Adam I. Semciw
BMC Musculoskeletal Disorders (2022)
Morphology and composition of the ventral neck muscles in individuals with chronic whiplash related disorders compared to matched healthy controls: a cross-sectional case–control study
- Anneli Peolsson
- Anette Karlsson
- Olof Dahlqvist Leinhard
BMC Musculoskeletal Disorders (2022)
Convolutional neural networks for the automatic segmentation of lumbar paraspinal muscles in people with low back pain
- E. O. Wesselink
- J. M. Elliott
- K. A. Weber II
Scientific Reports (2022)
Analysis of the paraspinal muscle morphology of the lumbar spine using a convolutional neural network (CNN)
- David Baur
- Richard Bieck
- Anna Voelker
European Spine Journal (2022)
Classification of rotator cuff tears in ultrasound images using deep learning models
- Thao Thi Ho
- Geun-Tae Kim
- Eun-Kee Park
Medical & Biological Engineering & Computing (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

CNN accuracy and reliability

Association with clinical measures of pain and neck-related disability

Discussion

Methods

Participants

Image acquisition and processing

Data augmentation

V-Net

Training

Performance

Clinical measures

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links