Repeatability and reproducibility of FreeSurfer, FSL-SIENAX and SPM brain volumetric measurements and the effect of lesion filling in multiple sclerosis

Guo, Chunjie; Ferreira, Daniel; Fink, Katarina; Westman, Eric; Granberg, Tobias

doi:10.1007/s00330-018-5710-x

Repeatability and reproducibility of FreeSurfer, FSL-SIENAX and SPM brain volumetric measurements and the effect of lesion filling in multiple sclerosis

Neuro
Open access
Published: 21 September 2018

Volume 29, pages 1355–1364, (2019)
Cite this article

Download PDF

You have full access to this open access article

European Radiology Aims and scope Submit manuscript

Repeatability and reproducibility of FreeSurfer, FSL-SIENAX and SPM brain volumetric measurements and the effect of lesion filling in multiple sclerosis

Download PDF

Chunjie Guo^1,2,
Daniel Ferreira¹,
Katarina Fink^3,4,
Eric Westman¹ &
…
Tobias Granberg ORCID: orcid.org/0000-0001-6700-1022^3,5

8211 Accesses
80 Citations
10 Altmetric
1 Mention
Explore all metrics

Abstract

Objectives

To compare the cross-sectional robustness of commonly used volumetric software and effects of lesion filling in multiple sclerosis (MS).

Methods

Nine MS patients (six females; age 38±13 years, disease duration 7.3±5.2 years) were scanned twice with repositioning on three MRI scanners (Siemens Aera 1.5T, Avanto 1.5T, Trio 3.0T) the same day. Volumetric T₁-weighted images were processed with FreeSurfer, FSL-SIENAX, SPM and SPM-CAT before and after 3D FLAIR lesion filling with LST. The whole-brain, grey matter (GM) and white matter (WM) volumes were calculated with and without normalisation to the intracranial volume or FSL-SIENAX scaling factor. Robustness was assessed using the coefficient of variation (CoV).

Results

Variability in volumetrics was lower within than between scanners (CoV 0.17–0.96% vs. 0.65–5.0%, p<0.001). All software provided similarly robust segmentations of the brain volume on the same scanner (CoV 0.17–0.28%, p=0.076). Normalisation improved inter-scanner reproducibility in FreeSurfer and SPM-based methods, but the FSL-SIENAX scaling factor did not improve robustness. Generally, SPM-based methods produced the most consistent volumetrics, while FreeSurfer was more robust for WM volumes on different scanners. FreeSurfer had more robust normalised brain and GM volumes on different scanners than FSL-SIENAX (p=0.004). MS lesion filling changed the output of FSL-SIENAX, SPM and SPM-CAT but not FreeSurfer.

Conclusions

Consistent use of the same scanner is essential and normalisation to the intracranial volume is recommended for multiple scanners. Based on robustness, SPM-based methods are particularly suitable for cross-sectional volumetry. FreeSurfer poses a suitable alternative with WM segmentations less sensitive to MS lesions.

Key Points

• The same scanner should be used for brain volumetry. If different scanners are used, the intracranial volume normalisation improves the FreeSurfer and SPM robustness (but not the FSL scaling factor).

• FreeSurfer, FSL and SPM all provide robust measures of the whole brain volume on the same MRI scanner. SPM-based methods overall provide the most robust segmentations (except white matter segmentations on different scanners where FreeSurfer is more robust).

• MS lesion filling with Lesion Segmentation Toolbox changes the output of FSL-SIENAX and SPM. FreeSurfer output is not affected by MS lesion filling since it already takes white matter hypointensities into account and is therefore particularly suitable for MS brain volumetry.

Magnetic resonance imaging in Alzheimer’s disease and mild cognitive impairment

Article Open access 17 August 2018

Avinash Chandra, George Dervenoulas, … for the Alzheimer’s Disease Neuroimaging Initiative

Conversion map from quantitative parameter mapping to myelin water fraction: comparison with R1·R2* and myelin water fraction in white matter

Article 06 April 2024

Shun Kitano, Yuki Kanazawa, … Akihiro Haga

A multimodal clinical diagnostic approach using MRI and 18F-FDG-PET for antemortem diagnosis of TDP-43 in cases with low–intermediate Alzheimer’s disease neuropathologic changes and primary age-related tauopathy

Article 05 April 2024

Anna Lavrova, Nha Trang Thu Pham, … Keith A. Josephs

Introduction

Multiple sclerosis (MS) is a common chronic neuroinflammatory and neurodegenerative disease [1]. Demyelinating lesions in the brain and spinal cord are the pathological hallmarks of MS, which are detectable in vivo with magnetic resonance imaging (MRI). MRI has therefore become an essential tool for the diagnosis and monitoring of disease activity in MS [1, 2]. In MS, the lesion volume reflects the inflammatory burden while atrophy measures quantify neurodegenerative aspects of the disease, which play an important role in all disease stages [3]. Volumetry is therefore commonly used as a secondary endpoint in clinical trials [4]. Furthermore, volumetry can be helpful in improving our understanding of the disease since atrophy patterns have been shown to be different in MS compared to other demyelinating disorders [5].

Obtaining robust imaging biomarkers in MS for assessment of the inflammatory and neurodegenerative burden of disease is, however, challenging [3]. Brain volumetry is influenced by several subject-related factors such as hydration status, inflammation and clinical therapy [6]. MS lesions can specifically affect tissue segmentations since white matter (WM) lesions can be misclassified as grey matter (GM) or cerebrospinal fluid (CSF) [7, 8]. Brain volumetry is also impacted by technical factors such as MRI field strength and scanner model, as well as post-processing related issues [8,9,10]. Understanding the effect and magnitude of technical factors is important when planning MRI studies [8].

There are several freely available tools for automated brain volumetry that are commonly applied in MS. Popular choices include FreeSurfer [11], Structural Image Evaluation with Normalisation of Atrophy Cross-sectional (SIENAX) [12] and Statistical Parametric Mapping (SPM) [13]. These software can automatically pre-process and segment T₁-weighted images of the brain. FreeSurfer is computationally demanding and is based on a combined volumetric- and surface-based segmentation aimed to reduce partial volume effects from the convoluted shape of the cortical ribbon [11]. FreeSurfer uses a template-driven approach to provide a detailed parcellation and segmentation of the cortex and subcortical structures. SIENAX, part of the FMRIB Software Library (FSL), is computationally less demanding but only provides measurements of the gross tissue volumes (WM, GM and CSF) [12]. FSL-SIENAX relies on registration to the Montreal Neurological Institute 152 template for skull stripping and then performs intensity-based segmentation; the template registration step provides a scaling factor that can be used for normalisation. SPM is based on non-linear registration of the brain to a template and segments brain tissues by assigning tissue probabilities per voxel [13]. Computational Anatomy Toolbox (SPM-CAT) is an extension for SPM that provides segmentations with a different segmentation approach based on spatial interpolation, denoising, additional affine registration steps, local intensity correction, adaptive segmentation and partial volume segmentation [14]. Like FSL-SIENAX, the SPM-based methods are less computationally demanding, relative to FreeSurfer, and only provide gross brain tissue volumes.

The primary purpose of this study was to compare the repeatability on the same scanner and the reproducibility on different scanners for brain tissue segmentations in FreeSurfer, FSL-SIENAX, SPM and SPM-CAT. A secondary aim was to study the effect of automated lesion filling to reduce MS lesion-related brain tissue segmentation bias.

Materials and methods

Participants

Nine MS patients (six females, three males; mean age 38±13 years; mean disease duration 7.3±5.2 years) diagnosed according to the McDonald 2010 diagnostic criteria [15], were prospectively recruited from the outpatient clinic at the Department of Neurology, Karolinska University Hospital in Huddinge, Stockholm, Sweden, among consecutive patients referred for a clinical MRI. The participants were representative of the MS population in Sweden, with all subtypes represented in proportion to their frequency in clinical practice: six relapsing-remitting (RR), two secondary progressive, one primary progressive [16]. Exclusion criteria were contraindications to MRI, neurological co-morbidities or a history of head trauma (none were excluded). The physical disability of the patients was assessed according to the Expanded Disability Status Scale [17] by an MS-experienced neurologist (K.F.). The median physical disability score was 2.0 (range 1.0–5.5). The study was approved by the local ethics committee and written informed consent was obtained from all participants.

MRI protocol

All participants were scanned twice on the same day on all three clinical MRI systems used in the study: Siemens Aera (1.5 T), Avanto (1.5 T) and Trio (3.0 T) (Siemens Healthcare, Erlangen, Germany). A 3D T₁-weighted magnetisation-prepared rapid gradient-echo (MPRAGE) sequence was acquired twice with repositioning in between, resulting in a total of six T₁-weighted volumes per participant. A representative example of the MPRAGE acquisitions is illustrated in Fig. 1. One 3D T₂-weighted Fluid-Attenuated Inversion Recovery (FLAIR) was additionally acquired on each scanner for lesion segmentation. The MRI acquisition parameters are detailed in Table 1.

Table 1 MRI acquisition parameters

Full size table

Image analysis

Each of the six 3D T₁-weighted volumes from each participant was analysed cross-sectionally and processed in FreeSurfer, FSL-SIENAX, SPM and SPM-CAT. No additional pre-processing or manual intervention was performed to avoid introducing biases in the tissue segmentations. All input and output underwent visual quality assurance by an experienced rater (T.G.) and were found to be of satisfactory quality. Examples of the volumetric output are presented in Fig. 2.

FreeSurfer

FreeSurfer 6.0.0 (http://surfer.nmr.mgh.harvard.edu, Harvard University, Boston, MA, USA) was used to perform automatic processing as previously described [11, 18]. FreeSurfer was run with the options ‘-mprage’ and for the 3.0 T data also ‘-3T’, as recommended by its developers. The variable ‘Brain Segmentation Volume Without Ventricles from Surf’ was used as the FreeSurfer estimation of the brain volume, which excludes the brainstem. The variable ‘Total grey matter volume’ was used as the estimation of the GM volume. The WM volume was assessed by summing the ‘cerebral WM’, ‘cerebellar WM’, ‘brainstem’ and ‘corpus callosum’ FreeSurfer variables. It is notable that FreeSurfer specifically segments white matter hypointensities. For normalisation purposes, the brain volume, GM volume and WM volume were divided by the ‘Estimated Total Intracranial Volume’.

FSL-SIENAX

The SIENAX method implemented in FSL 5.0 (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/SIENA, Oxford University, Oxford, UK) was used to obtain an automated quantification of the brain volume, GM volume and WM volume with automatic normalisation for head size with a subject-specific scaling factor, as previously described [19]. For this study, FSL-SIENAX was run with the optimised brain extraction parameters ‘-B -f 0.1’, in accordance with previous recommendations for MS studies [20].

SPM

Statistical Parametric Mapping, SPM12, (http://www.fil.ion.ucl.ac.uk/spm, University College London, London, UK) was used to automatically obtain the GM volume, WM volume and total intracranial CSF volume according to an adapted workflow as previously described [21]. The segment tool was run using the default settings. The brain volume in SPM was defined as the sum of the GM and WM volumes. For normalisation, the intracranial volume was used, which was calculated by summing the GM, WM and CSF volumes.

SPM-CAT

The Computational Anatomy Toolbox (CAT) 12 is an extension to SPM12 (http://www.neuro.uni-jena.de/cat/index.html, Jena University Hospital, Jena, Germany) [14]. The cross-sectional data segmentation tool was run using the default settings. The brain volume in SPM-CAT was defined as the sum of the GM and WM volumes and the total intracranial volume was used for normalisation.

Lesion filling

Lesion filling was performed on all 3D FLAIR volumes in SPM12 using the lesion probability algorithm in Lesion Segmentation Toolbox 2.0.10 (LST, http://www.applied-statistics.de/lst.html,Technische Universität München, Munich, Germany) [22]. LST provides an automated probabilistic lesion segmentation, specifically developed for MS. It also provides automatic lesion filling without the need for parameter optimisation or binary thresholding of the lesion masks. The FLAIR lesion probability maps were used to perform lesion filling on the corresponding T₁-weighted volumes from the same scanner [22]. Figure 3 illustrates the input and output of the lesion filling procedure.

Statistical analysis

SPSS Statistics 24.0 was used for the statistical analysis (IBM Corporation, Armonk, NY, USA). Due to the limited sample size, the data were treated as non-parametric. The robustness of repeated measures was assessed using the within-subject coefficient of variation (CoV). For intra-scanner repeatability, the measurements from the first and the second scan from the same scanner were used: CoV_{Intra-scanner} = SD/mean of Scan 1 and Scan 2. For the inter-scanner reproducibility, the first scans from each of the three scanners were used: CoV_{Inter-scanner} = SD/mean of Scan 1_Aera, Scan 1_Avanto, Scan 1_Trio. Paired comparisons were tested using the Wilcoxon signed ranks test with two-tailed exact significance. Group comparisons between the four software were tested using the Friedman test and in case of significant differences among the software, post hoc paired analyses were performed with the Wilcoxon signed ranks test. Correction for multiple comparisons was performed using the Benjamini-Hochberg procedure separately for the intra-scanner CoVs, inter-scanner CoVs and for each Friedman test post hoc analysis [23]. A corrected p<0.05 was considered statistically significant. All reported p-values were significant after correction for multiple comparisons, unless otherwise specified.

Results

Comparability of the brain volumetry from different software

There were notable differences in the numeric brain tissue segmentation output from FreeSurfer, SIENAX, SPM and SPM-CAT, as detailed in Table 2. A full report of the volumetric output can be found in Online Supplementary Table 1.

Table 2 Brain tissue volumes with/without normalisation and with/without lesion filling

Full size table

Repeatability and reproducibility of non-normalised brain volumetry

Repeated measurements on the same scanner generally resulted in lower variability than measurements on the different scanners (median CoV 0.17–0.96% vs. 0.65–5.0%, p<0.001 by Wilcoxon signed ranks test), as further detailed in Table 3. Overall, the brain volume was the most robust tissue segmentation within scanners, with the lowest variability (median CoV 0.17–0.28%), and a comparable performance of all segmentation methods (p=0.076 by Friedman test). For all other volumetrics there were, however, differences between the software, both for the intra-scanner repeatability (WM volume p=0.017, GM volume p=0.004, normalised brain volume p=0.012, normalised WM volume p<0.001 and normalised GM volume p=0.004) and the inter-scanner reproducibility (brain volume p=0.002, WM volume p<0.001, GM volume p<0.001, normalised brain volume p<0.001, normalised WM volume p=0.007 and normalised GM volume p=0.001), all by the Friedman test. Post hoc analyses with corrections for multiple comparisons showed that the SPM-based methods generally had the lowest CoV of the four software, reflecting good repeatability and reproducibility, with the exception of WM segmentations on different scanners, where FreeSurfer was more robust. The two SPM methods performed similarly in most regards, with the exception of inter-scanner WM segmentations where SPM-CAT had significantly lower variability.

Table 3 Repeatability and reproducibility of the brain tissue volumes

Full size table

Effects of normalisation on brain volumetry

Normalising the brain tissue volumes did not have a statistically significant positive effect on the intra-scanner repeatability, as further detailed in Table 3. On the contrary, for the FSL-SIENAX normalised brain volume there was a worsening of the intra-scanner repeatability after normalisation with the scaling factor. Normalisation to the FSL-SIENAX scaling factor did not significantly improve the inter-scanner reproducibility either. In contrast, normalisation to the intracranial volume often improved the reproducibility between scanners for FreeSurfer and the SPM methods. Specifically, significant improvements in the reproducibility were seen for the FreeSurfer normalised brain volume and normalised grey matter volume as well as for the normalised brain volume and white matter volume for both SPM-based methods. When normalising the tissues, FreeSurfer became more robust than FSL-SIENAX across scanners for both the normalised brain volume and normalised GM volume.

Effects of MS lesion filling

The median WM lesion volume was 1.8 ml (range 0.33–24 ml). There was no statistically significant effect of lesion filling on the FreeSurfer volumes, as detailed in Table 2. However, lesion filling caused changes in volumetrics from FSL-SIENAX, SPM and SPM-CAT. Most notably, highly significant changes were seen for all tissue compartments in SPM-CAT with increases in the estimations of the brain and WM volumes and decreases in the GM estimations, both for the non-normalised and normalised data. Lesion filling did not significantly affect the inter-scanner CoV for any of the software (data not shown).

Discussion

We present a prospective head-to-head comparison of the robustness of four of the most popular freely available brain segmentation tools in a representative real-life MS cohort scanned twice on three different scanners on the same day. New versions of the tested software have recently been released. An important contribution of the current study is therefore that we provide an up-to-date evaluation of the intra- and inter-scanner variability of brain tissue measurements in MS, facilitating an appropriate choice of software for volumetric studies.

We found that the volumetric output differed between the software, which is expected since they have large technical differences [11,12,13]. Previous studies of earlier versions of the software have indeed also found significant differences in the output, both numerically and topographically [24,25,26]. While most previous studies have focused on differences and similarities in the segmentation results [24,25,26], the current study mainly focused on the robustness of the segmentation tools. Overall, we report that the variability in volumetrics was lower on the same scanner than between scanners, supporting recommendations to follow individuals on the same scanner [27, 28]. Although brain atrophy rates can be double that of normal aging in untreated MS patients [29], treated MS patients have atrophy rates around 0.5%/year [30]. To accurately capture atrophy rates, it is therefore important to have a variability lower than that. Our reported CoVs for intra-scanner (0.17–0.92%) and inter-scanner (0.65–5.0%) variability suggest that measurements are feasible within 1–2 years for the most robust methods on the same scanner. In contrast, several years need to pass to be able to capture atrophy on different scanners, even with normalisation.

SPM-based methods overall had the best repeatability and reproducibility of the four software (except WM segmentations where FreeSurfer was more robust) and are therefore particularly suitable for cross-sectional MS studies. This is in line with a previous international study of two MS patients scanned at multiple sites and a segmentation challenge in persons with diabetes mellitus and cardiovascular risk factors [31, 32]. We also found that the whole-brain volume was the most robust volumetric, consistent with previous results [31, 33]. This could be explained by lower variability with a large volume of interest and a larger contrast difference of CSF versus brain parenchyma compared to GM/WM segmentations. In studies with differences in the MRI protocols, it can therefore be recommended to primarily focus on the brain volume. Interestingly, there was no significant difference in the intra-scanner robustness of the software for the brain volume, meaning that all studied software can be favoured for cross-sectional MS studies of the brain volume.

The current study focuses on some of the most commonly used freely available automated segmentation tools for brain volumetrics in MS, but there are several other segmentation tools available, such as AFNI and BrainSuite. While we provide information on the robustness of the studied software, the choice of software must also take other factors into account, such as which types of images are available, user skills and technical requirements [8]. In this study, we only provided the T₁-weighted images for segmentation, which is the only image contrast that FSL-SIENAX and SPM-CAT are optimised for [12, 14]. Previous results with segmentation based on multiple contrasts or multi-parametric maps have shown especially good robustness [32,33,34]. Evaluating such approaches is therefore an interesting avenue for future studies. From a technical standpoint, full functionality of SPM requires a MATLAB license [13], but a standalone version of SPM or FreeSurfer could be suitable alternatives since FreeSurfer was found to provide more robust normalised measurements between scanners than FSL-SIENAX, consistent with previous results [35]. While FreeSurfer is computationally more intense than the other software, it also provides more detailed regional morphometry.

Normalisation of the brain volumetrics to the intracranial volume generally improved the comparability of results between scanners, in line with previous recommendations [8]. This is likely due to a reduction of scaling effects between scanners [8]. However, using the scaling factor in FSL-SIENAX did not improve the robustness, suggesting that such normalisation may not be sufficient. Overall, there was also a lack of improvement in the repeatability within scanners for all three software with the normalisation. This finding likely reflects that normalisation procedures are less critical if measurements are produced on the same scanner. In clinical practice and longitudinal studies it is, however, important to consider that the variability in measurements are likely to be higher than that presented in this study, where all measurements were performed on the same day [31].

In terms of the effect of MS lesion filling, we found that lesion filling affected the volumetric results mainly for SPM and SPM-CAT, but also for FSL-SIENAX. These results are consistent with a previous MS study showing increased accuracy of SPM8 segmentations after lesion filling [36]. Of note, no effect was seen on the FreeSurfer volumes with lesion filling, likely due to the fact that FreeSurfer specifically segments WM T₁-hypointensities and thus take these into account during the WM segmentations [11].

This study has some limitations. First, the sample size is small, but in total there were 54 measurements since each patient was scanned twice on three scanners and the study showed statistically significant differences in robustness of the software. Second, the MRI scanners were all from the same manufacturer, while higher inter-scanner variability would be expected with multiple vendors [31]. Third, although the results of the study could change by adjusting acquisition or processing parameters, these results reflect the standard procedures for MRI in MS at Karolinska University Hospital and we used recommended post-processing options [11, 13, 20]. There was a difference in the resolution between the FLAIR volumes, which could affect the lesion filling but this difference was consistent for the input of all software. Lastly, the current study focused solely on cross-sectional segmentation methods while the robustness of segmentations can be improved by including a priori knowledge of several time-points [19, 35, 37]. We therefore recommend future studies to also focus on comparing the robustness of longitudinal segmentation methods.

In conclusion, the results highlight the importance of consistently using the same scanner and normalising to the intracranial volume when multiple scanners are used. The output from FreeSurfer, FSL-SIENAX and SPM differ but all three software provide cross-sectional brain volume segmentations with similar intra-scanner robustness. SPM-based methods overall produced the most consistent results, while FreeSurfer had less variability in WM volume segmentations across scanners and was less affected by WM lesions.

Abbreviations

CoV:: Coefficient of variation
CSF:: Cerebrospinal fluid
FLAIR:: Fluid-attenuated inversion recovery
FSL:: Functional magnetic resonance imaging of the brain software library
GM:: Grey matter
LST:: Lesion segmentation toolbox
MS:: Multiple sclerosis
SIENAX:: Structural image evaluation with normalisation of atrophy cross-sectional
SPM:: Statistical parametric mapping
WM:: White matter

References

Thompson AJ, Banwell BL, Barkhof F et al (2018) Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria. Lancet Neurol 17:162–173. https://doi.org/10.1016/S1474-4422(17)30470-2
Article PubMed Google Scholar
Filippi M, Rocca MA, Ciccarelli O et al (2016) MRI criteria for the diagnosis of multiple sclerosis: MAGNIMS consensus guidelines. Lancet Neurol 15:292–303. https://doi.org/10.1016/S1474-4422(15)00393-2
Article PubMed PubMed Central Google Scholar
Enzinger C, Barkhof F, Ciccarelli O et al (2015) Nonconventional MRI and microstructural cerebral changes in multiple sclerosis. Nat Rev Neurol 11:676–686. https://doi.org/10.1038/nrneurol.2015.194
Article PubMed Google Scholar
Barkhof F, Calabresi PA, Miller DH, Reingold SC (2009) Imaging outcomes for neuroprotection and repair in multiple sclerosis trials. Nat Rev Neurol 5:256–266. https://doi.org/10.1038/nrneurol.2009.41
Article PubMed Google Scholar
Liu Y, Duan Y, Huang J et al (2018) Different patterns of longitudinal brain and spinal cord changes and their associations with disability progression in NMO and MS. Eur Radiol 28:96–103. https://doi.org/10.1007/s00330-017-4921-x
Article PubMed Google Scholar
Sampat MP, Healy BC, Meier DS, Dell'Oglio E, Liguori M, Guttmann CR (2010) Disease modeling in multiple sclerosis: assessment and quantification of sources of variability in brain parenchymal fraction measurements. Neuroimage 52:1367–1373. https://doi.org/10.1016/j.neuroimage.2010.03.075
Chard DT, Jackson JS, Miller DH, Wheeler-Kingshott CAM (2010) Reducing the impact of white matter lesions on automated measures of brain gray and white matter volumes. J Magn Reson Imaging 32:223–228. https://doi.org/10.1002/jmri.22214
Article PubMed Google Scholar
Vrenken H, Jenkinson M, Horsfield MA et al (2013) Recommendations to improve imaging and analysis of brain lesion load and atrophy in longitudinal studies of multiple sclerosis. J Neurol 260:2458–2471. https://doi.org/10.1007/s00415-012-6762-5
Article CAS PubMed Google Scholar
Han X, Jovicich J, Salat D et al (2006) Reliability of MRI-derived measurements of human cerebral cortical thickness: the effects of field strength, scanner upgrade and manufacturer. Neuroimage 32:180–194. https://doi.org/10.1016/j.neuroimage.2006.02.051
Article PubMed Google Scholar
Jovicich J, Czanner S, Han X et al (2009) MRI-derived measurements of human subcortical, ventricular and intracranial brain volumes: reliability effects of scan sessions, acquisition sequences, data analyses, scanner upgrade, scanner vendors and field strengths. Neuroimage 46:177–192. https://doi.org/10.1016/j.neuroimage.2009.02.010
Article PubMed Google Scholar
Fischl B (2012) FreeSurfer. Neuroimage 62:774–781. https://doi.org/10.1016/j.neuroimage.2012.01.021
Article PubMed Google Scholar
Jenkinson M, Beckmann CF, Behrens TE, Woolrich MW, Smith SM (2012) FSL. Neuroimage 62:782–790. https://doi.org/10.1016/j.neuroimage.2011.09.015
Ashburner J, Friston KJ (2005) Unified segmentation. Neuroimage 26:839–851. https://doi.org/10.1016/j.neuroimage.2005.02.018
Article PubMed Google Scholar
Gaser C, Dahnke R (2016) CAT - a computational anatomy toolbox for the analysis of structural MRI data. p 1
Polman CH, Reingold SC, Banwell B et al (2011) Diagnostic criteria for multiple sclerosis: 2010 revisions to the McDonald criteria. Ann Neurol 69:292–302. https://doi.org/10.1002/ana.22366
Article PubMed PubMed Central Google Scholar
Lublin FD, Reingold SC, Cohen JA et al (2014) Defining the clinical course of multiple sclerosis: the 2013 revisions. Neurology 83:278–286. https://doi.org/10.1212/WNL.0000000000000560
Article PubMed PubMed Central Google Scholar
Kurtzke JF (1983) Rating neurologic impairment in multiple sclerosis an expanded disability status scale (EDSS). Neurology 33:1444–1452. https://doi.org/10.1212/WNL.33.11.1444
Article CAS PubMed Google Scholar
Ferreira D, Voevodskaya O, Imrell K et al (2014) Multiple sclerosis patients lacking oligoclonal bands in the cerebrospinal fluid have less global and regional brain atrophy. J Neuroimmunol 274:149–154. https://doi.org/10.1016/j.jneuroim.2014.06.010
Article CAS PubMed Google Scholar
Smith SM, Zhang Y, Jenkinson M et al (2002) Accurate, robust, and automated longitudinal and cross-sectional brain change analysis. Neuroimage 17:479–489
Article PubMed Google Scholar
Popescu V, Battaglini M, Hoogstrate WS et al (2012) Optimizing parameter choice for FSL-Brain Extraction Tool (BET) on 3D T1 images in multiple sclerosis. Neuroimage 61:1484–1494. https://doi.org/10.1016/j.neuroimage.2012.03.074
Article CAS PubMed Google Scholar
Ashburner J (2007) A fast diffeomorphic image registration algorithm. Neuroimage 38:95–113. https://doi.org/10.1016/j.neuroimage.2007.07.007
Article PubMed Google Scholar
Schmidt P, Gaser C, Arsic M et al (2012) An automated tool for detection of FLAIR-hyperintense white-matter lesions in multiple sclerosis. Neuroimage 59:3774–3783. https://doi.org/10.1016/j.neuroimage.2011.11.032
Article PubMed Google Scholar
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol 57:289–300
Google Scholar
Klauschen F, Goldman A, Barra V, Meyer-Lindenberg A, Lundervold A (2009) Evaluation of automated brain MR image segmentation and volumetry methods. Hum Brain Mapp 30:1310–1327. https://doi.org/10.1002/hbm.20599
Heinen R, Bouvy WH, Mendrik AM, Viergever MA, Biessels GJ, de Bresser J (2016) Robustness of automated methods for brain volume measurements across different MRI field strengths. PLoS One 11:e0165719. https://doi.org/10.1371/journal.pone.0165719
Kazemi K, Noorizadeh N (2014) Quantitative comparison of SPM, FSL, and brainsuite for brain MR image segmentation. J Biomed Phys Eng 4:13–26
CAS PubMed PubMed Central Google Scholar
Wattjes MP, Rovira À, Miller D et al (2015) Evidence-based guidelines: MAGNIMS consensus guidelines on the use of MRI in multiple sclerosis—establishing disease prognosis and monitoring patients. Nat Rev Neurol 11:597–606. https://doi.org/10.1038/nrneurol.2015.157
Vågberg M, Axelsson M, Birgander R et al (2017) Guidelines for the use of magnetic resonance imaging in diagnosing and monitoring the treatment of multiple sclerosis: recommendations of the Swedish Multiple Sclerosis Association and the Swedish Neuroradiological Society. Acta Neurol Scand 135:17–24. https://doi.org/10.1111/ane.12667
Article PubMed Google Scholar
De Stefano N, Giorgio A, Battaglini M et al (2010) Assessing brain atrophy rates in a large population of untreated multiple sclerosis subtypes. Neurology 74:1868–1876. https://doi.org/10.1212/WNL.0b013e3181e24136
Article PubMed Google Scholar
De Stefano N, Stromillo ML, Giorgio A et al (2015) Establishing pathological cut-offs of brain atrophy rates in multiple sclerosis. J Neurol Neurosurg Psychiatry:jnnp-2014-309903. https://doi.org/10.1136/jnnp-2014-309903
Biberacher V, Schmidt P, Keshavan A et al (2016) Intra- and interscanner variability of magnetic resonance imaging based volumetry in multiple sclerosis. Neuroimage 142:188–197. https://doi.org/10.1016/j.neuroimage.2016.07.035
Article PubMed Google Scholar
Mendrik AM, Vincken KL, Kuijf HJ et al (2015) MRBrainS challenge: online evaluation framework for brain image segmentation in 3T MRI scans. In: Computational intelligence and neuroscience. https://www.hindawi.com/journals/cin/2015/813696/. Accessed 11 Jul 2018
Granberg T, Uppman M, Hashim F et al (2016) Clinical feasibility of synthetic mri in multiple sclerosis: a diagnostic and volumetric validation study. AJNR Am J Neuroradiol 37:1023–1029. https://doi.org/10.3174/ajnr.A4665
Article CAS PubMed PubMed Central Google Scholar
West J, Warntjes JB, Lundberg P (2011) Novel whole brain segmentation and volume estimation using quantitative MRI. Eur Radiol 22:998–1007. https://doi.org/10.1007/s00330-011-2336-7
Article PubMed Google Scholar
Durand-Dubief F, Belaroussi B, Armspach JP et al (2012) Reliability of longitudinal brain volume loss measurements between 2 sites in patients with multiple sclerosis: comparison of 7 quantification techniques. AJNR Am J Neuroradiol. https://doi.org/10.3174/ajnr.A3107
Valverde S, Oliver A, Roura E et al (2015) Quantifying brain tissue volume in multiple sclerosis with automated lesion segmentation and filling. Neuroimage Clin 9:640–647. https://doi.org/10.1016/j.nicl.2015.10.012
Article PubMed PubMed Central Google Scholar
Reuter M, Schmansky NJ, Rosas HD, Fischl B (2012) Within-subject template estimation for unbiased longitudinal image analysis. Neuroimage 61:1402–1418. https://doi.org/10.1016/j.neuroimage.2012.02.084
Article PubMed Google Scholar

Download references

Acknowledgements

We would like to thank the participants and their families as well as the staff at the MRI at Karolinska University Hospital in Huddinge for making this study possible.

Funding

The study was supported by grants provided by Stockholm County Council (ALF grants 20120213 and 20150166). Dr. Guo was supported by the Bilateral Scholarship Programme from China Scholarship Council and Swedish Institute, Young Scholars Program of the National Natural Science Foundation of China (No. 81600923) and Young Scholars Program of the First Hospital of Jilin University (No. jdyy72016007). Dr. Granberg was supported by the Swedish Society for Medical Research.

Author information

Authors and Affiliations

Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
Chunjie Guo, Daniel Ferreira & Eric Westman
Department of Radiology, The First Hospital of Jilin University, Changchun, China
Chunjie Guo
Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
Katarina Fink & Tobias Granberg
Department of Neurology, Karolinska University Hospital, Stockholm, Sweden
Katarina Fink
Division of Neuroradiology, Department of Radiology, Karolinska University Hospital, 141 86, Stockholm, Sweden
Tobias Granberg

Authors

Chunjie Guo
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Katarina Fink
View author publications
You can also search for this author in PubMed Google Scholar
Eric Westman
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Granberg
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tobias Granberg.

Ethics declarations

Guarantor

The scientific guarantor of this publication is Tobias Granberg, MD, PhD.

Conflict of interest

The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article.

Statistics and biometry

One of the authors has significant statistical expertise.

No complex statistical methods were necessary for this paper.

Informed consent

Written informed consent was obtained from all subjects (patients) in this study.

Ethical approval

Institutional Review Board approval was obtained.

Methodology

• Prospective

• Cross-sectional study/observational

• Performed at one institution

Electronic supplementary material

Supplementary Table 1

(XLSX 60 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Guo, C., Ferreira, D., Fink, K. et al. Repeatability and reproducibility of FreeSurfer, FSL-SIENAX and SPM brain volumetric measurements and the effect of lesion filling in multiple sclerosis. Eur Radiol 29, 1355–1364 (2019). https://doi.org/10.1007/s00330-018-5710-x

Download citation

Received: 10 April 2018
Revised: 14 July 2018
Accepted: 07 August 2018
Published: 21 September 2018
Issue Date: 02 March 2019
DOI: https://doi.org/10.1007/s00330-018-5710-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Repeatability and reproducibility of FreeSurfer, FSL-SIENAX and SPM brain volumetric measurements and the effect of lesion filling in multiple sclerosis

Abstract

Objectives

Methods

Results

Conclusions

Key Points

Similar content being viewed by others

Magnetic resonance imaging in Alzheimer’s disease and mild cognitive impairment

Conversion map from quantitative parameter mapping to myelin water fraction: comparison with R1·R2* and myelin water fraction in white matter

A multimodal clinical diagnostic approach using MRI and 18F-FDG-PET for antemortem diagnosis of TDP-43 in cases with low–intermediate Alzheimer’s disease neuropathologic changes and primary age-related tauopathy

Introduction

Materials and methods

Participants

MRI protocol

Image analysis

FreeSurfer

FSL-SIENAX

SPM

SPM-CAT

Lesion filling

Statistical analysis

Results

Comparability of the brain volumetry from different software

Repeatability and reproducibility of non-normalised brain volumetry

Effects of normalisation on brain volumetry

Effects of MS lesion filling

Discussion

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Guarantor

Conflict of interest

Statistics and biometry

Informed consent

Ethical approval

Methodology

Electronic supplementary material

Supplementary Table 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation