Elsevier

NeuroImage

Volume 40, Issue 3, 15 April 2008, Pages 1157-1165
NeuroImage

Detecting change in BOLD signal between sessions for atlas-based anatomical ROIs

https://doi.org/10.1016/j.neuroimage.2008.01.001Get rights and content

Abstract

An algorithm using pre-defined regions of interest (ROIs) to detect differences between sessions in Blood Oxygen-Level Dependent (BOLD) signal is proposed and results from a reproducibility study are reported here. It is important to know whether tests for change have the desired statistical properties, e.g., low variability between sessions and unbiased false-positive rates, under null conditions so one is confident in any conclusions based on the metric and test used. This study examined three cognitive tasks: Stroop, response inhibition, Sternberg digits, and a visually-cued finger tapping task, in 20 healthy subjects. Each subject had two fMRI sessions, one week apart. A series of ROI summaries was constructed by choosing different proportions of voxels from the ROI and calculating the mean of t values for the selected voxels. The choice of voxels was based on the magnitude of the t values, selecting the maximum value, then the top 1%, and so on until all voxels in the ROI were included. No ROIs were found to have significant differences between sessions based on paired comparison t tests. Generally, the observed false-positive rates were near the expected rates for all summaries and tasks, although the Sternberg and Response Inhibition tasks did have higher false-positive rates when α   0.1 for some ROI summaries. This study indicates results are reproducible and also have the desired statistical properties under null conditions.

Introduction

In drug development, we define a biomarker to be “a characteristic that is objectively measured and evaluated as an indicator of normal biological processes, pathogenic processes, or pharmacological responses to a therapeutic intervention.” (Atkinson et al., 2001). In order to use fMRI as a biomarker, we need to be able to measure the amount of change due to a therapeutic intervention, not simply detect the occurrence of a change.

The limitations of voxel-based group analysis are well known (Brett et al., 2002). Even with smoothing and spatial normalization to a standard space, there is enough anatomical and functional variability between people that there can be significant differences between voxel-based group activation and activation in individuals (Vandenbroucke et al., 2004, Zahn et al., 2004, Eugene et al., 2003). These limitations and the complications of finding an appropriate way of quantifying the change make voxel-based group analysis difficult to use as a biomarker.

One alternative to voxel-based group analysis is to define regions of interest (ROI) in individuals and combine a summary of the ROIs to obtain a group result. Summarizing over the voxels in an ROI essentially smoothes over a restricted neighborhood of voxels thereby helping to reduce the effect of voxel-level spatial variability that is present in voxel-based group analysis. Another benefit of using ROIs is that translation to standard space is no longer required. Hence the manipulation of the “original” signal values during translation to standard space is removed from the analysis process.

There are some potential issues with summarizing over ROIs. Summarizing over areas larger than a voxel may decrease sensitivity as active voxels are grouped with inactive voxels in the summary. Also, active functional regions may not correspond to the defined anatomical ROIs. The functional regions may be a subregion of the anatomical ROI or it may be split between several ROIs leading to the signal being diluted during summarization of the ROI if the summary function is not chosen carefully.

Two summaries, location of the peak activation, and the number of voxels above a threshold (Rombouts et al., 1998, Constable et al., 1998), have been used to examine reproducibility of the fMRI signal, but are not good biomarker candidates because they do not easily quantify change due to a drug. The peak location is not expected to change with therapeutic intervention, although the magnitude might. The number of voxels above a threshold has several known limitations. The measure is dependent on an arbitrary threshold and results could be misleading if activation decreases to just below the threshold. The number of voxels above a threshold has also been shown to be highly variable compared to mean % signal change in an ROI (Cohen and Dubiosn, 1999).

The mean activation (e.g., mean t value or z score) over all or some voxels in a region has also been used (e.g., Dunckley et al., 2005, Iannetti et al., 2005). Typically, in a study only a single summary mean value is examined. In this paper we examine a family of summary measures which average over an increasing proportion of voxels in the ROI. This family of results presents a continuum of summaries from maximum signal in the ROI to the average of all signal values in the ROI. It also allows the active voxels to be filtered from the inactive voxels to some degree, hence improving the sensitivity of the summarized response.

Regions of interest that are investigated during a study are usually few in number and defined either functionally (for example, by making a mask from the active voxels determined from a voxel-based group analysis or separate scanning session) or anatomically. We chose to use an anatomical atlas to define the ROIs. This approach allowed us to predefine the ROIs and provide independence between ROI definition and functional response. We also chose to divide the entire brain into anatomical regions rather than focus on a few ROIs where activation or change in activation are expected. This allows us to systematically test regions where we expect activation and regions where we do not expect activation. A template based ROI map is easier to produce than a hand drawn map, and evidence suggests these automated mapping procedures may be as reliable as hand-drawn maps (Sun et al., 2007). We used the atlas developed by Tzourio-Mazoyer et al., 2002, for this paper, and have not investigated other available atlases (e.g., Hammers et al., 2002, Van Essen, 2005, Klein and Hirsch, 2005, Eickhoff et al., 2006).

In this paper we outline a methodology for whole brain analysis using ROIs which are summarized using a family of summary statistics. To assess the value of this approach as a biomarker for fMRI it is important to understand the degree of reproducibility between sessions in the metrics to be used and to have some confidence in the validity of the statistical tests that are likely to be used for second-level analysis. Reproducibility is characterized here by the standard deviation of the difference between sessions and potential temporal trends in response are examined through tests for mean differences between sessions. Intraclass correlation coefficients (ICCs) are not used to evaluate reproducibility here because of the variation in activation across all ROIs. For example, many ROIs are expected to have no association with a given task, so variation between subjects is likely to be low since the observed signals should just be random noise. Hence the ICC would be very low in these cases due to lack of variation between subjects rather than high variation between sessions, yet variability between sessions could be acceptably low as well. To assess the validity of the statistical tests it is necessary to investigate whether the tests have the desired statistical properties so as to have confidence in conclusions when the statistical tests reject the null hypothesis. To assess these statistical properties a study of the properties for the paired comparison t test under null conditions are examined. In particular the normality of the ROI summary values is evaluated and possible bias in the expected false-positive rates is examined.

Section snippets

Study design

Twenty healthy 18- to 65-year-old males and females with no history of neurological or mental disorders were recruited from the New Haven, CT area. Participants provided informed consent in accordance with the guidelines set forth by Western IRB.

The subjects completed two fMRI sessions approximately one week apart. During each fMRI session, 4 tasks were completed: fingertapping, response inhibition, Stroop, and Sternberg. The order of tasks was randomized within each session, except

Results

The set of gray matter voxels was considered to be the intersection of the SPM2 brain mask and the atlas gray matter mask. The atlas gray matter masks, however, were not always a subset of the SPM2 masks. The percent of voxels that were present in the atlas mask but not in the SPM2 mask ranged between 1.0 and 1.5% for the four tasks. In spite of our attempt to avoid having very small regions by a priori combining small regions with neighboring regions, we were not entirely successful and for

Discussion

The goal of this study was to begin to explore the viability of fMRI as a biomarker in drug trials. Two important steps in deciding whether fMRI has potential as a biomarker is assessing the quantitative reproducibility of a chosen signal over time across subjects and determining if the underlying assumptions for anticipated statistical tests used in future drug trials hold for replicate observations.

There were some important deviations in reproducibility during the creation of the ROIs,

References (26)

  • VandenbrouckeM.W. et al.

    Interindividual differences of medial temporal lobe activation during encoding in an elderly population studied by fMRI

    NeuroImage

    (2004)
  • ZahnR. et al.

    Recovery of semantic word processing in global aphasia: a functional MRI study

    Brain Res. Cogn. Brain Res.

    (2004)
  • BenjaminiY. et al.

    Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

    R. Stat. Soc., Ser. A Stat. Soc. Ser. B.

    (1995)
  • Cited by (15)

    • Reduced brain activation during spoken language processing in children with developmental language disorder and children with 22q11.2 deletion syndrome

      2021, Neuropsychologia
      Citation Excerpt :

      By providing information about both the strongest activating and the strongest de-activating voxels, we aimed to offer a representation of the full range of beta values for each group, task and ROI. In earlier studies, the selection of a subset of voxels, based on their level of activity, as a basis for a single measure of task-related signal changes within an anatomically defined ROI, has been found to be a useable and valid approach (Buck et al., 2008; Buma et al., 2016; Mitsis et al., 2008; Tong et al., 2016). Using each of these two voxel selections, we subsequently calculated the mean (de-)activation per ROI.

    • The role of fMRI in drug development

      2018, Drug Discovery Today
      Citation Excerpt :

      With such prescribed methodology in hand, an fMRI experiment can be reduced to a binary outcome more suitable to inform drug development decisions. In drug development, primary endpoints and hypotheses are typically based on prespecified ROIs [68,69] and power calculations are performed accordingly to avoid underpowering. Significant logistical requirements must be met when deploying fMRI to large numbers of heterogeneous, nonacademic imaging facilities, as is required for typical late-stage clinical trials.

    • Dynamic causal modelling of EEG and fMRI to characterize network architectures in a simple motor task

      2016, NeuroImage
      Citation Excerpt :

      The order of sessions was counterbalanced, so a systematic order effect on task performance or brain activation is unlikely. However, we cannot exclude that this strategy might have introduced some variance because BOLD and spectral EEG responses can vary from day to day (Buck et al., 2008; Fernandez et al., 1993; Menz et al., 2006; Napflin et al., 2007). We opted to do so because we assume that the better data quality of our 63-channel active electrode scalp recordings compared to EEG data measured in our MR scanner settings outweigh the potentially better comparative basis offered by a synchronous recording since it is generally assumed that repeated measures of EEG and BOLD signals, i.e., averaging across multiple trials, during motor tasks provide a valid basis for generalizable results.

    • Cerebral mechanisms for different second language writing systems

      2013, Neuropsychologia
      Citation Excerpt :

      Subsequently, to examine pairwise differences between the groups for each word condition and differences between the word conditions within each group, the % BOLD signal changes in the significant clusters (as determined by FSL Featquery) were further investigated via unpaired and paired t-tests, respectively. Note that the 90th percentile measure of % BOLD signal change was utilized, rather than the mean or maximum value, because it is considered a more representative measure of a typically active voxel within the ROI (Buck, Singhal, Arora, Schlitt, & Constable, 2008). Table 1 gives a summary of the demographic and cognitive measures (performed outside the scanner) for each group.

    • Within-limb somatotopy in primary motor cortex - revealed using fMRI

      2010, Cortex
      Citation Excerpt :

      Furthermore, although POA calculation was based on anatomical atlas-based M1 and conservative Bonferroni correction (range: .00006–.000003 for individual subjects), we found high variability for POA, also noted by Lotze et al. (2000) and Fesl et al. (2008). Fesl et al. (2008) and Buck et al. (2008) have suggested that as long as the thresholding technique used is ‘corrected’ and the threshold level is stringent, the variability of POA (t-weighted COM) is less relevant. This happens because at higher stringency, the activation shrinks and less significant voxels disappear, reducing noise in the data.

    View all citing articles on Scopus
    View full text