Introduction

Cerebral autoregulation (CA) is the mechanism aiming to keep cerebral blood flow constant during blood pressure (BP) fluctuations by constricting or dilating cerebral arterioles in response to BP increases and decreases, respectively (Bayliss 1902; Claassen et al. 2016; Moerman and De Hert 2019). CA acts complementary to the process of neurovascular coupling which aims to increase local cerebral blood flow in response to increased neural demand and is dependent on systemic and cerebral CO2 concentration (Rosengarten et al. 2001; Meng and Gelb 2015). Static and dynamic CA are distinguished to express the ability of CA to maintain cerebral blood flow during a changed steady-state BP and fluctuating BP, respectively (Tiecks et al. 1995; de Jong et al. 2017). Dynamic CA function is dependent on the frequency and speed of BP changes and can in contrast to static CA not be measured using MRI, PET or SPECT (Tiecks et al. 1995; de Jong et al. 2017). Impaired dynamic CA was reported in patients with stroke and traumatic brain injury (Radolovich et al. 2011; Castro et al. 2018) and may be a risk factor for mild cognitive impairment (MCI) and dementia, particularly in patients with hypertension and orthostatic hypotension (Freeman et al. 2011; Tarumi et al. 2014; Müller et al. 2020). Presently, dynamic CA is usually assessed using Transcranial Doppler (TCD) measurements, which provide a valid approximation of cerebral blood flow measured using MRI (Khan et al. 2017). However, TCD measurements require skilled investigators and are not feasible in a substantial proportion of older adults due to age-related temporal bone remodeling, limiting TCD availability and applicability in clinical populations (Claassen et al. 2016; Couture et al. 2017).

Near infrared spectroscopy (NIRS) is a potential alternative for TCD and measures changes in cerebral oxygenated and deoxygenated hemoglobin concentrations by detecting the intensity of reflected infrared light emitted into the brain. NIRS measurements were suggested to be potentially useful to assess CA in healthy young individuals (Kainerstorfer et al. 2015; Elting et al. 2018; Kim et al. 2018), healthy older adults (Gao et al. 2015), and various clinical populations (Zweifel et al. 2010; Rivera-Lara et al. 2017; Montgomery et al. 2020). NIRS-derived CA estimation may be performed using the cerebral oximetry index, or transfer function analysis (TFA) phase shift correcting for effects arising from the cerebral microcirculation using the information in the high-frequency range (0.2–0.5 Hz), in which CA is not active (Brady et al. 2010; Elting et al. 2018). However, there is very limited evidence on the validity of NIRS-derived CA estimation in an older, clinical population, for example with chronic diseases such as hypertension and cognitive impairment.

In this study, we assessed the criterion validity of NIRS-derived CA estimation in younger and older controls, patients with controlled hypertension, mild cognitive impairment (MCI) and Alzheimer’s dementia (AD) during supine rest and repeated sit to stand transitions. We hypothesize that NIRS-derived CA estimates correlate with TCD-derived CA measures, and have a high absolute agreement, i.e., a separation between upper and lower 95% limits of agreement < 50° (Sanders et al. 2019).

Material and methods

Study cohorts

BP, TCD and NIRS data from six cohorts, collected between 2008 and 2018 at three different centers, were included in this study: two cohorts of younger adults (younger controls; n = 39 and 14; mean age < 65 years), a cohort of older adults (older controls; n = 28; mean age > 65 years), a cohort of patients with controlled hypertension (n = 27), and cohorts of patients with MCI and AD (cognitively impaired patients; n = 37 and 57). The centers were (1) the Department of Neurology, Lucerne Kantonsspital, Lucerne, Switzerland (younger controls cohort 1 and patients with controlled hypertension); (2) the University Groningen Medical Center, Groningen, the Netherlands (younger controls cohort 2); (3) the Radboud University Medical Center, Nijmegen, the Netherlands (older controls and patients with MCI and AD). Table 1 lists the inclusion and exclusion criteria per cohort.

Table 1 Cohort characteristics

In the quantitative analysis, data from the two cohorts of younger controls and patients with MCI and AD were pooled, leaving four pooled cohorts for statistical analysis: younger controls, older controls, hypertension patients and cognitively impaired patients.

All patients and controls signed informed consent and for all studies medical ethical approval was obtained and they were performed in accordance with the declaration of Helsinki.

Participant characteristics

Information about age, sex, smoking habits, medical history and use of medication were obtained. Body mass index (BMI) and cognitive performance (MMSE and/or MOCA) were measured in older controls and patients with MCI and AD.

Instrumentation

Blood pressure (BP), near-infrared spectroscopy (NIRS) and transcranial Doppler (TCD) were simultaneously measured. The used BP, TCD and NIRS devices and manufacturers per cohort are listed in Table 1 as well as the used sampling frequencies, wavelengths and inter optode distances.

Continuous, beat-to-beat BP was measured non-invasively using finger photoplethysmography. Near-infrared spectroscopy (NIRS) measurements were obtained bilaterally on the forehead to assess changes in cerebral oxygenated hemoglobin concentrations (O2Hb). The differential pathway factor (DPF), which accounts for the increased distance traveled by light due to scattering, is age-dependent and was computed using the following formula in adults aged below 50 years (Scholkmann and Wolf 2013): 4.99 + 0.067 × Age0.814. In other individuals, DPF was set to 6.61, the value for an age of 50 years, the highest age for which the formula is validated. Transcranial Doppler (TCD) measurements were performed bilaterally over the temporal bone to measure cerebral blood flow velocity (CBFV) in the middle cerebral arteries. The TCD probes were fixed using a head holder.

Protocol

Room temperatures between 20 and 23 °C were pursued. Patients and controls were discouraged from talking and moving during the measurements.

Patients were asked to lie supine for at least 5 min during which blood pressure, cerebral blood flow velocity and oxygenated and deoxygenated cerebral hemoglobin (O2Hb and HHb, respectively) were measured.

Measurements during repeated sit-to-stand transitions were performed in patients with MCI and AD. The seat was adjusted to the patient’s height. Patients were asked to switch from sitting and standing position every 10 s (full cycle period of 20 s; frequency of 0.05 Hz) during 5 min to induce BP, CBFV and O2Hb oscillations.

Data analysis

All analysis was performed using MATLAB (version R2019b, The MathWorks Inc., Natick, Massachusetts, USA).

Signal preprocessing

All signals were resampled to a uniform sampling frequency of 200 Hz and a moving median filter with a 0.15 s window was applied to remove spike artefacts from the signals.

Signal quality assessment

The quality of BP, CBFV and O2Hb signals was visually assessed by two authors (AM and JWE). This was performed separately for each signal, individual, side (left/right) and test condition. A signal was considered poor quality and discarded if it contained spike or step artefacts, or a recurrent heartbeat was not visible.

Transfer function analysis (TFA)

BP-CBFV (TCD-derived) and BP-O2Hb (NIRS-derived) transfer functions (TF) were computed for both sides in each individual using software from the CARNet community (Claassen et al. 2016). Signals were filtered using a 6th order Butterworth lowpass filter with a cutoff of 0.5 Hz and the signal mean was subtracted before performing the TFA. TF gain (TFg), phase shift (TFφ) and coherence (TFc) were computed as a function of frequency. TFg and TFφ with insignificant coherence in a frequency bin (tested according to the CARNet recommendations) (Claassen et al. 2016) were discarded from further analysis.

TFφs were averaged over the sides with available data per individual and subsequently averaged over individuals to obtain grand average TFφs per cohort. Averaging was performed using the circular mean (i.e., transforming angular quantities to positions on the unit circle, averaging x and y coordinates of these positions separately, and transforming the averaged x and y coordinates back to an angular quantity). NIRS and TCD-derived grand average TFφs were plotted together to enable a visual assessment of their differences.

The very low frequency (VLF), low frequency (LF) and high frequency (HF) ranges were defined as 0.02–0.07 Hz, 0.07–0.2 Hz and 0.2–0.5 Hz, respectively (Elting et al. 2018). Mean TFφ in the VLF and LF ranges were used as CA measures.

Correction for cerebral microcirculation effects

The BP-O2Hb TFφs were corrected for cerebral microcirculation effects on waveform morphology: CBFV waveforms typically have a steeper upstroke compared to O2Hb waveforms, resulting in a relatively constant negative phase shift across frequencies (demonstrated in Supplementary File S1). Correction was performed by computing the mean phase shift in the HF range (in which CA is not active) and subtracting this mean phase shift from the TFφ (Fig. 1). The HF range mean phase shift was determined based on the pooled cohort average TFφ.

Fig. 1
figure 1

Grand average of BP-CBFV and BP-O2Hb TFφ in supine rest and during repeated sit to stand transitions, per cohort. The blue and red traces are the BP-O2Hb TFφs before and after correction, respectively. The yellow dotted lines are the means lines of the BP-O2Hb TFφ in the high frequency (HF) range. MCI mild cognitive impairment, AD Alzheimer’s dementia

Statistical analysis

Circular means and standard deviations were used to aggregate phase shift data.

TFφs were consecutively averaged over sides (left/right) and over frequencies within frequency ranges (VLF and LF), resulting in mean VLF and LF TFφs for each individual.

Spearman rank correlations of NIRS- and TCD-derived CA estimates were computed as a measure of criterion validity. Furthermore, the absolute difference between NIRS- and TCD derived CA estimates were visualized using Bland Altman plots and 95% upper and lower limits of agreement were computed. A separation between upper and lower limits of agreement < 50 was considered high absolute agreement (Sanders et al. 2019). This value corresponds to the separation between 95% limits of agreement of Bland Altman analysis reflecting the test–retest reliability of TCD-derived CA measures (Sanders et al. 2019).

p-values < 0.05 were considered statistically significant.

Results

Participant characteristics

Table 2 lists the participant characteristics per cohort. Mean age in the two cohorts of younger controls was 48 years (SD 17.7) and 28 years (range 21–45), respectively, and ranged between 69.2 and 73.3 years in the cohorts of older controls and patients. Mean systolic/diastolic blood pressure in patients with controlled hypertension was 124.0/78.5 mmHg (SD 15.6/12.9). Median MOCA score was 23 points (interquartile range (IQR) 20.5–25.0) in MCI patients; median MMSE score was 16 points (IQR 14–18) in patients with AD, and 29 points (IQR 28–30) in older controls. Cerebral blood flow velocity as measured with TCD ranged between 38.2 cm/s in patients with AD to 62.8 cm/s in younger controls.

Table 2 Characteristics of patients and controls

Signal quality

During supine rest, each signal (BP, CBFV, O2Hb) was available in good quality in 148/202 participants (73.2%; younger controls: 26/41; older controls 19/28; hypertension patients: 23/27; cognitively impaired patients: 59/94). During the repeated sit-to-stand transitions, each signal was available in good quality in 35/94 cognitively impaired patients (37.2%).

Correction for cerebral microcirculation effects

Figure 1 shows the results of correcting for cerebral microcirculation effects on grand average level. After correction, the grand average BP-O2Hb TFφ approximated the grand average BP-CBFV TFφ: − 43°/− 71° error before correction to 16°/− 12° error after correction (VLF/LF range, all participants, supine rest) and -62/-75 degrees error before correction to − 4°/− 17° error after correction (VLF/LF range, cognitively impaired patients, sit to stand transitions). Errors before and after correction per cohort and experimental condition are listed in Table 3.

Table 3 Cerebral autoregulation estimates derived from TCD and NIRS

Correlation between NIRS- and TCD- derived CA estimates

Spearman correlations between NIRS-and TCD derived CA estimates during supine rest were significant in patients with hypertension in the VLF range (ρ = 0.55, n = 27) and in younger controls in the LF range (ρ = 0.37, n = 53), but not in the other cohorts. In the pooled group of all patients and controls, Spearman correlations during supine rest were 0.22 (VLF, n = 120, p = 0.016) and 0.30 (LF, n = 111, p = 0.002). During repeated sit-to-stand transitions in cognitively impaired patients, Spearman correlations were 0.46 (VLF, n = 32, p = 0.009) and 0.61 (LF, n = 19, p < 0.001), respectively. Table 3 lists the correlations between NIRS- and TCD-derived CA estimates in the entire population as well as in the different pooled cohorts. Supplementary Table S2 lists the mean TFg, TFφ and TFc for each pooled cohort, test condition, TF, and frequency range. Supplementary Table S3 lists the Spearman correlation stratified for the NIRS measurement device. NIRS- and TCD-derived CA measures only were significantly correlated when measured with the Oxymon Mk III device (VLF/LF, n = 63/58, ρ = 0.22/0.30).

Bland Altman analysis

Bland Altman plots of the differences between NIRS- and TCD-derived CA estimates within the entire population of all patients and controls are shown in Fig. 2. For measurements in supine rest (pooled group of all patients and controls), the 95% limits of agreement were − 42° to 70° (VLF) and − 49 to 38° (LF). For measurements during repeated sit-to-stand transitions (cognitively impaired patients), the 95% limits of agreement were − 39 to 38° (VLF) and − 37 to 28° (LF). The 95% limits of agreement per cohort and experimental condition are listed in Table 3.

Fig. 2
figure 2

Bland Altman plots showing agreement between NIRS- and TCD- derived CA measures during supine rest and repeated sit-to-stand transitions. The horizontal solid lines indicate the mean difference; the horizontal dashed lines indicate the 95% limits of agreement. VLF very low-frequency range, LF low-frequency range

Discussion

This study assessed the criterion validity of near-infrared spectroscopy (NIRS) as a cerebral autoregulation (CA) estimation method. Correction of NIRS-derived CA estimates for waveform morphology differences between CBFV and O2Hb signals reduced the errors in the grand average. Significant but low correlations (0.22–0.30) between NIRS and TCD-derived CA measures were found for measurements performed during supine rest. During repeated sit-to-stand transitions, correlations were higher (0.46–0.61), but the analyzed number of individuals was low (19–32). Bland Altman analyses showed a low absolute agreement between NIRS- and TCD-derived CA measures (separation between upper and lower 95% limits of agreement ranging from 65° to 112°).

Agreement between NIRS- and TCD derived CA measures

Correction of the BP-O2Hb TFφ by subtracting the negative mean phase in the HF range reduced errors on grand average level both during supine rest and repeated sit-to-stand transitions with at least 27° and 58°, respectively. This negative mean phase was shown to reflect waveform morphology differences between the systemic and cerebral circulations. The significant correlations between NIRS- and TCD-derived CA measures suggest that NIRS-derived CA estimates may be used to compare groups. Absolute errors between NIRS and TCD-derived CA measures in individuals were higher than the used cutoff criterion for clinical application in individuals (Sanders et al. 2019). This finding was different from previous work that investigated the transfer function from O2Hb to deoxygenated haemoglobin (HHb) (Elting et al. 2018). In that study, a relatively good agreement between NIRS based autoregulation estimates and BP-CBFV based autoregulation estimates was found, with intra class correlation values up to 0.7. Apart from the difference in signals that were analysed, the most likely explanation for these different results is a difference in signal quality and artefact levels between these studies. In the current dataset, artefact levels were relatively high, with HHb signals that were largely of insufficient quality, which prevented us from examining O2Hb-HHb transfer functions.

The Bland–Altman plots showed higher errors when phase differences were higher, indicating a poor agreement between NIRS- and TCD-derived CA estimates on the individual level. This poor agreement is likely to arise from a combination of factors, i.e., noise in the NIRS signals, differences between NIRS devices used, effects of extra-cranial circulation, differences in the part of the brain monitored between NIRS and TCD, and the use of TCD as the gold standard because of the lack of the optimal gold standard. These factors are discussed in more detail below.

Supine rest versus sit to stand transitions

The correlation between NIRS- and TCD- derived CA measures was particularly large when measured during repeated sit-to-stand transitions. The transitions increased the coherence between BP, CBFV and O2Hb in the VLF range, implying a better validity of the TFA linearity assumption, as reported before (Claassen et al. 2016). The relatively high correlation between NIRS- and TCD-derived CA measures during repeated sit-to-stand transitions may also be explained by the higher BP variability, which was reported to be a positive determinant of CA reliability in a previous study (Elting et al. 2020). CA impairment may become manifest during transitions rather than in supine rest, indicating the potential clinical relevance of assessing CA during transitions (Mol et al. 2018, 2019, 2020; de Heus et al. 2020). In contrast to supine rest, transitions may induce rapid BP drops, challenging CA beyond its capacity due to its intrinsic latency (Kuo et al. 2003). However, the number of individuals with good quality signals during sit-to-stand transitions in the present study was low.

VLF versus LF range

The correlation between NIRS- and TCD-derived CA measures was higher in the LF range compared to the VLF range. Non-linear behavior in the VLF range may play a role as reported in a previous study and indicated by the low coherence in the VLF range compared to the LF and HF range (Giller and Mueller 2003). As demonstrated in supplementary file S1, estimation of the CBFV-O2Hb TFφ using the HF mean of the BP-O2Hb TFφ was less accurate in the VLF compared to the LF range. The findings may also be explained by the relatively short duration of the measurements relative to the oscillation period in the VLF range, implying a potentially large effect of artefacts on the BP-O2Hb TFφ in this frequency range. Further studies should preferably prolong the measurements, enabling the selection of data segments with more BP variation, improving the reproducibility of CA assessment (Elting et al. 2020).

Difference between cohorts

The correlation between NIRS- and TCD- derived CA measures was significant in the entire population of patients and controls during supine rest, but not in each of the pooled cohorts. Apart from low sample sizes, device and data quality differences between the different centers may have played a role. For example, differences in NIRS inter optode distance (i.e., the distance between transmitter and receiver), which partly determines the volume of brain tissue being sampled, may have influenced the results (Klaessens et al. 2003). The analysis stratified for NIRS device supports the idea that the device type and inter optode distance may play a role as the correlation between NIRS- and TCD-derived CA measures was only significant when measured with one type of device (Oxymon MK III), which in contrast to the other NIRS devices used had an inter optode distance of 5 cm. However, this difference in significance of correlations may alternatively well be attributed to the difference in a number of individuals measured, which was highest in Oxymon MK III group as absolute correlation values were comparable between devices.

NIRS measurements for CA estimation

Though NIRS aims to measure cerebral oxygenation changes only, the influence of changes in the extra-cranial circulation on the NIRS signal cannot be entirely excluded, posing a potential limitation to the study. Extra-cranial effects were demonstrated in a previous study, which showed an effect of blocking the extra-cranial circulation using a head cuff on the measured NIRS signals (Davie and Grocott 2012). However, these extra-cranial effects were reported to be relatively small when CA is assessed in the frequency domain using an inter optode distance > 3 cm (Obrig et al. 2000). Remaining extra-cranial effects may have contributed to the poor correlation between NIRS- and TCD-derived CA measures found in the present study. As the extra-cranial circulation is not influenced by CA, extra-cranial circulation is synchronous to the systemic circulation (i.e., BP). Extra-cranial circulation effects on the NIRS measurements would therefore draw the BP-O2Hb TFφ to zero in the VLF and LF ranges (Obrig et al. 2000). This is was not observed in the present study where the BP-O2Hb TFφ quite closely approximated the BP-CBFV TFφ (Fig. 1), suggesting only a minor effect of extra-cranial effects on the NIRS-derived CA measures. Extra-cranial effects should be aimed to be further excluded in future studies, which may in part be performed by using both small and large inter optode distances. Optodes with small distances primarily measure extra-cranial effects. These effects can then be subtracted from measurements from larger distance optodes, which are assumed to measure both cerebral and extra-cranial effects.

In the present study, NIRS measurements were performed bilaterally on the forehead to measure frontal cortical cerebral oxygenation changes as an indication of general cerebral perfusion. However, there may be differences between middle and anterior cerebral artery supplied brain regions (Wolf et al. 2002; Obrig et al. 2002), potentially explaining part of the differences between NIRS- and TCD-derived CA measures. Furthermore, the cortical neurovascular coupling may give rise to small differences between local cortical perfusion measured using NIRS and regional cerebral perfusion measured using TCD (Anderson et al. 1987; Iadecola 2017).

Signal quality

A considerable proportion of the included individuals did not have an adequate quality of all signals (BP, CBFV, O2Hb) during supine rest, which is a limitation of this study. The proportion was lowest (62.7%) in cognitively impaired patients, which might be explained by poor understanding of instructions not to move or a lower baseline cerebral blood flow causing a lower signal to noise ratio in these individuals. The even lower availability of all signals (37.2%) during repeated sit-to-stand transitions in this pooled cohort can be attributed to the abundant occurrence of transition-induced movement artefacts. Due to the removal of negative phase shifts, the number of individuals for whom both NIRS- and TCD-derived CA measures were available was further reduced, posing a substantial limitation to the applicability of NIRS-based CA estimation. Further efforts should be made to decrease NIRS sensitivity to movement artefacts to enhance its applicability for CA estimation.

TCD as gold standard

TCD is the gold standard for CA assessment, but also has limitations. TCD-derived CA measures have limited reproducibility (Sanders et al. 2018, 2019; Elting et al. 2020), potentially indicating that physiological factors apart from CA may influence cerebral blood flow velocity and hence TCD-derived CA measures. These factors may also partly explain the poor correlation with NIRS-derived CA estimates. TCD signals had to be discarded due to artefacts in some individuals with good quality BP and NIRS signals, limiting the number of individuals that could be included in the comparative analyses.

Strength and limitations

The strength of this study is the diversity of the included cohorts and the simultaneous measurements of BP, TCD and NIRS both during well standardized (supine rest) and CA challenging (repeated sit-to-stand transitions) test conditions. Limitations include the relatively short duration of the measurements, the differences between the NIRS devices used in the different cohorts, the susceptibility of NIRS devices to potentially measure extra-cranial along with cerebral oxygenation effects, and the relatively large proportion of data that could not be used in the final analysis due to the presence of artefacts.

Conclusion and clinical implications

Criterion validity of NIRS-derived CA estimates increases after correction for non-CA effects arising from the cerebral microvasculature. Criterion validity may be sufficient to enable comparisons between groups, but this could not be fully established due to the diversity of the included cohorts as well as the differences between the NIRS devices used. Criterion validity was insufficient for clinical application in individuals. The results suggest that NIRS-derived CA estimates may be more valid during repeated sit-to-stand transitions. However, artefacts in NIRS recordings impede CA estimation as indicated by the substantial proportion of the data that had to be discarded. Reducing motion artefacts is needed to increase the quality of measurements during transitions and the applicability of NIRS-derived CA estimation. Increasing the quality of NIRS-derived CA estimation during supine rest could be performed by prolonging measurements and selecting data segments with more BP variation. After optimization of measurement duration, NIRS device settings, measurement protocol and artefact removal, NIRS measurements in geriatric outpatients may potentially enable valid CA assessment in a wider range of patients during more instances.