Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Behavioral effects of rhythm, carrier frequency and temporal cueing on the perception of sound sequences

  • Miriam Heynckes ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    miriam.heynckes@maastrichtuniversity.nl

    Affiliation Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands

  • Peter De Weerd,

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands

  • Giancarlo Valente,

    Roles Formal analysis, Methodology, Writing – review & editing

    Affiliation Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands

  • Elia Formisano,

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliations Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands, Maastricht Centre for Systems Biology, Maastricht University, Maastricht, The Netherlands

  • Federico De Martino

    Roles Conceptualization, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands

Abstract

Regularity of acoustic rhythms allows predicting a target embedded within a stream thereby improving detection performance and reaction times in spectral detection tasks. In two experiments we examine whether temporal regularity enhances perceptual sensitivity and reduces reaction times using a temporal shift detection task. Participants detected temporal shifts embedded at different positions within a sequence of quintet–sounds. Narrowband quintets were centered around carrier frequencies of 200 Hz, 1100 Hz, or 3100 Hz and presented at presentation rates between 1–8 Hz. We compared rhythmic sequences to control conditions where periodicity was reduced or absent and tested whether perceptual benefits depend on the presentation rate, the spectral content of the sounds, and task difficulty. We found that (1) the slowest rate (1 Hz) led to the largest behavioral effect on sensitivity. (2) This sensitivity improvement is carrier-dependent, such that the largest improvement is observed for low-frequency (200 Hz) carriers compared to 1100 Hz and 3100 Hz carriers. (3) Moreover, we show that the predictive value of a temporal cue and that of a temporal rhythm similarly affect perceptual sensitivity. That is, both the cue and the rhythm induce confident temporal expectancies in contrast to an aperiodic rhythm, and thereby allow to effectively prepare and allocate attentional resources in time. (4) Lastly, periodic stimulation reduces reaction times compared to aperiodic stimulation, both at perceptual threshold as well as above threshold. Similarly, a temporal cue allowed participants to optimally prepare and thereby respond fastest. Overall, our results are consistent with the hypothesis that periodicity leads to optimized predictions and processing of forthcoming input and thus to behavioral benefits. Predictable temporally cued sounds provide a similar perceptual benefit to periodic rhythms, despite an additional uncertainty of target position within periodic sequences. Several neural mechanisms may underlie our findings, including the entrainment of oscillatory activity of neural populations.

Introduction

Natural sounds are often characterized by a rhythm that enables the listener to predict and prepare for relevant events [1], a finding confirmed by many studies probing the effects of periodicity and predictability on sound perception [27]. Recent results have shown that periodicity and predictability of rhythms have dissociable effects on reaction time and sensitivity. Periodic (isochronous) stimuli decrease the reaction time [3,8,9], even when their rhythm is not predictive of a target [10,11]. On the other hand, predictability can increase perceptual sensitivity [5,7,12,13] even when stimulus presentation is predictable but aperiodic [6,14].

Most previous studies have focused on broadband or a few selected frequencies and have often tested presentation rates around 1.5 Hz [4,8,1519]. This last choice is at odds with studies suggesting that the brain preferentially differentiates acoustic information covering a range of timescales through theta- (and gamma) frequency-band information [20], a result that highlights the relevance of the temporal modulations inherent in speech occurring at the syllabic-level [21,22]. The importance of testing multiple carrier frequencies and presentation rates is stressed by psychoacoustic studies on amplitude modulation suggesting that rate and carrier frequency interact [23]. To account for the interdependency between spectral and temporal processing, it has been suggested that the modulation filters modelling the auditory system [24] systematically change along the frequency axis. In particular, the auditory system may be optimized to track rapid modulations at high frequencies and slower modulations at lower carrier frequencies [25].

Electrophysiological recordings in macaque auditory cortex suggest that the mechanism engaged by rhythmic sound processing is tonotopic and thus cannot be investigated when using broadband noise bursts [26], but requires the use of narrow-band stimuli or tones [16,27]. These studies propose that the optimization of behavioral performance for periodic stimuli is achieved through the entrainment of neuronal oscillations, which temporally modulate the excitability of task-relevant neuronal populations [8,18,28,29]. Rhythmic external stimuli can entrain oscillations, during which neural delta [8] and theta oscillations [30] become aligned to the externally imposed rhythm. The entrained oscillations may form the basis of temporal predictions that can be beneficial for stimuli presented at the entrainment rhythm [19,31].

Predictions can also be formed without periodicity, by using temporally cued associations. The behavioral benefits of temporal cueing have been shown in studies on foreperiod effects [32,33]. At the behavioral level, a predictable rhythmic sequence should lead to better perceptual discrimination at predicted moments in time than a predictable single interval. This has been tested in foreperiod-paradigms where the duration of an interval is judged, either in isolated pairs or with a preceding rhythm, and results indicate that discrimination thresholds improve [13]. Studies focusing on reaction times have compared effects of temporal cueing and periodic stimulus presentation both when the predictive information was provided symbolically, showing a benefit of periodicity over symbolic cueing [34], and temporally cued not showing a difference [15,35] and suggest a cumulative benefit of combining a temporal cue and a periodic rhythm [3]. Similarly, studies using pitch discrimination tasks [36,37] have shown effects on accuracy and reaction times. Moreover, a benefit in accuracy derived from implicit predictability in temporally cued intervals has been shown [14]. In line with a recent study suggesting that delta-phase entrainment may relate differently to spectral-based and temporal sensitivity [12], it would be of interest to compare perceptual sensitivity and reaction times due to predictability derived from a periodic sequence and from a single acoustic temporal cue when using a temporal task. This would allow evaluating if a single mechanism underlies predictive effects induced by both a temporal cue and a periodic sequence.

In two behavioral studies, we systematically address these open points by investigating how rhythm and carrier frequency of periodically presented sounds contribute to the behavioral benefits of rhythmic sound perception (Experiment 1). In a pilot we compare a subset of these conditions to aperiodic sound sequences (see S1 File). In experiment 2 we compare periodic and aperiodic sound sequences and additionally test whether the predictive advantage afforded by periodicity exceeds the advantage afforded by a temporal cue (Experiment 2). Based on previous findings [6,7,12] we expect a perceptual benefit in sensitivity and reaction times of periodic over aperiodic rhythms. In addition, we hypothesize that a periodic rhythm should result in a benefit over a single temporal cue, especially when targets are presented close to perceptual threshold. With regard to the difference between various periodic rhythms, we expect behavioral improvement to occur around the peaks of the external rhythm, as opposed to a general improvement due to periodic stimulation, in line with [16]. In particular, we hypothesize that an external rhythm in the theta frequency range (~ 4 Hz), would result in improved behavioral performance compared to other presentation rates.

Materials & methods

Participants

Prior to testing, participants were screened for normal hearing (≤ 20 dB) at audiometric test frequencies ranging from 0.25–8 kHz. In experiment 1a and 1b, two sets of 20 participants each took part in the experiments. Two participants were excluded, one due to failure to comprehend the task and one due to discomfort caused by the loudness of the stimuli, resulting in the abortion of the experiment. As a result, we analyzed data collected from n = 19 participants for experiment 1a (15 females, 4 males) and 1b (12 females, 7 males) each, and n = 20 participants for experiment 2 (13 females, 7 males). The size of n was determined based on studies reporting similar effect sizes [6]. The Ethics Review Committee of the Faculty of Psychology and Neuroscience (ERCPN) at Maastricht University granted approval for all studies and all participants gave informed consent.

Stimuli & design

All stimulus presentation scripts were written in Matlab (The MATHWORKS Inc., Natick, MA, 234 USA), using the Psychophysics toolbox [38]. Sounds were created at a 44.1 kHz sampling rate, 16-bit resolution and delivered through Sennheiser HD650 headphones. Data and analysis scripts are publicly available (https://doi.org/10.5281/zenodo.3695583). Exemplary stimuli can be found here (https://doi.org/10.5281/zenodo.3549376). Participants were seated in a sound-attenuated chamber. Instructions were presented on a computer monitor and responses collected with a standard keyboard. Participants were asked to detect a target; a temporal shift (TS) of a narrow-band sound embedded in a sequence of quintets. Narrowband sounds were centered around carrier frequencies of 200 Hz, 1100 Hz, or 3100 Hz. The passbands around the carriers were constructed using equivalent rectangular bandwidths (ERBS = 4; [39]). Each passband consisted of a summation of 21 sinusoids with amplitude normalized to 1 and a random onset phase. A quintet consisted of five 10ms narrowband sounds, each separated by 10ms (see inset 1 Fig 1D). Targets were constructed by shifting in time the third sound in a quintet (see inset 2 Fig 1D). Depending on the experiment, this shift was either fixed at 6ms (Experiment 1) or ranged between 1.5-7ms (Experiment 2; see Table 1). In both experiments, during a trial and up to 1 second after a quintet sequence finished, participants could press a button upon detecting a TS or another button at the end of a sequence indicating they did not perceive a TS. Quintet sequences had either a periodic or aperiodic repetition of quintets or occurred in the form of a temporal cue, where a single quintet cued the following quintet that possibly contained a TS. Table 1 reports the design parameters for each experiment. The distribution of TS was non-uniform such that TS occurred in 75% of the trials. Targets appeared at one of 3 possible quintet positions within the periodic or aperiodic sequences, or in the cued quintet. In experiment 1a targets TS were presented in quintet 9, 10 or 11 (e.g. occurring after 9, 10 or 11s for the 1 Hz rhythm). With increasing rhythm, targets thereby occurred earlier in time within a sequence (the 9th quintet at 2 Hz occurs at 4.5 s). Experiment 1b controls for the earlier occurrence of targets with increasing rhythm, by keeping the time until a TS constant across rhythm (a TS occurring in the 9th quintet at 1 Hz would occur in the 18th quintet at 2 Hz). Similar to experiment 1b, TS occurred at a fixed time in both the pilot experiment preceding experiment 2 (see S1 File), as well as in experiment 2 itself.

thumbnail
Fig 1. Stimuli.

A Experiment 1. Narrowband quintets were presented at 4 different presentation rates (1,2,4,8 Hz) and 3 different carrier frequencies (200, 1100, 3100 Hz). Target position (exemplars in red) was varied between experiment 1a and 1b. In experiment 1a targets occurred in either the 9th, 10th or 11th quintet within a sequence, while in experiment 1b targets appeared at the same time across rhythms (at 9,10 or 11 s). B Stimuli in experiment 2 were 6s aperiodic and periodic sequences at 2 Hz, correspondingly the cue condition had an ISI of 500ms. The carrier frequency was 1100 Hz. C. Quintet structure. Narrowband sounds of 10ms length centered around the respective carrier frequencies were organized in a hierarchical rhythmic structure. Five sounds repeating at 50 Hz (10ms ISI) create a quintet (inset 1), while these quintets are repeated at a slow (a)periodic rhythm. Target stimuli (TS, see inset 2) had a different temporal structure: the third tone in a quintet was temporally shifted at 6ms in experiment 1a, 1b and 2 and between 1.5-7ms in experiment 3, as determined by the participant’s 70% detection threshold.

https://doi.org/10.1371/journal.pone.0234251.g001

In experiment 1a (n = 19) & 1b (n = 19) we investigated the effect of periodic rhythms (1, 2, 4, 8 Hz) and the carrier frequency (200, 1100 and 3100Hz) on target detection sensitivity (measured in d’) and reaction time. Every trial consisted of 12s periodic sequences of quintets. We conducted two versions of this experiment, in which we varied the position of the targets within the sequences. In experiment 1a we kept the number of preceding quintets until the TS constant across rhythms. This systematically reduced the time until a target appeared with increasing rhythm frequency. In experiment 1b, we kept the time at which targets appeared constant, thereby presenting an increasing number of preceding quintets with increasing rhythms prior to presenting a target quintet. The rationale to assess and compare these two experiment versions is that the strength of entrainment might increase with additional repetitions, and the effect of rhythm may be confounded by the systematic effect of target position in a sequence. Trials were counter-balanced and randomized with respect to target position and carrier frequency, and presented in (counterbalanced) blocks per rhythm, consisting of 12 trials. After four blocks (one per rhythm) subjects had a break. In total participants received 10*4 blocks, summing to a total of 480 trials. Subjects underwent a brief training session (8–12 trials) using a subset of the stimuli. During training, visual feedback on the performance was provided. Prior to the training and main experiment, participants adjusted the intensity of the sounds to equalize their perceived loudness. When comparing the three carrier frequencies at equal intensities, 200 Hz was generally perceived as softer and 3100 Hz perceived as louder relative to the reference frequency of 1100 Hz and were adjusted accordingly. These observations are in line with equal loudness contours [39].

Unless explicitly stated, the stimuli in the pilot experiment preceding experiment 2 (see S1 File) and stimuli in experiment 2 were identical to those used in experiment 1b. We decided to limit the number of conditions and chose the carrier frequency and rhythm based on our findings in experiment 1 showing that with a 2 Hz rhythm and a 1100 Hz carrier frequency behavioral performance was intermediate, and thus we could expect the behavior to be modulated by the manipulations in experiment 2. In the pilot (n = 20; 11 females, 9 males) we examined the effect of (average) rhythm on target detection, by comparing periodic predictable and aperiodic unpredictable sequences of quintets. The TS target remained fixed at 6ms. To create an aperiodic sequence, the main constraint was to present the same number of quintets in the same amount of time (compared to the periodic conditions) at aperiodic inter-stimulus intervals (ISIs). For instance, the aperiodic sequence corresponding to the 1 Hz rhythm had to be comprised of 12 quintets in 12 seconds. As a result, ISIs had to be sampled within two intervals, shorter or longer compared to the corresponding periodic condition. Periodic sequences at 1Hz have a 1s ISI, the corresponding aperiodic condition sampled ISI from two distributions with mean equal to 250ms and 1500ms. Similarly, periodic sequences at 2 Hz have a 500ms ISI, the corresponding aperiodic condition sampled ISI from two distributions with mean equal to 100ms and 733ms. In each aperiodic condition, the average over both sampled distributions approximates the periodic condition (1.07 Hz and 2.2 Hz respectively). Due to the nested temporal structure of the stimuli it was not possible to create aperiodic sequences at average rhythms of 4 Hz and 8 Hz, as the interval between quintets was too short. The participants performed 80 trials of the target detection task on aperiodic stimuli. Stimuli were presented in blocks of 16 trials, in which the average rhythm was constant.

D’ scores of the pilot experiment approached ceiling for many of the participants. Therefore, in experiment 2 (n = 20; 13 females, 7 males) we adjusted the difficulty range of the task to capture a modulation of behavior by periodicity. In addition, the stimuli were shortened to 6s, the (average) rhythm was fixed at 2 Hz and the carrier frequency at 1100 Hz, to limit the number of conditions and experiment duration. Aperiodic stimuli in experiment 2 were created similar to aperiodic stimuli in the pilot. Rhythmic stimuli (periodic predictable, aperiodic unpredictable) were compared to a temporal cue condition to test whether the predictive benefits derived from periodicity are larger than those derived from a single temporally predictive cue. Trials were presented in blocks (grouped by condition periodic, aperiodic, cue); within each block, trials were randomized, and the block order was counterbalanced across participants. Each participant completed a total of 432 trials. Task difficulty was determined through a staircase procedure preceding every block of trials, in which TS were set to achieve a behavioral performance at 70% detection threshold, as determined by means of a 2 down 1 up procedure. The termination criterion was after 200 trials or 15 reversals. The TS varied on a fixed step-size of 10 logarithmically spaced steps between 7ms and 1.5 ms (see S3 Fig for average TS size per staircase preceding a block).

Statistical analysis

All analyses were conducted in MATLAB 2017a (The MATHWORKS Inc., Natick, MA, 234 USA). For each participant, the d’ sensitivity index of signal detection theory and mean log-reaction times (logRT) of correct trials were calculated. Reaction time was calculated relative to target onset.

Statistical analysis of reaction times and sensitivity were carried out using a Generalized Linear Mixed Model (GLMM) with Matlab’s fitGLME function. We assessed the model fits using likelihood ratio tests (using the function compare for GLMM). Contrasts were carried out performing an F-test on the specified fixed effects of the GLMM (using the function coefTest). Reaction times were fitted using the default identity link function, unlike the fitting of the sensitivity data for which a probit link function was used as described in more detail below.

Traditionally, d’ is estimated by counting the frequency of an observer reporting ‘yes’ conditional on the presence and absence of a signal (i.e. the hit and false alarm rates) and taking the difference of these values on a z-transformed scale [40]. In the present work, statistical group analyses on d’ are carried out using a GLMM [41] We estimate both model parameters and d’ simultaneously to determine the effect of carrier and target position, periodicity and temporal cueing on the population, rather than estimating d’ on each condition separately and feeding the estimated values to a second level analysis. This statistical framework extends multiple linear regression to non-normal data such as count data and binary outcomes and it is more suited to handle extreme cases (100% hits or 0% false alarms). Within this framework, d’ can be estimated by linearly modeling the behavioral outcomes (i.e. ‘yes’ or ‘no’) with a predictor X coding for the presence or absence of the target (see right side of Eq (1); and “Target” predictor in Table 2). To fit an equal-variance Gaussian signal detection model an inverse Gaussian (probit) link function is used, where g is the link function and X represents the presence or absence of the signal.

(1)
thumbnail
Table 2. Wilkinson notation of final model in each experiment.

https://doi.org/10.1371/journal.pone.0234251.t002

When the signal is absent (i.e. X = 0), β0 provides an estimate of the normal quantile of the false alarm rate. When the signal is present (i.e. X = 1), β1 reflects the difference between hit and false alarm rate on the probit scale (hence, the difference between z-scaled hit and false alarm rates), or d’. The different experimental conditions are then added as predictors, and the estimated d’ for each of these conditions (hence our effect of interest) is described by the interaction term between X (‘target’) and the respective condition predictor (see [41]chapter 3.3.5). Unstandardized effect sizes (betas) are reported in units of the dependent variable (d ‘ or logRT), allowing for a meaningful comparison, in line with general recommendations on how to report effect sizes in psychological research [42].

For our visualization, we estimated d’ as it is traditionally computed. Standard errors of d’ group effects displayed in Figs 2 and 4 were obtained by non-parametric bootstrap sampling of estimated d’ values, carried out at the subject level (N = 1000). The mean was used as a measure of central tendency around which 95% confidence intervals were created. All planned contrasts were corrected for multiple comparisons, using Bonferroni correction.

thumbnail
Fig 2. Both, the rhythm and carrier frequency parametrically affect sensitivity.

Average d’ of each participant (colored dots). Horizontal line indicates group mean, and errorbars depict bootstrapped 95% confidence intervals (at subject-level). A Experiment 1a; target occurred after a constant number of quintets per rhythm. B Experiment 1b; target occurred after constant time across rhythms.

https://doi.org/10.1371/journal.pone.0234251.g002

Results

Comparing periodic sequences at different carrier frequencies (experiment 1)

We investigated the benefits in perceptual sensitivity and reaction times associated with rhythmic sound presentation as a function of the rhythm and the carrier frequency in two experiment variants in which we varied the target position within a sequence (Experiment 1a and 1b). The data from both experiments were fitted using a single large GLMM (see Table 2 for final notation of model).

Slow entraining rhythms improve target detection.

The analysis showed a significant interaction of experiment and rhythm on d’ (F (3,864) = 4.528, p < 0.01), (see Table 3). Therefore, we analyzed the effect of rhythm separately for experiment 1a and 1b, revealing an effect of rhythm on sensitivity in both, experiment 1a (Fig 2A), F (3,432) = 9.3704.19, p < 0.001) as well as experiment 1b (Fig 2B), (F (3,432) = 26.083, p < 0.001). Follow-up tests showed, for both experiment variants, a parametric effect of rhythm on sensitivity (see Table 4). Counter to our hypothesis of an inverted U-shape where 4 Hz would perform best, we observed a parametric effect of rhythm. The slowest rhythms (1 Hz & 2 Hz) led to significantly higher sensitivity compared to the fastest (8 Hz) in both experiments. In addition, in experiment 1b the slowest rhythms (1 Hz & 2 Hz) led to significantly better sensitivity than 4 Hz as well. (see Table 4 for specific contrasts). We did not observe this parametric effect of rhythm on reaction times (Fig 3) (F (3,431) = 2.498 p >0.05).

thumbnail
Fig 3. Carrier frequency, but not rhythm, affects reaction times in both experiments.

Mean logRT of each participant. Horizontal line indicates group mean, and errorbars depict SEM. A Experiment 1a; target occurred after a constant number of quintets per rhythm. B Experiment 1b; target occurred after constant time across rhythms.

https://doi.org/10.1371/journal.pone.0234251.g003

Carrier frequency of the stimulus affects target detection and reaction time.

We observed a parametric main effect of carrier frequency on sensitivity F (2,864) = 12.2, p<0.001). Participants were more sensitive in detecting a target when listening to a sequence with a 200 Hz carrier compared to the high carrier frequency at 3100 Hz (beta = -0.9364; F (1, 864) = 19.42, p = <0.001), but also when comparing the 1100 Hz carrier against 3100 Hz carrier (beta = -0.8381; F (1, 864) = 14.92, p<0.001). The difference in sensitivity between the 200 Hz and 1100 Hz was not significant (beta = 0.0983; F (1, 864) = 0.18). In addition, we observe that the carrier frequency had a significant effect on logRT (F (2,431) = 7.539, p< 0.001). Comparisons showed that the lowest carrier frequency led to significantly faster responses compared to the middle carrier (beta = -0.1078; F (1, 431) = 5.87, p<0.05), and the highest carrier frequency (beta = -0.1709; F (1, 431) = 14.74, p<0.001) (Fig 3).

Carrier Frequency and rhythm do not interact in their effects on sensitivity and reaction time.

The interaction of rhythm by carrier was not significant in reaction times (F (6,431) = 1.125, p > 0.05) or sensitivity (F (6,864) = 2.0454, p > 0.05).

Effect of (a)periodic sequences and a temporal cue at perceptual threshold (experiment 2)

Experiment 2 compared the effect of periodic predictable and aperiodic unpredictable sequences of 6 second length to a temporal cueing condition with a cueing interval matching the ISI of the sequences (500ms). We fitted two GLMMs for the reaction time data and d’ data respectively. Each model consisted of the fixed-effect within-subject factor temporal structure (3 levels; periodic, aperiodic, cue).

Temporally predictable stimulation (through a periodic rhythm or a cue) improves sensitivity.

We compared target detection sensitivity in three temporal context conditions: predictable periodic, unpredictable aperiodic, (predictable) temporal cue. We found d’ to vary significantly as a function of temporal context (Fig 4A, F (2, 114) = 52.663, p< 0.001). Comparisons between the conditions revealed significant differences in d’ between the predictable periodic and unpredictable aperiodic sequences (beta = 0.5903; t (1, 114) = 7.95, p< 0.001), as well as between the aperiodic sequence and the temporal cue (beta = 0.7154; t (1, 114) = 9.3612, p<0.001). The difference in d' between predictable periodic sequences and the temporal cue was not significant (beta = 0.1251; F (1, 114) = 2.49, p>0.05).

thumbnail
Fig 4. Experiment 2. When controlling for task difficulty, periodicity and a temporal cue improve hit reaction time and d’ compared to the aperiodic condition.

TS size between 1.5-7ms (70% detection threshold). See S3 Fig for average TS size. A d’ per participant. Errorbars depict bootstrapped confidence intervals (at subject-level). B Mean logRT of each participant. Errorbars depict SEM.

https://doi.org/10.1371/journal.pone.0234251.g004

Predictable temporal cue and a predictable periodic sequences improve reaction time compared to an aperiodic unpredictable sequence.

The analysis of reaction times yielded a significant, yet different pattern between conditions (Fig 4B, F (2, 57) = 48.1, p< 0.01). Comparisons revealed faster correct responses for periodic predictable than aperiodic unpredictable sequences (beta = -0.3317; t (1, 57) = -4.894, p< 0.001). Moreover, participants responded faster to temporally cued targets than periodic rhythms, (beta = -0.3331; F (1, 57) = 24.15, p<0.001) as well as aperiodic (beta = -0.6647; t (1,57) = 9.808, p<0.001) rhythms. Thus, temporal predictability (whether through a cue or a periodic rhythm) led to an improvement of auditory sensitivity, while participants additionally benefit in their response times from the periodicity of stimulation compared to aperiodic stimulation, and a temporally predictable cue leading to the fastest response.

Discussion

Using a temporal shift detection task we asked how the rate and the carrier frequency of a predictable periodic rhythm influence both reaction time and sensitivity of perceptual decisions. Moreover, we asked whether the predictive advantage derived from periodic stimulation is larger than that afforded by aperiodic stimulation or a single temporal cue. The data show that (1) the largest sensitivity improvement is observed for the slowest rhythm (1 Hz); (2) sensitivity improvement is larger for low-frequency (200 Hz) carriers compared to 1100 Hz and 3100 Hz carriers; (3) periodic stimulation significantly reduces reaction times compared to aperiodic stimulation (speeded responses were observed both at perceptual threshold as well as above threshold during the pilot experiment); (4) a response to a temporal cue is faster than a response to periodic stimulation (5) periodic stimulation and cueing significantly increase sensitivity compared to aperiodic stimulation.

Experiment 1. Largest sensitivity improvements occur at slowest rhythm

Psychophysical findings show that sensitivity towards amplitude modulation detection of noise is highest for humans in the (speech) range of 2–4 Hz, while highest for macaques in the range of 30–60 Hz [43]. These and other findings have led to the notion that the human auditory cortex is considered to be speech-ready, therefore, we expected a peak in perceptual sensitivity with periodic sound presentation around 4 Hz. However, our results show that listeners’ performance was highest at a slow rhythm and decreased with increasing rate of rhythm. A similar pattern of preference for slow rhythms (e.g. 2 Hz) as opposed to faster rhythms (8 Hz) has also been shown for cortical synchronization (entrainment) to speech in noise. Phase-locking of neural activity to speech embedded in noise decreased from low (2 Hz) to high (8 Hz) frequencies, correlating with speech intelligibility [44]. This pattern resembles the linear decrease across rhythms observed here and supports the predominant role of delta band frequencies in auditory processing.

Moreover, we speculate that our findings may relate to the nature of the task and stimuli we used. The acoustic features of the isochronous stimuli may be closer to music and its temporal modulations than to the modulations inherent in speech. Temporal modulations in western music peak between 0.5 and 3 Hz, depending on the instrument and may contribute to a preference for slower modulations. As to why the peak of the modulation spectrum in music may be lower than that of spoken speech, it has been suggested that music like language is limited by the dynamic rate of movement of the effector (i.e. the frequency range where movement is most efficient; usually hands and arms in the case of music and articulators in the case of language) [21]. Slow rhythms (1 Hz and 2 Hz) approximate the rate of spontaneous, hence most efficient, motor tempo (around 1.5 Hz for adults) as measured by spontaneous tapping-tasks [45]. A recent review substantiates the link between auditory processing and the motor system, suggesting a downward propagation of temporal predictions from the motor system involving delta-oscillations that shape auditory perception by imposing temporal constraints [46].

Lastly, we show that reaction times were not modulated by different rhythms. Preparatory response processes are typically studied in foreperiod (FP)—reaction time experiments, in which it is a classical finding that both the duration of the FP (usually in the range of seconds) as well as the variability of FP across trials within a block have a considerable effect on reaction times [33]. In the present study, ISI within a block were constant (i.e. low variability across trials), allowing the participant to prepare a motor response equally probable across conditions. Moreover, the absence of a difference in reaction times suggests that the time-range tested here allowed for non-specific (motor) preparations across all rates. Despite no difference in reaction times across rates, a difference in sensitivity across rates was observed, highlighting the perceptual benefit of rhythmic sound presentation, especially for slower rhythms.

Experiment 1. Carrier-dependent improvement of sensitivity and reaction times

Surprisingly, we found that the sensitivity decreased with increasing the carrier frequency. Based on the literature on temporal modulation processing in humans, we would expect higher sensitivity for higher carrier frequencies, as sounds are encoded by auditory spectral filters (tonotopic mechanism). These spectral filters are narrower at lower frequencies and wider at higher frequencies and limit the temporal resolution of the auditory system. Therefore detection performance of a temporal shift should decrease at lower carrier frequencies where the bandwidth is narrower [47]. Indeed, increasing modulation detection thresholds for decreasing center frequencies have been observed [48,49]. We controlled for this effect, by adjusting the stimuli to have equivalent rectangular bandwidth (ERB) and equivalent perceived intensity. Despite this equalization we observe an effect of carrier frequency. We suggest that the perceptual benefit at the low carrier frequency may be a product of temporal coding mechanisms. Phase locking up to 250 Hz has been observed in human intracortical recordings using click trains [50]. This temporal encoding seems to provide a perceptual benefit when making judgments about the presence of a temporally shifted target thereby improving sensitivity. Additionally, this benefit in sensitivity for the lowest carrier frequency was accompanied by an increase in response speed. Simpson, Reiss and McAlpine [25] estimated sensitivity to a range of amplitude modulation frequencies (0.5 Hz to 33 Hz) across a large number of frequency carriers including and beyond the range of carriers tested here. Their results suggest that at low carrier frequencies cortical modulation filters are most sensitive to slow modulation rates, similar to the rates used here (1–8 Hz). They speculate that such a frequency-dependent modulation tuning is related to the neural processing of acoustic properties of speech [51]. Lastly, these findings suggest that behavioral effects of entrainment, may depend on the type of spectral stimulation used for entraining and probing, which may be especially relevant in the context of spectral tasks [26].

Experiment 1. Effect of rhythm not confounded by target position

The number of preceding quintets was varied between experiment 1a and 1b. In experiment 1a we kept the number of preceding quintets until the TS constant across rhythms. This systematically reduced the time until a target appeared with increasing rhythm. In experiment 1b, we kept the time at which targets appeared constant across rhythms, thereby presenting an increasing number of preceding quintets with increasing rhythms prior to presenting a target quintet. Our rationale to assess and compare these two experiment versions being, that the strength of entrainment might increase with additional repetitions. The effect of rhythm would then be confounded by the systematic effect of target position in a sequence. Indeed, we show in experiment 1a as well as 1b an effect of rhythm. By having controlled the position of the target across the different presentation rates we therefore conclude that there is a difference between rhythms and said effect was not confounded by the systematic effect of target position in a sequence. In experiment 2 targets embedded in aperiodic and periodic sequences occurred late within a sequence (similar to Experiment 1b). It would be interesting to see what the effect of aperiodic and periodic sequences is when targets are presented earlier. We may speculate that this would further enhance the detection difficulty of the task enhancing the benefit of periodicity.

Experiment 2. Effects of periodicity and cueing diverge in logRT and sensitivity

The results of experiment 2 show that both perceptual sensitivity and reaction times are improved when stimuli are presented in periodic rhythms compared to aperiodic rhythms, when using a temporal detection task at perceptual threshold. Note that these effects are only apparent when controlling for task difficulty. In a pilot study, where this was not done, we did not observe an effect of periodicity on sensitivity (see S1 File). The results of experiment 2 support earlier findings reporting a benefit of predictability in periodic over aperiodic stimulation [47,5255]. This is in line with the idea of oscillatory entrainment and dynamic attending theory [56] and highlights the relevance of using a task with sufficient difficulty, in contrast to the pilot. However, this finding points to a more general question of the benefit of entrainment in everyday life as most stimuli we encounter are seldom at perceptual threshold.

In addition, we were interested in contrasting the benefit in reaction times and sensitivity of a predictable periodic rhythm to a predictable (but not periodic) temporal cue. We expected a benefit in reaction times similar to [6] (albeit different) showing a benefit of a periodic predictable sequence over an aperiodic, predictable condition of increasing tempo. Moreover, we expected a benefit in sensitivity for the periodic rhythm compared to the temporal cue. We show in experiment 2, that a temporal cue enables a participant to respond significantly faster than a periodic rhythm, while there is no difference in perceptual sensitivity between these two forms of temporal structure. The cue predicts target occurrence with a 75% validity (25% of trials were catch-trials without a target). Similarly, the periodic rhythm allowed participants to predict when a subsequent target may occur with a similar validity for the trial. Yet participants were faced with the additional uncertainty as to which quintet within a sequence may contain the target. This uncertainty may be the reason for the observed slowing of reaction times in the periodic condition compared to the cue. The greater effectiveness of the temporal cue compared to the rhythm in terms of reaction time suggests that the temporal cue induced a more confident temporal expectation, in line with the finding that target-occurrence uncertainty impairs reorienting, thereby lengthening of reaction times [57]. It would be of interest to compare these two conditions under similar uncertainty of target occurrence.

Interestingly though, despite said larger uncertainty the periodic condition is not significantly worse than the temporal cue in terms of sensitivity, therefore we suggest that a benefit of a periodic rhythm to some extent countered the increased uncertainty. Ten Oever et al. have shown that entrainment of low-frequency oscillations in the delta—range serves a mechanistic role in enhancing perceptual sensitivity of subthreshold periodic, predictable sound sequences compared to aperiodic sequences [18]. Under the hypothesis that oscillations align more efficiently to a rhythmic structure as compared to a single interval, it is surprising that the periodic sequences here did not result in a sensitivity benefit over the temporal cue. See for instance [58,59]showing an accuracy benefit of periodic sequence over cue in duration estimation. We speculate that the additional uncertainty of when within a sequence a target may occur, may have countered a benefit of the entraining rhythmicity of the sequence. Again, it would be of interest to compare instances of a predictable periodic rhythm and a predictable temporal cue conveying the same uncertainty. We would then predict sensitivity of the periodic predictable condition to be higher than the temporal cue condition.

At the neurophysiological level, we speculate that such a mechanism might be implemented by a more flexible phase reset model of neuronal oscillations [15,60,61], (see [62], for a recent review), in which the motor system tracks temporal regularities [46]. Further research will be necessary to elucidate the mechanism and nature of top-down predictions and how these affect auditory perception.

Conclusion

We show that overall temporal modulations in the range of 1–8 Hz are better processed at lower carrier frequencies, as measured by reaction times and sensitivity (experiments 1a and 1b). Additionally, the same results point to the perceptual benefit of slow rhythms (1 and 2 Hz) over faster ones (4 and 8 Hz). The regularity of rhythms enables the use of prediction to make more precise inferences about when we should expect to find a target embedded within the stream and, as a result, improve detection performance. Indeed, we show in experiment 2 a perceptual benefit of periodic predictable sequences over aperiodic unpredictable sequences in terms of reaction times and sensitivity (the latter only present when using a sufficiently difficult task). Crucially, in experiment 2 we show that the predictive value of a cue and that of a temporal rhythm do not differ in terms of the sensitivity in detecting a target, albeit it has to be noted that the periodic condition contained a larger uncertainty where the target would appear. These findings encourage us to reflect on what the perceptual benefits of periodicity and predictability respectively are, as these effects may diverge when teased apart using different tasks thereby allowing to make assumptions about the underlying mechanisms involved. Here we showed that both the cue and the rhythm induce confident temporal expectancies about the future occurrence of targets to effectively prepare and allocate attentional resources. Taken together, we may speculate that multiple processes may co-occur that facilitate the processing of rhythmic and predictable stimuli, in which oscillations form an intrinsic temporal constraint, controlled by temporal predictions. Potentially, cueing effects occur due to a single phase-reset of ongoing oscillations and similarly a rhythmic benefit occurs due to either a stimulus driven entrainment of oscillations or repeated top-down phase resets.

Acknowledgments

The authors thank Jacek Chudy and Louisa Zielke for help with data collection. We thank Sanne ten Oever for feedback on the manuscript and the reviewers for their valuable comments that helped improve this work. This work was funded by The Netherlands Organization for Scientific Research (NWO) Research Talent Grant 406.17.200 awarded to MH, EF and PdW. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  1. 1. Nobre AC, Van Ede F. Anticipated moments: Temporal structure in attention. Nat Rev Neurosci [Internet]. 2018;19(1):34–48.http://dx.doi.org/10.1038/nrn.2017.141 pmid:29213134
  2. 2. Andreou LV, Kashino M, Chait M. The role of temporal regularity in auditory segregation. Hear Res [Internet]. 2011;280(1–2):228–35.http://dx.doi.org/10.1016/j.heares.2011.06.001 pmid:21683778
  3. 3. Ellis RJ, Jones MR. Rhythmic context modulates foreperiod effects. Atten Percept Psychophys [Internet]. 2010 Nov [cited 2016 Nov 18];72(8):2274–88. http://www.springerlink.com/index/10.3758/BF03196701 pmid:21097869
  4. 4. Jones MR, Moynihan H, Mackenzie N, Puente J. Temporal Aspects of Stimulus-Driven Attending in Dynamic Arrays. Am Psychol Soc [Internet]. 2002 Jul [cited 2016 Dec 7];13(4):313–9. http://www.ncbi.nlm.nih.gov/pubmed/12137133
  5. 5. Lawrance ELA, Harper NS, Cooke JE, Schnupp JWH. Temporal predictability enhances auditory detection. 2015;135(6):1–10.
  6. 6. Morillon B, Schroeder CE, Wyart V, Arnal LH. Temporal Prediction in lieu of Periodic Stimulation. J Neurosci [Internet]. 2016;36(8):2342–7. http://www.jneurosci.org/content/36/8/2342.short pmid:26911682
  7. 7. ten Oever S, Schroeder CE, Poeppel D, van Atteveldt N, Zion-Golumbic E. Rhythmicity and cross-modal temporal cues facilitate detection. Neuropsychologia [Internet]. 2014;63(1):43–50. http://dx.doi.org/10.1016/j.neuropsychologia.2014.08.008
  8. 8. Lakatos P, Karmos G, Mehta AD, Ulbert I, Schroeder CE. Entrainment of neuronal oscillations as a mechanism of attentional selection. Science [Internet]. 2008 [cited 2016 Nov 18];320(5872):110–3. http://www.ncbi.nlm.nih.gov/pubmed/18388295
  9. 9. Lange K. Brain correlates of early auditory processing are attenuated by expectations for time and pitch. Brain Cogn [Internet]. 2009;69(1):127–37. http://dx.doi.org/10.1016/j.bandc.2008.06.004 pmid:18644669
  10. 10. Breska A, Deouell LY. Automatic Bias of Temporal Expectations following Temporally Regular Input Independently of High-level Temporal Expectation. J Cogn Neurosci. 2014 Jul;26(7):1555–71. pmid:24392898
  11. 11. Sanabria D, Capizzi M, Correa Á. Rhythms That Speed You Up. J Exp Psychol Hum Percept Perform. 2011;37(1):236–44. pmid:20718571
  12. 12. Chang A, Bosnyak DJ, Trainor LJ. Rhythmicity facilitates pitch discrimination: Differential roles of low and high frequency neural oscillations. Neuroimage [Internet]. 2019;198(December 2018):31–43. https://doi.org/10.1016/j.neuroimage.2019.05.007
  13. 13. Henry MJ, Herrmann B. Low-Frequency Neural Oscillations Support Dynamic Attending in Temporal Context. Timing Time Percept. 2014;2(1):62–86.
  14. 14. Herbst SK, Obleser J. Implicit temporal predictability enhances pitch discrimination sensitivity and biases the phase of delta oscillations in auditory cortex. Neuroimage [Internet]. 2019;203(August):116198. https://doi.org/10.1016/j.neuroimage.2019.116198
  15. 15. Breska A, Deouell LY. Neural mechanisms of rhythm-based temporal prediction: Delta phase-locking reflects temporal predictability but not rhythmic entrainment. PLoS Biol. 2017;15(2):1–30.
  16. 16. Lakatos P, Musacchia G, O’Connel MN, Falchier AY, Javitt DC, Schroeder CE. The Spectrotemporal Filter Mechanism of Auditory Selective Attention. Neuron [Internet]. 2013;77(4):750–61. http://dx.doi.org/10.1016/j.neuron.2012.11.034 pmid:23439126
  17. 17. Morillon B, Schroeder CE, Wyart V. Motor contributions to the temporal precision of auditory attention. Nat Commun [Internet]. 2014;5:1–9. http://dx.doi.org/10.1038/ncomms6255%5Cnhttp://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=4199392&tool=pmcentrez&rendertype=abstract
  18. 18. ten Oever S, Schroeder CE, Poeppel D, van Atteveldt N, Mehta AD, Mégevand P, et al. Low-Frequency Cortical Oscillations Entrain to Subthreshold Rhythmic Auditory Stimuli. J Neurosci [Internet]. 2017;37(19):4903–12. http://www.jneurosci.org/lookup/doi/10.1523/JNEUROSCI.3658-16.2017 pmid:28411273
  19. 19. Hickok G, Farahbod H, Saberi K. The Rhythm of Perception: Entrainment to Acoustic Rhythms Induces Subsequent Perceptual Oscillation. Psychol Sci. 2015;26(7):1006–13. pmid:25968248
  20. 20. Teng X, Poeppel D. Theta and Gamma Bands Encode Acoustic Dynamics over Wide-ranging Timescales. bioRxiv [Internet]. 2019;(February):547125. https://www.biorxiv.org/content/10.1101/547125v1
  21. 21. Ding N, Patel AD, Chen L, Butler H, Luo C, Poeppel D. Temporal modulations in speech and music. Neurosci Biobehav Rev [Internet]. 2017;81:181–7. https://doi.org/10.1016/j.neubiorev.2017.02.011 pmid:28212857
  22. 22. Poeppel D, Assaneo MF. Speech rhythms and their neural foundations. Nat Rev Neurosci [Internet]. 2020 May 6; http://dx.doi.org/10.1038/s41583-020-0304-4
  23. 23. Moore BC, Glasberg BR. Temporal modulation transfer functions obtained using sinusoidal carriers with normally hearing and hearing-impaired listeners. J Acoust Soc Am. 2001;110(2):1067–73. pmid:11519575
  24. 24. Dau T, Kollmeier B, Kohlrausch A. Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers. J Acoust Soc Am [Internet]. 1997;102(5):2892–905. http://asa.scitation.org/doi/10.1121/1.420344
  25. 25. Simpson AJR, Reiss JD, McAlpine D. Tuning of Human Modulation Filters Is Carrier-Frequency Dependent. PLoS One. 2013;8(8):1–9.
  26. 26. O’Connell MN, Barczak A, Ross D, McGinnis T, Schroeder CE, Lakatos P. Multi-Scale Entrainment of Coupled Neuronal Oscillations in Primary Auditory Cortex. Front Hum Neurosci [Internet]. 2015 [cited 2016 Dec 1];9:655.http://www.ncbi.nlm.nih.gov/pubmed/26696866
  27. 27. O’Connell MN, Barczak A, Schroeder CE, Lakatos P. Layer specific sharpening of frequency tuning by selective attention in primary auditory cortex. J Neurosci [Internet]. 2014;34(49):16496–508. http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=4252556&tool=pmcentrez&rendertype=abstract pmid:25471586
  28. 28. Henry MJ, Obleser J. Frequency modulation entrains slow neural oscillations and optimizes human listening behavior. Proc Natl Acad Sci [Internet]. 2012;109(49):20095–100. http://www.pnas.org/cgi/doi/10.1073/pnas.1213390109 pmid:23151506
  29. 29. Schroeder CE, Lakatos P. Low-frequency neuronal oscillations as instruments of sensory selection. Trends Neurosci. 2009;32(1):9–18. pmid:19012975
  30. 30. Ng BSW, Schroeder T, Kayser C. A Precluding But Not Ensuring Role of Entrained Low-Frequency Oscillations for Auditory Perception. J Neurosci [Internet]. 2012;32(35):12268–76. http://www.jneurosci.org/content/32/35/12268%5Cnhttp://www.jneurosci.org/content/32/35/12268.full.pdf%5Cnhttp://www.jneurosci.org/content/32/35/12268.long%5Cnhttp://www.ncbi.nlm.nih.gov/pubmed/22933808
  31. 31. Arnal LH, Giraud AL. Cortical oscillations and sensory predictions. Trends Cogn Sci [Internet]. 2012 [cited 2016 Nov 18];16(7):390–8. http://dx.doi.org/10.1016/j.tics.2012.05.003 pmid:22682813
  32. 32. Los SA, Knol DL, Boers RM. The foreperiod effect revisited: Conditioning as a basis for nonspecific preparation. Acta Psychol (Amst). 2001;106(1–2):121–45.
  33. 33. Naatanen R, Niemi P. Foreperiod and simple reaction time. Psychol Bull. 1981;89(1):133–62.
  34. 34. Ren Y, Xu Z, Wu F, Ejima Y, Yang J, Takahashi S, et al. Does Temporal Expectation Driven by Rhythmic Cues Differ From That Driven by Symbolic Cues Across the Millisecond and Second Range? Perception. 2019;
  35. 35. Breska A, Ivry RB. Double dissociation of single-interval and rhythmic temporal prediction in cerebellar degeneration and Parkinson’s disease. Proc Natl Acad Sci U S A. 2018;115(48):12283–8. pmid:30425170
  36. 36. Herbst SK, Obleser J. Implicit variations of temporal predictability: Shaping the neural oscillatory and behavioural response. Neuropsychologia [Internet]. 2017;101(January):141–52. http://dx.doi.org/10.1016/j.neuropsychologia.2017.05.019
  37. 37. Jones MR, Moynihan H, Mackenzie N, Puente J. Temporal aspects of stimulus-driven attending in dynamoc arrays. Psychol Sci. 2002;13(4):313–9. pmid:12137133
  38. 38. Brainard DH. The Psychophysics Toolbox. Spat Vis. 1997;10(4):433–6. pmid:9176952
  39. 39. Moore BCJ. An Introduction to the Psychology of Hearing. Bost Acad Press. 2003;3:413.
  40. 40. LUCE RD, GREEN DM. Detection, Discrimination, and Recognition. Psychophys Judgm Meas. 1974;I:299–342.
  41. 41. Knoblauch K, Maloney LT. Modeling Psychophysical Data in R [Internet]. New York, NY: Springer New York; 2012. 21–54 p. http://link.springer.com/10.1007/978-0-387-78171-6
  42. 42. Pek J, Flora DB. Reporting effect sizes in original psychological research: A discussion and tutorial. Psychol Methods. 2018;23(2):208–25. pmid:28277690
  43. 43. O’Connor KN, Johnson JS, Niwa M, Noriega NC, Marshall EA, Sutter ML. Amplitude modulation detection as a function of modulation frequency and stimulus duration: Comparisons between macaques and humans. Hear Res [Internet]. 2011;277(1–2):37–43. http://dx.doi.org/10.1016/j.heares.2011.03.014 pmid:21457768
  44. 44. Ding N, Simon JZ. Adaptive temporal encoding leads to a background-insensitive cortical representation of speech. J Neurosci. 2013;33(13):5728–35. pmid:23536086
  45. 45. McAuley JD, Jones MR, Holub S, Johnston HM, Miller NS. The time of our lives: Life span development of timing and event tracking. J Exp Psychol Gen [Internet]. 2006;135(3):348–67. http://doi.apa.org/getdoi.cfm?doi=10.1037/0096-3445.135.3.348 pmid:16846269
  46. 46. Morillon B, Arnal LH, Schroeder CE, Keitel A. Prominence of delta oscillatory rhythms in the motor cortex and their relevance for auditory and speech perception. Neurosci Biobehav Rev [Internet]. 2019;107:136–42.https://doi.org/10.1016/j.neubiorev.2019.09.012 pmid:31518638
  47. 47. Moore BCJ, Peters RW, Glasberg BR. Auditory filter shapes at low center frequencies. J Acoust Soc Am. 2005;88(1):132–40.
  48. 48. Moore BCJ, Peters RW, Glasberg BR. Detection of temporal gaps in sinusoids: Effects of frequency and level. J Acoust Soc Am. 1993;93(3):1563–70. pmid:8473610
  49. 49. Viemeister NF. Temporal modulation transfer functions based upon modulation thresholds. J Acoust Soc Am. 1979;66(5):1364–80. pmid:500975
  50. 50. Nourski KV., Brugge JF, Reale RA, Kovach CK, Oya H, Kawasaki H, et al. Coding of repetitive transients by auditory cortex on posterolateral superior temporal gyrus in humans: an intracranial electrophysiology study. J Neurophysiol. 2013;109(5):1283–95. pmid:23236002
  51. 51. Giraud A-L, Poeppel D. Cortical oscillations and speech processing: emerging computational principles and operations. Nat Neurosci [Internet]. 2012;15(4):511–7. http://www.scopus.com/inward/record.url?eid=2-s2.0-84859217287&partnerID=tZOtx3y1 pmid:22426255
  52. 52. Rimmele J, Jolsvai H, Sussman E. Auditory target detection is affected by implicit temporal and spatial expectations. J Cogn Neurosci. 2011;23(5):1136–47. pmid:20146603
  53. 53. Rohenkohl G, Cravo A. M, Wyart V, Nobre A. C. Temporal Expectation Improves the Quality of Sensory Information. J Neurosci. 2012;32(24):8424–8. pmid:22699922
  54. 54. Schmidt-Kassow M, Schubotz RI, Kotz SA. Attention and entrainment: P3b varies as a function of temporal predictability. Neuroreport. 2009;20(1):31–6. pmid:18987559
  55. 55. Wollman I, Morillon B. Organizational principles of multidimensional predictions in human auditory attention. Sci Rep [Internet]. 2018;8(1):13466.http://www.nature.com/articles/s41598-018-31878-5 pmid:30194376
  56. 56. Schroeder CE, Wilson DA, Radman T, Scharfman H, Lakatos P. Dynamics of Active Sensing and perceptual selection. Curr Opin Neurobiol [Internet]. 2010 Apr [cited 2016 Dec 4];20(2):172–6. http://www.ncbi.nlm.nih.gov/pubmed/20307966
  57. 57. Correa Á, Lupiáñez J, Tudela P. The attentional mechanism of temporal orienting: Determinants and attributes. Exp Brain Res. 2006;169(1):58–68. pmid:16273403
  58. 58. Barnes R, Jones MR. Expectancy, Attention, and Time. Cogn Psychol. 2000;41(3):254–311. pmid:11032658
  59. 59. Drake C, Botte MC. Tempo sensitivity in auditory sequences: Evidence for a multiple-look model. Percept Psychophys. 1993;54(3):277–86. pmid:8414886
  60. 60. ten Oever S, van Atteveldt NM, Sack AT. Increased Stimulus Expectancy Triggers Low-Frequency Phase reset during restricted vigilance. J Neurosci. 2015;
  61. 61. Wilsch A, Henry MJ, Herrmann B, Maess B, Obleser J. Slow-delta phase concentration marks improved temporal expectations based on the passage of time. Psychophysiology. 2015;52(7):910–8. pmid:25684032
  62. 62. Rimmele JM, Morillon B, Poeppel D, Arnal LH. Proactive Sensing of Periodic and Aperiodic Auditory Patterns. Trends Cogn Sci [Internet]. 2018;22(10):870–82. https://doi.org/10.1016/j.tics.2018.08.003 pmid:30266147