Introduction

Attention-deficit/hyperactivity disorder (ADHD) is a neurodevelopmental disorder accompanied with varying levels of inattention, hyperactivity and/or impulsivity [1]. About 40% to 50% of people show symptoms beyond the childhood, raising the scientific focus to adult ADHD in the past recent years [2]. Besides the core symptoms that are highly linked to executive functioning [3], ADHD patients also show anomalies in basal sensory processing, although this topic is rather under researched. Sensory processing in ADHD is mainly marked by deficits in early stimulus-modulatory processes (e.g., inhibition of irrelevant stimulus characteristics or distractors) especially in the auditory domain, but also reported for the visual sense, touch, taste, smell [4]. Overall, patients with ADHD can be described as being sensory hypersensitive [5, 6].

Despite the interplay of the core symptoms of ADHD and sensory processing are not fully understood yet, the difficulties in sensory processing seem to extend in adulthood [5]. Former studies most frequently investigated single sensory domains. However, the complexity of the environment and our daily-life interaction with it demands to process multiple sensory streams in parallel. By combining different stimulus streams to reach a unified percept, a process that is called multisensory integration (MI), our brain is optimally adapted to detect, locate and differentiate external objects, compared when confronted with only one unimodal stream [7]. To integrate successfully, attention seems to play a major role in MI, whereby the exact role of attention as a condition for MI is subject of an ongoing debate [8]. Depending on stimulus complexity, MI can occur at early, pre-attentive stages (bottom–up triggered MI) for simple stimuli or at later stages, i.e., top–down attentional effects for more complex stimuli (e.g., semantic stimuli) [8, 9]. With its attentional difficulties and abnormalities in unimodal sensory processing, ADHD is a valuable clinical condition to study MI, raising the intriguing question whether ADHD patients show deficits in MI, in dependence of varying attentional demands. Only a few studies investigated MI in adult ADHD, yielding mixed results. In one study, ADHD patients did not improve speech-in-noise understanding to a similar degree like healthy controls, when the visual source, namely the lip movements of the speakers were added. In other words, there were no beneficial effects from a multimodal source in ADHD patients [10]. Such a multimodal benefit, measured in an acceleration in the detection of multimodal stimuli compared to unimodal stimuli, was evident in McCracken et al. [11]. They mixed simple stimuli (an auditory beep) with complex stimuli (the spoken word ‘red’), instead of using only complex stimuli like speech. A recent study investigated MI using simple stimuli (namely the sound-induced flash illusion, where simple visual flashes and auditory beeps were used) and complex stimuli (namely the McGurk illusion where speech-like phonemes are used) in an adult ADHD sample compared to healthy controls [12]. Here, ADHD patients showed intact MI abilities for simple stimuli but failed to integrate the complex stimuli. While the bottom–up MI seems to be intact, the top–down MI in ADHD patients seems to be deficient.

In everyday life, not only the attentional demands, but also in addition the perceptual demands can be very different. The model of perceptual capacity was suggested in Lavie’s perceptual load theory to be limited but this capacity is always involuntarily filled. Selective attention, i.e., the ability to attend to one source of information while ignoring others [13, 14] is therefore dependent on the available perceptual capacity.

When confronted with a task composed of a high load, the perceptual capacities are reached resulting in an early selection, hence no distractors or task-irrelevant stimuli can be processed, contrary, in a low-load task where free perceptual resources allow for additional processing of distractors due to a late perceptual selection [15]. Studies combining perceptual load and MI in healthy subjects yield mixed effects but there seem to be an ‘immunity-effect’ of MI for perceptual load [16,17,18]. In other words, reaction time benefits for the detection of multimodal stimuli, as compared to unimodal stimuli presentation, seem to be robust irrespective of the perceptual load. In ADHD, studies investigating MI in dependence of the given perceptual load are missing. In paradigms, where the perceptual load was manipulated (without measuring sensory processing or MI), ADHD patients showed less distraction for the higher load condition [19, 20]. The current study challenges the robustness of bottom–up MI in ADHD by manipulating simple stimuli MI through additional task demands. This manipulation is realized by varying the attentional capture through perceptual loads.

By combining bottom–up stimuli with varying amounts of perceptual capacities to process those stimuli, we assume, that ADHD-patients’ limits in MI are reached for the high load, since they are more focused and less distracted by the sensory stimuli. In other words, the immunity effect for load-dependent MI should not hold for ADHD patients since once their perceptual capacities are reached in the high-load conditions, the reaction time benefit for multimodal stimuli should be diminished.

Materials and methods

Participants

Eighteen adult ADHD patients [7 females, age: 34.7 (SD: 9.3) years] were recruited at our psychiatric outpatient department and compared to 18 healthy controls [7 females; age: 35.6 (SD: 11.8) years], recruited through bulletin boards.

According to international and German guidelines [21, 22], patients were diagnosed by a specialist psychiatrist following a detailed clinical and psychosocial interview that integrates somatic differential diagnosis, the patients’ psychiatric and developmental history, and observer reports.

ADHD symptoms were further assessed with the Conners Adult ADHD Rating Scale (CAARS-long version self-rated) [23] and the Wender Utah Rating Scale (WURS-k) for ADHD childhood symptoms [24]. Borderline personality disorder, as a frequently associated comorbidity was screened with the borderline symptom list (BSL-23) [25].

In addition, depressive symptoms were self-rated with the Becks Depression inventory [26]. In case of medication intake (e.g., methylphenidate), ADHD patients were asked to discontinue at least 24 h prior to the experiment. Adult sensory profile (ASP) was administered to measure sensory processing according to Dunns’ model, i.e. person being sensory seeking, sensory avoidant, sensory sensitive or low registering to sensory stimuli [27].

All participants gave written consent to take part, and the study was approved by the ethics committee of the medical faculty of the University of Bonn (408/20).

Behavioral experiment

The experiment took place in a light-dimmed room and participants were seated at a distance of approximately 60 cm in front of a screen. Auditory stimuli were presented via headphones (Sennheiser HD300 pro) with adjusted sound pressure level for each participant.

Paradigm

Paradigm was adapted by Lunn et al. [16] and programmed with Presentation software (Version 22.1, Neurobehavioural Systems, http://www.neurobs.com). Each trial began with a central fixation cross (presented for 500 ms) followed by the load condition, the sensory condition or the control condition. In the load condition, the participants’ task was to press a button as fast as possible when they perceived an X on the screen. The target letter was, in case of the high-load condition, embedded among six circularly arranged (2.0o radius) distractor letters (H, K, M, V, W, Z) and in the low-load condition embedded among placeholders (O’s). The position of the target letter was randomized across trials and total presentation time was 200 ms, with 4 load blocks (each comprised of 40 trails) for low- and high-load respectively. The sensory conditions comprised peripheral targets presented either left or right from the circular array of letters and the participants should indicate by button press, whether the event occurred left or right. Of note, in those trials, no target letter was present, but the distractors were visible according to the low/high-load condition. The sensory stimuli comprised a short auditory beep tone (1KhZ; 100 ms) and a short visual flash presented unimodal (auditory or visual alone) or multimodal (parallel presentation of auditory and visual stimuli). In total, 18 sensory events were presented interleaved for each load (with target letter) condition. As a control condition, only distractors were presented hence neither an ‘X’ nor an event of the sensory trial was presented (22 trials in within a block). Here, participants should not react at all. The participants completed four blocks per load with an interleaved design counter-balanced across subjects (either ABBAABBA or BAABBAAB) resulting in total of 640 trials. Prior to the main experiment, a training took place entailing all conditions (~ 5 min) to make the participants familiar with the tasks.

Statistical analysis

All demographical variables were group-wise compared with independent samples t tests.

For all conditions, reaction times were outlier corrected with the MATLAB build in function ‘rmoutliers’ i.e., defining an outlier with a value above 3 median absolute deviation (MAD). Reaction time data for the unimodal and multimodal conditions were then fitted to one linear mixed model (LMM) using the package lme4 implemented in R [28]. LMMs are advantageous over classical variance analysis approaches since they also account for single observations for each subject in a given condition, rather than averaging the condition/subject [29]. With reaction time as a dependent variable, model comparisons were realized with Akaike information criterion (AIC). Here, the model with fixed effects of group, loading and modality (uni-, multimodal) and random effects of subject and trial order lead to the lowest AIC. Variance components were estimated with restricted-maximum likelihood (REML). Type II Wald Chi-square tests were applied to derive p values from interactions. To further inspect the influence of group, loading and conditions on RT, least-squared means were extracted and compared in a contrast analysis with Bonferroni p value adjustments.

Percentage of right answers, as well as commission and omission errors of the sensory conditions were averaged per condition and compared by 2 × 2 × 2 two-way repeated measurements analysis of variance (ANOVA) with group, modality, and loading as factors. Similarly, comparisons of the reaction times and errors of the control condition were entered in separate ANOVA’s as described above.

Modeling multisensory reaction time acceleration

Presenting multimodal cues can lead to a faster detection in terms of reaction time compared to uni-sensory presentation only. The, so-called redundant signal effect leads to perceptual decision-making by statistical facilitation where the faster processed uni-sensory cue triggers a motor response [30, 31]. Of note, this is opposed to ‘pooled’ sensory processing models where an interaction between the uni-sensory processing modalities reaches a threshold enabling a motor response. To address MI processing as a function of statistical facilitation, the race model inequality was calculated [32]:

$${P}_{AV}(t)\le {P}_{A}\left(t\right)+{P}_{V}(t)$$

where P representing the cumulative probability-motor response at a time t for the respective conditions denoted with A auditory only, V visual only, AV audiovisual. The race model assumes that no interactions between uni-sensory processing units occur, hence the sum of the cumulative distribution function of the reaction time in AV condition may not exceed the sum of the cumulative distribution for the two uni-sensory processing units. The race model was implemented in MATLAB using the RSE toolbox [33]. As a first step, individual cumulative distribution functions were calculated for each condition per load. Next, the Miller’s bound was calculated i.e., the upper bound for assuming statistical independence of the two uni-sensory processing units. Violation of the race model was calculated by examining the size of the area between Miller’s bound and the distribution of the MI distribution function [34] and statistical assessed by one-sample t tests for each load within the groups. The upper (race model violation) and lower (multisensory benefit) bounds were group-wise compared with a two-sample t test.

Results

Demographics

For an overview of demographical variables, see Table 1. There was no significant group difference in terms of age, and gender. ASP were significantly different between groups for low registration (t(35) = − 5.07, p < 0.001), sensory avoidance (t(35) = − 2.48, p = 0.002), sensory sensitivity (t(35) = − 3.59, p = 0.001) but not for sensation seeking (t(35) = − 0.287, p = 0.77).

Table 1 Demographical description

Perceptual load

Sensory conditions

LMM with fixed factors of group, loading and modality showed no group differences of reaction time (χ2 = 0.068, p = 0.79). However, a significant group*loading*modality interaction was found (χ2 = 13.62, p < 0.001). This effect is explained by faster reaction time estimated least square means for multimodal stimuli, compared to unimodal stimuli (see Fig. 1) suggesting successful MI for both groups. MI within-group comparisons for load revealed no significant differences (pADHD = 1.0; pcontrols = 0.3). In terms of errors, ANOVA revealed no group differences for commission errors (F(1, 35) = 2.251, p = 0.13) and commission errors (F(1,35) = 0.38, p = 0.68).

Fig. 1
figure 1

A Predicted reaction times for sensory conditions; A auditory, V visual, AV audiovisual; B reaction times of the control-condition

Control condition

ANOVA analysis revealed a significant effect for reaction times for load (F(1,35) = 154.4, p < 0.001) but not for group (F(1,35) = 0.26, p = 0.61). Similarly, commission errors were significant for load (F(1,35) = 8.93, p = 0.005) but not for groups (F(1,35) = 0.76, p = 0.38). Load led also to more omission errors (F(1,35) = 29,86, p < 0.001) but not different across groups (F(1,35) = 0.17, p = 0.6).

Race model

One sample t tests between empirical and predicted CDF revealed significantly race model violations for low load for ADHD patients (t(17) = 0.08, p < 0.001) and controls (t(17) = 0.03, p = 0.004). For high load, only controls showed race model inequalities (t(17) = 0.02, p = 0.006), but not ADHD patients (t(17) = − 0.02, p = 0.21). No group differences were evident for multisensory benefit (see Fig. 2). ADHD patients violated the race model significantly higher in the low-load condition compared to controls (t(35) = 0.002, p = 0.019), whereas there were no differences for the high-load condition (t(35) = − 0.016, p = 0.81).

Fig. 2
figure 2

A Sample race model violation plot; B group-wise comparison of multisensory gain and race model violation; **p < 0.001

Discussion

This is the first study systematically addressing the robustness of MI in an adult ADHD sample for simple stimuli with varying perceptual loads. We hypothesized that ADHD patients cannot successfully integrate when perceptual capacities are high. As a result, patients with ADHD showed robust integration rates similar to controls, irrespective of load. This was marked by faster reaction times for multimodal stimuli compared to unimodal stimuli. Also, error rates across all conditions were similar to healthy controls, further suggesting that MI is not load-dependent in adult ADHD.

The load manipulation was successful, since we found significant higher reaction times in the high-load condition compared to low load for both groups in the visual search task. However, load manipulation did not have an influence on attentional capture to the sensory stimuli. This mirrors the ongoing debate of the role of attention and MI. On the one hand, stimuli can be integrated bottom–up driven via a pre-attentive stage, without the need of attentional deployment from higher-order areas [8, 9]. This type of integration is facilitated through stimulus features, such as temporal and spatial coincidence, with the closer they occur, the more likely MI takes place pre-attentively. On the other hand, more complex stimuli (e.g., linguistic material) require top–down attentional deployment. In our study, the applied stimuli comprised low-level properties as we used simple beeps and flashes. Therefore, it could be the case that in our study, the automaticity of stimuli processing is independent of the load since the basal stimuli may be integrated at a pre-attentive stage. The goal of the study was to challenge the robustness of MI for simple stimuli in ADHD. Consequently, one can conclude, that the process of bottom–up MI seems to be robust in ADHD and is not attenuated by additional task demands. To further challenge the robustness of load-dependent bottom–up MI, it would be advisable to manipulate the stimulus complexity. It has been shown, that MI for complex, speech-like stimuli, that require top–down attention is mitigated in patients with ADHD associated with higher resting-state connectivity from polymodal to unimodal areas [12]. A study by Alsius et al. could show, when a dual task demands high levels of attention, MI for speech-like stimuli declines [35]. It would be intriguing to study that phenomenon in ADHD as to whether MI for complex further mitigates in a dual task compared to simple task. Interestingly, the phenomenon of declining MI performance for complex stimuli in dual tasks could be explained by a narrowed temporal window of integration (TWI), i.e., maximum temporal characteristics between sensory events that allows their perceptual binding [36]. It has been hypothesized that, depending on the task demand, adaptively widening/narrowing of the TWI could take place, responsible for integration for simple stimuli (wide TWI) or declining integration (narrowed TWI). Abnormal TWI were associated with ADHD traits [37]. Further, in a temporal order judgment task, where participants were required to reproduce the sequence of heard tones, ADHD is associated with higher thresholds compared to healthy controls indicating poorer performance when two sensory events occur close in time [38]. To further challenge the robustness of MI in dual tasks for patients with ADHD, it would be desirable to alter the stimulus-onset asynchronies between the sensory events to investigate the role of TWI in ADHD closer.

In the current study, we found significant race model violations in ADHD evident in the low-load condition, but not in the high-load condition. The theoretical underpinning of the race model assumes that perceptual decision-making is a result of two ‘racing’ sensory modalities: when one sensory modality reaches a threshold of stimulation, it triggers a response, hence both sensory modalities are independent of each other. In contrast, the pooled approach assumes that two sensory modalities can be combined to reach a single supra-modal formation process [30, 39,40,41,42]. Since ADHD patients, did not violate the race model for the high-load condition, it could be that perceptual decision-making is altered in attentional demanding situations towards independent sensory accumulation. Additionally, we found that ADHD is associated with significant higher gaining from two sensory modalities in the low-load condition, which cautiously can be interpreted as reaching a supra-modal threshold faster compared to healthy controls. This in turn could be the result of sensory hypersensitivity as reported for ADHD and evident in significant higher ASP scores in the current study. It is also known, that dopamine plays a role in sensory evidence accumulation, more specifically at modulatory stages differentiating signal from noise [43]. Suboptimal dopamine levels have shown to lead to ineffective sensory accumulation [43, 44]. Since ADHD is a neurodevelopmental disorder especially with abnormalities in the dopaminergic pathways, it could be that perceptual decision-making is altered due to suboptimal dopamine levels. Altered perceptual decision-making must not necessarily result in a behavioural difference. Kwasa et al. also found no behavioural differences between ADHD patients and healthy controls, however, the neuronal signature differed [45]. In an auditory task, that required attentional deployment, weaker neuronal modulation was reported in networks involved in both bottom–up and top–down processing [45,46,47,48]. Further studies are necessary to systematically address a possible altered perceptual decision-making in adult ADHD.

Limitations

Since autism spectrum disorders (ASD) are also associated with difficulties in sensory processing and MI [49, 50] and is comorbid to ADHD [51], we should have screened for ASD. However, we recruited the patients sample via the outpatient department where they got a proper diagnosis along with documentation of all comorbidities. Nevertheless, we cannot rule out the presence of ASD in our sample. Further, the existence of comorbidities, e.g., MDD could have also influenced our results. While we asked for discontinuation of the ADHD-related medication, some participants also took antidepressants or risperidone. Therefore, those medication could have influenced the behavioral performance.

Conclusion

Bottom–up MI for adult ADHD is robust against additional task demands. The underlying sensory accumulation may differ compared to healthy controls but nevertheless leading to similar MI rates for simple stimuli. Future studies are needed to investigate the underlying perceptual decision-making and its possible limits for MI in adult ADHD more specifically.