Genetic contributions to attentional response time slopes across repeated trials

Attention provides vital contribution to everyday functioning, and deficits in attention feature in many psychological disorders. Improved understanding of attention may eventually be critical to early identification and treatment of attentional deficits. One step in that direction is to acquire a better understanding of genetic associations with performance on a task measuring reflexive (exogenous) visual attention. Reflexive attention is an important component of overall attention because (along with voluntary selective attention) it participates in determining where attention is allocated and how susceptible to distractors the subject might be. The task that we used involves the presentation of a target that is preceded by one of several different types of cues (none, double, or single, either ipsilateral or contralateral to where the target subsequently appears). We used several different outcome measures depending on the cue presented. We have previously studied the relationship between selected genes and mean response time (RT). Here we report on the contributions of genetic markers to RT increases or decreases over the course of the task (linear trend in RT slope). Specifically, we find that RT slope for a variety of reflexive attention outcome measures is dependent on DAT1 genotype. DRD4 was near significant for one outcome measure in the final (best) model. APOE, COMT, and DBH were not significant in any models. It is especially interesting that genotype predicts linear changes in RT across trials (and not just mean differences or moment-to-moment variability). DAT1 is a gene that produces a protein involved in the transport of dopamine from the synapse. To our knowledge, this is the first study that has associated neurotransmitter genotypes with RT slope on a reflexive attention experiment. The direction of these effects is consistent with genetic risk for attention deficit hyperactivity disorder (ADHD). That is, those with two risk alleles for ADHD (6R/6R on the DAT1 intron 8 VNTR) either got slower as the task progressed or had the least improvement. Those with no risk alleles (5R/5R) had the most improvement in RT as the task progressed.


Background
Attention is a broad concept that has often been divided in the literature along various dimensions. One such division involves reflexive (exogenous) versus sustained attention. Reflexive attention refers to a stimulus-driven reorienting of the brain's resources, often to an external object that newly appears, has a relatively salient color, or involves motion. The cued-orienting task that we used in our first study [1] is an example of a reflexive task and is similar to that used by Posner et al. [2]. Stimuli flash briefly on the computer display and subjects automatically move their attention. It is generally assumed that attention has been captured reflexively if subjects were faster at responding to a target that was preceded by the presentation of a brief pre-cue at that same location. Attention can be captured in this way even though the stimulus presentation is too brief to involve eye movements. On the other hand, sustained attention is measured with an effortful task designed to require vigilance over time. For example, the Continuous Performance Test (CPT) [3] and the Sustained Attention to Response Task (SART) [4] involve the presentation of a stream of stimuli, some of which require a response while others require that a response be withheld. Examining slope in response time (RT) over trials (see Fig. 1) on a reflexive attention task involves some elements of reflexive attention and some elements of sustaining attention to the task over time (~20 min). Contributions of reflexive attention to attentional deficits are often overlooked [5][6][7][8] but could be useful for a more complete understanding of disorders that have an attentional component, such as ADHD, autism, anxiety, and depression by answering questions about biological contributions to attention using exogenous versus endogenous cues [9,10]. For example, one question might be whether it is the nature of the task itself (i.e., designed to induced effortful attention) that induces declining performance over time or if noneffortful (reflexive) attention tasks can induce the same decline in performance in some individuals. There are several studies that discuss genetic associations with RT changes across the course of a sustained attention task [11][12][13][14]; however, the literature contains almost no genetic-association studies on how RT changes across trials during the course of a reflexive task. Such studies might be helpful in determining if declining performance over time shares similar or has distinct genetic influences in reflexive attention and sustained attention tasks. The genes we selected are related to the availability of neurotransmitters, such as acetylcholine and dopamine (see Table 1). We have previously found some of these genes to be associated with mean RT difference scores [1]. In this paper, we extend those findings with a novel look at the influence of these genes on the slope Fig. 1 Hypothetical illustration of two types of variability. The top portion shows three individuals who differ in the spread (moment-to-moment variability) around a regression line after slope is covaried out. The bottom portion represents variability of RT slope between individuals. Some individuals become faster (or slower) over the 20 presentations of a given trial type. We are primarily interested in RT slope variability in this paper of RT over the course of the 20-min reflexive attention task. Looking at RT slope is unlike looking at momentto-moment RT variability because the former is probably dependent on alertness or learning [15,16] while the latter is probably related to vigilance and the ability to detect a signal beyond neural noise [17,18].
Candidate gene studies that show associations between specific genes and various attentional measures provide evidence of genetic influences on attention. Such an association might occur because a given gene influences the availability of a neurotransmitter. That is, the biological pathway from gene to behavior could include neurotransmitters whose availability impacts those behaviors. For example, attention [19,20] and memory [21] are known to be influenced by the availability of the neurotransmitters. The varying availability of these neurotransmitters due to genetic differences could make behaviors like attending more or less efficient in some individuals, and this, in turn, could be captured by various measures of attention.
One component of ADHD is difficulty in maintaining attentional arousal. Individuals with ADHD are frequently identified as having this difficulty [11,12,[22][23][24][25][26][27][28][29]. Usually the connection between maintaining arousal and dopamine is made for moment-to-moment variability. For example, Johnson et al. [30] make the argument that dopamine must be involved in moment-to-moment RT variability because methylphenidate (which increases dopamine availability) reduces RT variability. Exaggerated RT variability could arise because of difficulty in sustaining attention such that RTs rise and fall over the course of a task independently of the task requirements. RT variability may increase with less dopamine availability because it becomes harder to distinguish signal from noise in the nervous system. That is, reduced availability of dopamine could lead to weaker neural signals giving random neural noise a larger impact on target detection and response [17,18].
Another form of variability over trials would be monotonic increases or decreases in RT. That is, dopamine availability could also be related to RT slope over the course of a task. For example, Bioulac et al. [23] found that children with ADHD decline in performance over time on a Continuous Performance Test. This finding could apply to slight attentional deficits found in nonclinical populations as well. In such populations, there may be variations in neurotransmitters that have been associated with attention generally and these variations could contribute to an increase in RT over the course of a task. This is interesting because our previous analysis of the data in the current study showed that individuals from the general population differed on their mean RTs depending on their neurotransmitter-related genotypes [1]. Risk alleles for each genetic marker were identified in previous studies as associated with ADHD, greater RT costs, and/or reduced cognitive functioning generally. Each genetic marker was coded such that zero indicates neither parent contributed a risk allele for these cognitive outcomes, one indicates one parent contributed a risk allele, and two indicates that each parent contributed a risk allele. Those findings that used an RT attention task generally associated genetic markers with mean RTs or RT difference scores. As mentioned previously, we could find no studies in the literature on genetic associations with RT slope in reflexive attention tasks.
In the current study we asked a simple question: Are RT slopes on a reflexive attention task associated with specific genes? We have genotyping information for subjects on the dopamine-related genes COMT, DAT1, DBH and DRD4. We also have information on APOE, which carries risk for Alzheimer's disease, is related to acetylcholine and has been related to cued orienting, even in Table 1 Rationale for genetic markers selected Bellgrove et al. [62] refer to 3R but according to NN Rommelse, ME Altink, A Arias-Vasquez, CJ Buschgens, E Fliers, SV Faraone, JK Buitelaar, JA Sergeant, B Franke and J Oosterlaan [64] 3R is now called 6R

Genetic marker Risk allele Biological effect Cognitive associations
COMT rs4680 G G at rs4680 produces valine making the gen product more active in catabolizing dopamine [59] Reduced cognitive function generally [59] and ADHD [60] DAT1 intron 8 VNTR 6R 6R leads to more dopamine transporter, less dopamine in the synapse [61] and thus terminates the signal [37] Increased RT costs for targets in the left hemifield [62] and ADHD [63] DRD4 rs747302 C C leads to fewer dopamine receptors via reduced transcription [64] Association with ADHD [64] APOE rs429358 + rs7412 ε4 ε4 reduces acetylcholine receptors [65] and may diminish synthesis of acetylcholine [31] Reduced spatial cueing in visual tasks for middle aged (non-demented) carriers of the ε4 allele [34,66,67] DBH rs1108580 A DβH converts dopamine to norepinephrine, therefore, the A allele is associated with lower norepinephrine to dopamine ratios [68] Association with ADHD [69,70] young adults and adolescents [31][32][33] and with attention generally [34,35]. It is useful to examine genes that do not fit with the dopamine hypothesis of attentional deficits and yet have a logical and historical association with attention. Using these data previously on some of these genes [1], we found significant mean RT differences between individuals with different genotypes. Our goal was to determine if there are also genetic associations with RT slopes across trials in an reflexive task. Thus, we are looking for gene-by-trial interactions by modeling logarithm transformed RT as the dependent variable (the outcome in the multilevel model). Such results would be the first to our knowledge to demonstrate such associations. Our interest in RT slope is to give a more complete picture of what is happening in the minds of those individuals who struggle with attentional deficits, which could potentially lead to better environmental supports and treatments.

Results
All markers were in Hardy-Weinberg equilibrium (Ps > 0.90). In the following models, we were specifically interested in gene-by-trial interactions because they signal differences in the response time trends across trials depending on genotype (see Table 2 for correlations between covariates, genetic markers, and the dependent variable.). Multiple models were tested for each outcome measure (cue-target condition). We began with a full model (including all predictors and covariates) and compared subsequent models for improved model fit (as determined by a reduction in Bayesian Information Criteria; BIC). Full models used logarithm transformed RT as the dependent variable and included predictors of interest (APOE, COMT, DAT1, DBH, DRD4, and trial number), covariates (the standard deviation of logarithm transformed RT, error rate, dummy coded ethnicity, side of target, sleepiness score, and age), and the interactions of interest (the slopes, which are the five gene-by-trial interactions). The final (reduced model) eliminated all covariates except ethnicity. We also tested models removing non-significant interactions, but these did not improve model fit for any cue-target condition. However, BIC did improve and the same gene-by-trial interactions (the slopes) were significant or nearly significant when using a simpler model without including covariates (the standard deviation of the log transformed RT, target side, error rate, sleepiness, or age). For parsimony, we settled on the simpler models which showed substantially the same results and had significantly smaller BIC values (as determined by χ 2 deviance statistics). For every cue-target condition, these were the best models and the improvements over models with all predictors (the full models) were significant at p < 0.001. Reporting Akaike's Information Criterion (AIC) instead of BIC does not change the model building results in any instance.
Estimates from tested models are presented in Tables 3,  4, 5, 6, 7, 8, 9, 10 and 11 (one table per cue-target   Table 2 Correlations between the dependent variable, genetic markers, and covariates Note that the highest correlation with log transformed RT is the standard deviation of log transformed RT. This is likely due to the mathematical relationship between SD of log RT and log RT ** Significant at p < 0.01 * Significant at p < 0.05  Tables 3, 4, 5, 6, 7, 8, 9, 10 and 11 have been retransformed to the original RT scale to aid in conceptual clarity. 2 Additionally, Table 12 shows slopes for the different genotypes from a simple regression of untransformed RT on trial for each genotype to aid in conceptual clarity. The net change in RT across 200 trials can be derived from the slopes in Table 12 by multiplying the slope by 200 trials. For example, for the DAT1 5R/5R genotype for the Single Dim Valid condition: mean(RT change) = 200 trials × (−0.56) ms/ trial = −112 ms across 200 trials (subjects with this genotype responded more than 100 ms faster on trials at the end of the experiment than at the beginning). Our primary interest is in whether different genotypes showed non-zero slopes (RT across trial). While an additional slope was significant with the full model (DAT1 on Single Bright Valid), the reduced models had a lower BIC, and so they are described here. Three of the reduced (final) models had slopes that were significant. In these cases it was the DAT1 by trial interactions that were significant. The model with slopes and without covariates for No Cue 3 was significantly better than the model with all predictors (the full model) at  Tables 3, 4, 5, 6, 7, 8, 9, 10 and 11 have been retransformed. The statistical models were run on the log(10) transformed RT data. For ease of exposition, we took each of these parameter estimates and retransformed them so that their magnitudes would be interpretable in terms of the original, untransformed scale that used RT in msec. Note that it is not appropriate simply to antilog(10) these parameter estimates; that is, it is not appropriate to compute the retransformed value of the parameter β 1 as 10 β1 . Instead, we used the formula shown below to retransform these parameters estimates [58]. Given a parameter, β 1 , estimated using the log(10) transformed values of RT as the outcome variable, the retransformed value of this parameter estimate is: 1 where b 1 is the transformed parameter estimate, β 1 is the original estimate using log(10) transformed RT, log 10 (RT ) is the mean of the log(10) transformed RT values, and the summation/averaging expression after β 1 is the mean of the antilogged residuals.  Tables 3, 4, 5, 6, 7, 8, 9, 10 and 11 with the original parameter estimates, i.e., the estimates before they were retransformed to the original scale, are available from the first author upon request. 3 The names for the various conditions are described in the "Methods".
were significantly steeper (averaging 0.11 ms more of a decrease in RT per trial) than the one risk allele group, which in turn averaged 0.08 ms more of a decrease in RT per trial than the two risk allele group. The two-risk allele group had an estimate set to zero in the multilevel model (see Table 7). The zero-risk allele group was significantly different from a flat slope, (t[2137] = −2.40, p = 0.02), as was the one-risk allele group (t[2137] = −2.60, p = 0.01). The differences are easier to visualize in Table 12 Table 10) than individuals in the one risk allele group (who averaged 0.08 ms faster per trial than those in the two risk allele group). Those in the zero risk allele group had slopes that were marginally different from a flat slope, t(2139) = −1.75, p = 0.08. Those in the one-risk allele group had slopes that were significantly different from a flat slope, t(2139) = −2.50, p = 0.01. As can be seen in Table 12, subjects with zero risk alleles responded approximately 112 ms faster over the course of the task, while those with two risk alleles (5R/5R) were responding only 19 ms faster by the end of the task.
The final model for Single Dim Invalid was significantly better than the model with all predictors at χ 2 (5, N = 2513) = 352.66, p < 0.001. The main effects were significant for DAT1 (F [2,2099]  The DAT1 zero risk allele group has slopes that are significantly larger (averaging a decrease of 0.13 ms more per trial than individuals in the one risk allele group (who have slopes that decrease 0.06 ms per trial faster than the two risk allele group). The zero risk allele group had slopes that were significantly different from zero (t[2099] = −2.36, p = 0.02), but the one risk allele group was only marginally different from a flat slope, t(2099) = −1.75, p = 0.08. Subjects with zero risk alleles got about 87 ms faster over the course of the task, while those with two risk alleles (5R/5R) were responding slower by about 17 ms by the end of the task.

Table 3 Estimates from models of the predictors of log-RT for the dual by bright condition
Significant values for ethnicities indicate that the ethnic group is different from Caucasians on RT values. Other significant values indicate that the estimate is different from zero. We report in the "Results" section tests of fixed effects for gene by trial interactions to determine if slopes differ by genotype Note that this table shows model parameter estimates when analyzing log transformed RTs. The values have been retransformed. However, to get a rough idea of slope differences by genotype group, please see Table 12 for raw RTs (ms) NS the slopes were not significantly different from zero and so the row has been collapsed * Significant at p < 0.05 ** Significant at p < 0.01 a This parameter has been set to zero

Discussion
One can think of variability in RT in an attention task across repeated trials in two ways: (1) random variability from trial to trial that fluctuates around some mean RT, and (2) a systematic RT slope across trials that tends to make latter RTs slower (or faster) on average than earlier RTs. Because it is common to study the momentto-moment variability of behavioral or physiological measures between genotypes, it is especially interesting to find that genotype also predicts the latter kind of variability in RT in a cued orienting task: the linear component of changes across trials in RT (slope RT). This was true even after covarying sleepiness, age, self-reported ethnicity, error rate, and moment-to-moment variability, although the statistical analysis led us to settle on the simpler models without these covariates. DAT1 was the only gene in the simpler models that interacted with trial number in predicting RT slopes. DAT1 produces a protein involved in the transport of dopamine from the synapse. We note that our task produced the typical costs and benefits of a reflexive attention task [1,36]. Therefore, our finding is intriguing because it implies that it is not the nature of the task per se (i.e., whether it is designed to involve effortful/sustained attention or not) but possibly that there is a task that transpires across an extended time period. For DAT1, the mean slope increased (subjects got slower) across genotypes with increasing numbers of risk alleles for three outcome measures that were significant. The significant outcome measures were conditions in which there was no cue and in which the single cues were dim (both valid and invalid). The conditions that were not significantly associated with DAT1 are also interesting. These include the Dual Asymmetric Bright (the target appeared near the brighter of two simultaneous cues) and Dual Asymmetric Dim conditions (the target appeared near the dimmer of two simultaneous cues), and the Neutral Both Bright and Neutral Both Dim conditions (the target appeared near either of two identical cues). The Single Bright Valid and Invalid cues were also not significant. This all suggests that it is not validity that distinguishes genotype groups (because subjects with two risk alleles do not respond more quickly across trials with either valid or invalid pre-cues). It may be, however, that cue luminance matters (because those with two risk alleles do not respond more quickly across trials with single, dim cues) and cue number matters (because dual cues did not distinguish between subjects in the different genotype groups). The risk allele for our genetic marker on DAT1 (six repeats of a VNTR) increases production of the dopamine transporter so that the dopaminergic signal is often terminated too soon [37]. One possible conclusion regarding genetic influence on the no-cue and single, dim cue conditions is that bright cues are too salient to distinguish between genotype groups (i.e., all groups performed at ceiling) because dopamine variability in the general population is sufficient for this performance. Another possible conclusion regarding genetic influence is that the addition of a second cue induces subjects to perform more similarly to each other across trials than a single cue, again, because dopamine is sufficient with two cues. In a followup analysis with DAT1, the number of cues showed a slight statistical trend for association with RT slope (p = 0.15) with steeper improvements in RT across trials for the no risk allele (5R/5R) group with one cue trials than with two cue trials. This finding is worth investigation in a future study because it suggests the possibility of an effect in which RT slopes across trials are approximately equal between genotype groups when there are more cues (perhaps because attention is divided) and if they have any copy of the risk allele (6R). Conversely, they respond more quickly across trials if there is only one cue and they have no risk alleles. In other words, subjects respond more similarly across the course of the task when there are two cues, but single cues reflexively capture attention and require a disengage step, which may become more effortful for some subjects over the course of the task.
Why would the DAT1 genotypes be associated with systematic differences in the rate of responding across trials? There are three possible outcomes when examining the interaction between gene and trial in predicting logarithm-transformed RT: (1) subjects can respond more quickly across trials (negative RT slope as trial number increases), (2) subjects can respond equally quickly across trials, or (3) subjects can respond more slowly across trials (positive RT slope as trial number increases). Also keep in mind that a significant difference in RT slope across trials between genotype groups does not always entail a negative RT slope in one group and a positive RT slope in another group. It could simply mean that the RT slope was less negative in one group than in another, while both groups could show negative slopes. In all cases, the significant effects consisted of less negative or more positive RT slopes across trials associated with more risk alleles for the various genes (as defined by association with ADHD). Recall that possession of more copies of the risk allele on DAT1 would tend to result in faster transport of dopamine from the synapses. The nature of the behavioral effect is that subjects tend to slow down across the course of the experiment, especially with the 5R/6R and 6R/6R genotypes. One possible explanation for this effect, then, is that an optimal level of dopamine is required to maintain alertness [22,28,[38][39][40][41][42]. This is actually not different from saying that the association between DAT1 and RT slope may be related to vigilance or sustained attention, even though this is a reflexive attention task. There is a large literature for genetic effects of dopamine related genes for sustained attention [22,28,41,42].
In the case of the current results with the reflexive attention task, waning vigilance across the course of the experiment would tend to lead to gradually longer target detection times and longer RTs on average despite the reflexive nature of the cue-target trials. This hypothesis would, of course, predict that possession of more DAT1 risk alleles should have caused a general slowing across trials in all conditions. Instead, this effect was observed in three of nine conditions. It is possible that issues of power prevented detection of the effect in the other conditions. Nevertheless, dopamine availability, cue luminance and cue number could all be important. We have provided evidence that there are genetic influences on RT trends over the course of a reflexive attention task and not simply on terms of moment-to-moment RT variability.
Of course, chance is another possibility that could explain our results. In fact, the biggest limitation of this study regards sample size which all other things being equal tends to make results less replicable. While some authors caution against using sample sizes smaller than 500 to explore interaction effects [43], others have noted sufficient power and stability to explore cross-level interactions in multilevel models with sample sizes of 30-50 level-2 units (individual subjects in this study) [44,45] and at least 30 level-1 units (the trials in this study) per level-2 unit. Our study certainly met the latter criterion. The best way to handle this question is to replicate this study.
Another limitation could be the heterogeneity of our sample. While we see advantages to including multiple ethnicities and ages, population stratification as an artifact is certainly more likely as the heterogeneity of the sample increases. This would argue for a replication with a larger overall sample and more balanced ethnic subsamples. This could lead to more certainty in the generalizability of the results. It would also be advantageous to explore associations with more genetic markers including those that are associated with neurotransmitters other than dopamine.

Conclusions
Over the course of a reflexive visual attention experiment with repeated trials of the same conditions, some subjects fail to respond more quickly to identical stimuli, but others respond more quickly as the task progresses. This is interesting because it suggests the possibility of similar biological mechanisms for sustaining attention to exogenous and endogenous cues [9,46,47]. However, there are other possibilities. Responding more slowly to later presentations of the same stimulus could indicate, for example, fatigue, a build-up of trial-to-trial inhibition, or greater interference from immediately preceding trials. Responding more quickly to the same stimulus in later presentations could be considered a simple form of learning. The subject's task was simply to make a choice response (right versus left) to a particular target that appeared across trials either unaccompanied by a temporally preceding cue, or accompanied by such a cue that could appear spatially near the target's position or contralaterally across the visual field from the location of the target.
We asked whether individual differences in these RT slopes in response to the cue-target manipulation were associated with genetic markers for five genes: COMT, DAT1, DBH, DRD4, and APOE. These genes were chosen because they have been shown to be related to various aspects of visual attention in previous research, and because in some cases, there is a plausible biological pathway from the gene to the phenotypes that we used to study visual orienting. DAT1 genotype was associated with variations in these slopes in three different cue-target conditions. The mean slope increased with increasing numbers of risk alleles. A larger positive mean slope implies that the difference in RT between early trials with a given stimulus and later trials with that same stimulus was greater (greater RT slowing) for subjects carrying more of the DAT1 risk alleles. The risk allele was determined based primarily on previous research on related sustained attention tasks.
Previous work, including our own, has shown associations between the mean RT in different conditions and various genes, but such associations are like snapshots of genotype/phenotype relations in the sense that the phenotype is not extended in time. It reflects a static aspect of visual attention. In contrast, the phenotypes that we examined were by definition extended in time because they were defined by the slope of RT across repeated presentations of the same stimulus. It was primarily DAT1 that was associated with these temporally extended phenotypes. The phenotypes reflect how speeded attentional choices change over perhaps a 30 min period filled with repetitions of multiple trial types. However, this temporally extended phenotype could reflect the ability to maintain attentional arousal-the ability to learn to recognize the spatiotemporal configurations of the various trial types in order to respond more discriminatively when they appear again-or other processes that systematically alter RTs across an extended period of time. Our results suggest that the DAT1 gene is a promising candidate for understanding the underlying pathways involved in these more temporally extended aspects of either reflexive or sustained visual attention.

Subjects and data set
This study was approved by Rice University's Internal Review Board for Human Subjects in Research and conducted in accordance with ethical guidelines established by the Office for Human Research Protections at the US Department of Health and Human Services. We tested subjects between the ages of 18 and 61 years old from the general population. Most of the subjects (n = 107) were Rice University students, and the remaining subjects (n = 54) were recruited from the surrounding community by posting to neighborhood email groups (see Table 13 for a comparison of these two recruiting sources and Table 14 for demographics). Prior to completing the visual orienting task, subjects signed a consent form and completed an intake questionnaire that included questions on basic demographics, tobacco use, sleepiness (using the Epworth Sleepiness Scale) [48], and attentional disorders affecting the subject and their biological relatives. Subjects provided a saliva sample from which we extracted DNA to obtain information on genotypes for five genetic markers (see Table 1).
A subject's data were removed from the data set if he or she had error rates over 10 % on the behavioral task, could not be classified into a single ethnicity, or reported a history of a serious neurological disorder. Of the 161 Table 13 Comparison of these two recruiting sources The two recruiting sources were Rice University students and subjects recruited from a suburb of Houston. As can be seen, the subjects from the two groups differed on age (the suburb sample was older) and DRD4 genotype. Multilevel modeling accounts for the fact that trials are not independent (certain trials come from certain individuals) and so these differences are less of a concern than they might be with other statistical analyses . High errorrates could indicate that subjects might not be motivated or understand the task. The error rate for the remaining subjects averaged 2 % (SD = 0.02). One subject was excluded for not being classifiable to a single ethnicity. However, eleven individuals with dual ethnicity were classifiable based on the pattern of their genes. We calculated the proportion of each single ethnic group having each genotype. Then we looked at the genotypes of those individuals reporting two ethnicities. If there was a pattern (e.g., if a person's genotype matched the most common genotype for Hispanics on 4 out of 5 genes) and one of the two ethnicities they reported was Hispanic, then they were coded as Hispanic. We compared the results to the best distinguishing genotype for each ethnicity (e.g., if a person had 5R/5R on DAT1, then he/she was 83 % likely to be Black). No individuals reported more than two ethnicities. Three individuals were excluded for reporting prior strokes or seizures. In addition, four subjects were excluded for experimenter error. The final data set contained data from 146 individuals who had complete behavioral data. Subjects reporting ADHD were not automatically excluded because attention symptoms exist in a continuum across the general population [49][50][51]. However, models including and excluding these subjects were compared to verify that these subjects were not driving the results. We determined that the same statistical decisions would have been reached except for slight differences in p values obtained, so we retained the subjects reporting ADHD. Eight subjects in the final data set reported a diagnosis of ADHD, four of whom were medicated (one additional subject reporting ADHD was excluded for having fewer than 200 trials). However, fewer than half of these eight subjects had the high risk genotype on any given genetic marker so we reasoned that these subjects could not drive any results consistent with an analysis by risk allele. Ninety-four of 107 subjects in the Rice University sample had data that met inclusion criteria (45.74 % male) as did 51 of the 54 community subjects (41.18 % male). Overall, 90.06 % of the subjects had data that was not excluded. The mean age for the university sample was 20.22 years (range 18-38 years), and for the community sample the mean age was 35.45 years (range 18-61 years).
For behavioral data collection, subjects completed a choice response task (responding with a button press to indicate either a left or right target). Subjects viewed a 1024 × 768 pixel CRT monitor with a background luminance of 0.08 cd/m 2 . A fixation cross, centered on the monitor, was always visible. Subjects were instructed to fixate the central cross and to maintain fixation throughout data collection. Before beginning data collection trials, subjects were dark adapted and completed 20 practice trials.
One or two pre-cues were flashed for 67 ms. On all except "catch" trials (for which no target appeared), there was an 83 ms gap after the offset of the cue(s) and prior to the onset of the target. The target remained on display for 1000 ms or until the subject made a key press (see Fig. 2). Subjects were asked to respond as quickly as possible while maintaining accuracy by making a key press to indicate a target either to the left (pressing ' A' on the left side of the keyboard) or to the right (pressing 'L' on the right side of the keyboard) of fixation. On catch trials, no target appeared, and subjects were instructed to withhold responding. After the subject responded (or the trial timed-out), there was a variable delay (1.3-1.8 s), and the next trial began. The task did not provide feedback.
In a traditional Posner-like reflexive orienting task, there are only single pre-cues (except for a neutral condition used as a baseline, which uses two equiluminant cues). To these we added unequal-luminance ("asymmetric"), dual-cue trials [52] in order to examine the ability to benefit from either of two simultaneous pre-cues with possibly different saliences. As in a traditional Posner-like task, the cues could be valid (i.e., the target would subsequently appear ipsilateral to the cue location) or invalid (i.e., contralateral). For dual cues, the target could appear either near the brighter of the two cues or near the dimmer. Never more than one target was presented and we averaged the RTs for left and right target presentations. Having, therefore, two luminances, we used a total of nine different pre-cue conditions on the display prior to the appearance of the target. These nine conditions varied on cue number (0, 1, or 2) and luminance (bright or dim). In addition, dual cues could be equiluminant or asymmetric. In this task the pre-cues were uninformative because the probability of the target appearing near where the pre-cue (or brighter cue) had appeared was 50 %. To refer to these various cue-target conditions, we provided unique names. Single Dim Valid indicated a single cue of the dimmer luminance preceding an ipsilateral target. Conversely, Single Dim Invalid indicated a dim cue followed by a contralateral target. There were corresponding valid and invalid configurations for the Single Bright cues. We also included Neutral Bright and Neutral Dim cues. On these trials, identical bright or dim cues were presented simultaneously on both sides of the fixation cross. As mentioned previously, these spatially neutral cues provided baseline measures and were used to calculate alerting effects (that is, reduced RT due to the temporal appearance of the cue). When the dual asymmetric cues are presented, the target can appear either near the brighter (Dual Asymmetric Bright) or dimmer cue (Dual Asymmetric Dim). Targets can also appear uncued (the No Cue condition). RT was measured from the onset of the target. Finally, there was a tenth condition for which we do not have RT data because no target appeared (Catch trials) and subjects were instructed to withhold responding. Each of the 10 conditions (nine target-present plus one target-absent) was presented 20 times (10 times with a left target and 10 times with a right target), yielding 200 trials.
Subjects were told (1) that the cues did not predict the target's location and (2) to ignore the cues as much as possible. Subjects completed all trials within one session with pauses as necessary.
The brighter and dimmer cue luminances were 11.7 and 2.0 cd/m 2 , respectively. The target (a square) always had a luminance of 15.5 cd/m 2 . The centermost edge of the target appeared 5.5° to either side of the fixation cross. The cues were shaped like the letter X, measured 0.8 (width) × 1.0 (height) degrees, and appeared 7.3° (innermost edges) to the left and right of the display's center.

Genetic methods
Saliva was collected using Oragene-250 kits (DNA Oragene, Kanata, Ontario, Canada). Genetic assays were performed to identify genotypes at known SNPs (see Table 15) using polymerase chain reaction (PCR) amplification. Purification was carried out on products by loading them onto an Applied Biosystems 3730xl DNA analyzer and reading the results using Mutation Surveyor software (SoftGenetics, PA, USA). For more details, please see our earlier paper [1]. Fig. 2 Representation of stimuli. The pre-cue stimulus flashed on for 67 ms and then off for 83 ms. Targets were on the same side as the single cue 50 % of the time. The target remained on display until a response was made but for no longer than 1000 ms. Dual cues were identical except that two cues appeared simultaneously in the same locations in which the single cues appeared Genetic analysis of the VNTR (DAT1 exon 8) was performed using fluorescently labeled PCR products (see Table 13). Similar to the SNPs, the amplified VNTR products were analyzed on an Applied Biosystems 3730xl DNA analyzer. Genemapper 4.0 software was used to assign the allele distribution (Applied Biosystems).
The potential for spurious associations in genetics studies, termed population stratification, is a commonly discussed problem [53]. Stratification in this case is an artifact that occurs when there are systematic differences in a phenotype that have nothing to do with a genetic marker under study, yet the association appears statistically significant. Spurious relationships are possible because ethnicity is related to genetic ancestry. Studies are more at risk for these stratification artifacts when subjects from various ethnic groups are (1) combined in the same analysis, (2) differ on a phenotype, and (3) simultaneously differ for unrelated reasons on the frequencies of target genotypes. In a particular study it is often impossible to determine if ethnic group phenotypic differences are due to genetic differences. However, population stratification is often suspected if a study fails to replicate. To address potential stratification, we used a strategy similar to that recommended by Hutchison et al. [54]. That is, we used self-reported ethnicity as a proxy for genetic subpopulation entered as a covariate in the statistical model. We examined markers on five genes, COMT, DAT1, DBH, DRD4, and APOE. The first four each have three genotypes based on two alleles (one inherited from each parent, such as AA, AG and GG for COMT). All genes except APOE were coded ordinally so that "0" represents no risk alleles, "1" represents one risk allele, and "2" represents two risk alleles. APOE was coded so that "0" indicates possession of a protective allele, "1" represents the most common "normal" variant, and "2" represents possession of either one or two risk alleles.
The coding for APOE has a slightly different interpretation because it consists of two different markers (a haplotype) in contrast to the other genes that each had only one marker. However, the risk for cognitive deficits appears to be additive and can still be tested for linear effects using similar coding. APOE is based on two SNPs from which we have created three groups (ε2/ε3, ε3/ε3, and any ε4) based on the literature. For example, Hubacek et al. [55] did not consider individuals with the ε2/ε4 genotype since one allele carries risk for cognitive deficits (even in middle-aged adults without Alzheimer's) [31] and the other provides protection against cognitive deficits [56]. In our data set two individuals had the ε2/ ε4 genotype and were therefore excluded from analysis on this gene. Other genotypes, such as ε2/ε2, are rare and do not occur in our data set. The first group (ε2/ ε3) represents those with a protective allele; the second group represents those with typical risks, and the third group represents those with at least one risk allele (that is, either one or two ε4 alleles) since even one risk allele carries risk for cognitive deficits. These five genes (COMT, DAT1, DBH, DRD4, and APOE) were entered as potential predictors of RT variability for each cue-target condition in turn.

Statistical analysis
Prior to the main analyses, we determined that assumptions of normality were not met (the RT variables had significant positive skew). Raw RT values were transformed using a base ten logarithm. We did not use incorrect (wrong side) or RT out of range trials, and this reduced the impact of attentional lapses on slope.
Recall that 16 subjects had their data removed for reasons described under Subjects and Data Set. In addition to these exclusions, some subjects had missing genetic information due to the inability to obtain genotypic information from their saliva sample on a particular marker. For APOE, 12 subjects had missing genetic information. For COMT, two subjects had missing genetic information. For DAT1, seven subjects had missing genetic information. For DBH, three subjects had missing genetic information. Finally, for DRD4 the number of subjects with missing genetic information was 13. If subjects were missing genotyping data for a particular marker, they were excluded from that analysis.
We used a multilevel modeling approach in SPSS (IBM, 2012) using the MIXED command. We used a backwards design similar to backwards regression. There were nine final models (one for each cue-target condition). Final models were determined by comparison to an empty model with no predictors and to simpler models. Essentially, the multi-level model is fitting a line for each person through the log transformed RTs of each cue-target condition and then the interaction between trial and genotype indicates if the slope changes by genotype. Epworth sleepiness scores, age, and ethnicity were entered as covariates.
We examined slope using mixed (multilevel) modeling. Multilevel modeling accounts for the nested nature of the data (i.e., that trials can be attributed to individuals). We used this method primarily to account for the cross-level interaction that we were interested in between trial and genotype (i.e., the slope) [57]. In addition, the intraclass correlation indicated that 4.39 % of the variance came from the individual-level data (level-2). Although this is relatively small, using multilevel modeling addresses the nested nature of the data. We entered all five genetic markers, sleepiness, age, side of target (right or left), trial number, standard deviation of base10 logarithm-transformed RT, error rate, and ethnicity (dummy coded). We also entered the interaction between trial number and genetic marker as predictors of RT. These were all fixed effects. Trial was entered as a repeated effect. The output of interest when examining RT slopes over the course of a task is the interaction between the genetic marker and trial number.