Numerous candidate EEG biomarkers have been put forward for use in clinical research on autism spectrum disorder (ASD), but biomarker development has been hindered by limited attention to the psychometric properties of derived variables, inconsistent results across small studies, and variable methodology. The authors evaluated the basic psychometric properties of a battery of EEG assays for their potential suitability as biomarkers in clinical trials.

Methods:

This was a large, multisite, naturalistic study in 6- to 11-year-old children who either had an ASD diagnosis (N=280) or were typically developing (N=119). The authors evaluated an EEG battery composed of well-studied assays of resting-state activity, face perception (faces task), biological motion perception, and visual evoked potentials (VEPs). Biomarker psychometrics were evaluated in terms of acquisition rates, construct performance, and 6-week stability. Preliminary evaluation of use was explored through group discrimination and phenotypic correlations.

Results:

Three assays (resting state, faces task, and VEP) show promise in terms of acquisition rates and construct performance. Six-week stability values in the ASD group were moderate (intraclass correlations ≥0.66) for the faces task latency of the P1 and N170, the VEP amplitude of N1 and P1, and resting alpha power. Group discrimination and phenotype correlations were primarily observed for the faces task P1 and N170.

Conclusions:

In the context of a large-scale, rigorous evaluation of candidate EEG biomarkers for use in ASD clinical trials, neural response to faces emerged as a promising biomarker for continued evaluation. Resting-state activity and VEP yielded mixed results. The study’s biological motion perception assay failed to display construct performance. The results provide information about EEG biomarker performance that is relevant for the next stage of biomarker development efforts focused on context of use.

Neural biomarkers are objective markers that may serve as an intermediate phenotype related to core features (e.g., social attention) or neurobiological functioning (e.g., excitatory/inhibitory balance) in autism spectrum disorder (ASD). Scalp electrophysiology (EEG) is a noninvasive technique for recording brain activity and is well suited for biomarker development in neurodevelopmental conditions (1–4). EEG does not require participant responses and can be collected over brief recording periods. Applicable across age and developmental levels, EEG has been widely used in ASD, with established best practices (5, 6). However, reproducibility is a pervasive problem in ASD neuroimaging research, and it is not known whether inconsistent findings reflect true population heterogeneity, methodological or acquisition variability, or poor psychometric qualities. The present study focused on addressing these limitations of previous research by using a large sample with a constrained age range, prespecifying derived variables and directional hypotheses, and evaluating basic psychometric properties to rule in or out EEG metrics potentially suitable as biomarkers in future clinical trials.

The Autism Biomarkers Consortium for Clinical Trials (ABC-CT) (7) selected four EEG assays with strong prior evidence of group discrimination or relation to phenotype. First, face processing is a fundamental social ability that is affected in ASD and is putatively related to early social communicative symptoms (8). The N170 is a face-sensitive event-related potential (ERP) (9) evident by age 4 (10), with developmental precursors in infancy (11, 12). The latency of the N170 to upright faces is delayed in individuals with ASD, although variability exits within reports (13, 14). Second, biological motion perception is critical to social understanding (15). Biological motion perception is indexed by the P1, N2, and P3 ERP components, and attenuated responses have been observed in ASD (16). Third, visual evoked potentials (VEPs) provide information about the integrity of visual pathways and index biological mechanisms related to ASD. Children with ASD and associated genetic syndromes show reduced-amplitude VEPs (17–19). Fourth, resting-state EEG characterizes atypicalities in intrinsic neural function in ASD, manifested as spectral power increased in low- and high-frequency bands and decreased in midrange activity (20). Psychometric evaluation of these assays focused on 1) valid acquisition and signal, 2) construct performance, and 3) 6-week stability. We also sought to replicate discrimination between ASD and typically developing groups and phenotypic correlations within the ASD group.

Methods

ABC-CT Protocol

Details on the ABC-CT protocol have been published previously (7, 21), and details on the full protocol, the participants, and the EEG protocol are provided in the online supplement. The study included measures of social function and a battery of EEG and eye-tracking tasks administered across five sites during 2-day visits spanning three time points; the analyses here focus on time 1 (T1) and time 2 (T2 [T1 + 6 weeks]). A central institutional review board ensured that informed consent or assent was obtained from guardians or participants after they had been given an explanation of the study procedures and an opportunity to ask questions.

Participant Characteristics

Across five sites, we included 280 children with ASD (mean age, 8.6 years [SD=1.6, range=6.0–11.5]; mean full-scale IQ, 98.6 [SD=18.1, range=60–150]) who met gold-standard diagnostic criteria and 119 typically developing children (mean age, 8.5 years [SD=1.6, range=6.0–11.5]; mean full-scale IQ, 115.1 [SD=12.6, range=80–150]) without ASD or psychiatric conditions. Full diagnostic and inclusion criteria as well as demographic characteristics are presented in the online supplement and in Table S1 for the total sample and for the subsamples that contributed valid data to each assay. Age did not differ significantly between the ASD and typically developing groups, nor based on inclusion in each assay (F values <1.0, p values >0.20).

EEG Acquisition Standardization

All procedures were manualized (21). Each experiment started with a welcome screen, followed by experiment-specific information, behavioral directions, and then a start screen. Behavioral assistants provided supplemental verbal directions and behavioral supports responsive to the participant, as per guidelines. During acquisition, the experimenter coded the participant’s behavior for attention and compliance.

EEG equipment.

All sites had an EGI 128-channel acquisition system, from the Net Amps 300 series (three sites) or 400 series (two sites), 128-electrode EGI HydroCel Geodesic Sensor Nets, Logitech Z320 computer speakers, a Cedrus StimTracker (for visual presentation timing), and a 23-inch monitor. A standard acquisition setup was implemented: a 1,000-Hz sampling rate, a 0.1- to 200-Hz filter, EGI MFF file format, onset recording of amplifier and impedance calibrations, and a postacquisition 0.1-Hz digital high-pass filter. EPrime, version 2.0, was used for experimental control.

EEG protocol.

The four EEG assays were administered on the second day of each time point: resting state, followed by a faces task, followed by a counterbalanced biological motion perception (BM) task or VEP. Quality review and signal transformation were performed centrally (21) (see the online supplement for details on the methods).

Resting-state assay.

The resting-state experiment included three blocks of two 30-second videos, for a total of 180 seconds. Video stimuli consisted of abstract moving images subtending 8×6 degrees. The primary outcome variable was the slope of the EEG power spectrum (2–50 Hz), calculated by fitting a least-squares linear regression of log-transformed power as a function of frequency (Figure 1A). Slope indexes cortical excitation-to-inhibition ratio and long-range temporal correlations (22, 23). Secondary variables were alpha (6–12.99 Hz) and gamma (35–54.99 Hz) power. (Results for the delta, theta, and beta bands are provided in the online supplement.) Power was averaged across the entire scalp.

FIGURE 1. EEG resting state, faces task, visual evoked potential, and biological motion recognition task for the ASD and the typically developing groups^a
^aGraphs in the first row are for EEG resting state, in the second row for the faces task, in the third row for visual evoked potentials, and in the fourth row for the biological motion recognition task. Panel A shows group-averaged power by frequency spectrum and slope at time 1; panel B is a histogram of slope values at time 1; panel C shows the 6-week stability of the slope from time 1 to time 2 (T1 to T2); panel D shows the upright faces (solid line) and upright houses (dotted line) group-averaged event-related potential (ERP) waveform at T1; panel E is a histogram of N170 latency to the upright faces at T1; panel F shows the 6-week stability of N170 latency upright faces values from T1 to T2; panel G shows the checkerboard VEP group-averaged ERP waveform at T1; panel H is a histogram of the P100 amplitude to checkerboards at T1; panel I shows the 6-week stability of the P100 amplitude to checkerboards from T1 to T2; panel J shows the biological (solid line) and scrambled motion (dotted line) group-averaged ERP waveform at T1; panel K is a histogram of the P300 amplitude biological motion specificity (BMS) effect at T1; and panel L shows the 6-week stability of the P300 amplitude BMS response from T1 to T2. Shading in panels A, D, G, and J indicates standard error of the mean. ASD=autism spectrum disorder group; BMS=biological motion specificity effect; TD=typically developing group.

Faces assay.

The faces task included 216 trials, acquired in six blocks of 36. Each trial consisted of a fixation crosshair (500–650 ms), stimulus (500 ms), and blank screen (500–650 ms). Three female neutral faces, presented upright and inverted, and three upright houses were presented, for 72 trials per stimulus condition (24). We focused on the right posterior-temporal region created by averaging across the channels (electrodes 89, 90, 91, 95, 96) for each segment and within stimulus type. Peak amplitude and latency for the P100 (60–200 ms), reflecting attention allocation, and the N170, reflecting early-stage facial processing (120–400 ms), were identified using an automated algorithm, visually inspected for accuracy and removed if invalid (details are provided in the online supplement). As shown in Figure 1D, all variables were to upright faces, with the primary variable N170 latency; the secondary variable was the P100 latency.

VEP assay.

VEP stimuli consisted of two black-and-white checkerboards that reversed phase, presented in four blocks of 50 500-ms trials, for a total of 200 trials. Analysis focused on the midline occipital region (average of electrodes 70, 74, 75, 82, 83). All peaks were visually inspected for accuracy; peaks that were deemed invalid were removed from analyses (see the online supplement). As shown in Figure 1G, the primary dependent variable was the P100 amplitude (VEP P100 amplitude, 60–200 ms), and the secondary variable was the N1 amplitude (VEP N1 amplitude, 20–120 ms), reflecting activity in the primary visual cortex and dorsal-medial occipital cortex, respectively (25).

Biological motion perception assay.

The BM assay included 112 trials, in four blocks of 28, with each trial starting with a fixation crosshair (1,025–1,200 ms) followed (randomly) by 1,000 ms of a dynamic biological walking motion or a scrambled point-light display. We used the same region as in the faces task (the right posterior temporal region). We focused on the biological motion specificity effect (BMS; the difference between biological minus scrambled motion response), with the primary variable the P3 average amplitude (P3 amplitude, 330–500 ms) and the secondary variable the N200 peak amplitude (N200 amplitude, 120–400 ms). In addition to the N200 and P300 (Figure 1J), we also abstracted the P100 (80–180 ms), as in the faces task and VEP. Peaks that were deemed invalid were removed from analyses (see the online supplement).

Analytic Plan

We conducted five sets of prespecified analyses to address the psychometric properties of these potential biomarkers and their relation to ASD diagnosis and behaviors (21). The results for T1 are described here, with expanded results provided in the online supplement; results for T2 are provided in the online supplement to demonstrate replication of effects.

Acquisition.

We present rates of acquisition by group below, and additional details are provided in Table S2 in the online supplement. We used chi-square tests to compare rates within each diagnostic group across the five sites; for BM and VEP, we also examined the group-by-order interaction (counterbalanced to the third or fourth position in the battery). We used analyses of covariance (ANCOVAs) to check for differences in characteristics between participants whose data were included and those whose data were excluded (see Table S3 in the online supplement). We then used Pearson correlations to examine the relationship between the number of trials included and child characteristics (see Table S4 in the online supplement).

Construct performance.

Construct performance was established in the typically developing group using experiment-specific evaluation for the faces task and the BM task. Construct performance was related to directional stimuli contrasts (upright faces vs. upright houses, biological motion vs. scrambled motion, respectively) using repeated-measures analysis of variance (ANOVA).

Six-week stability.

We assessed short-term stability of individual biomarkers in both groups from T1 to T2 using intraclass correlation via mixed models with absolute agreement. In Table 1 and in Table S5 in the online supplement, we provide values for subgroups within the ASD group: split by age (8.5 years) and IQ (full-scale IQ of 75). The data supplement includes an expanded version of Table 1 with 95% confidence intervals for all intraclass correlations reported. A priori acceptability criteria defined intraclass correlations ≥0.5 as moderate and ≥0.75 as high (26).

TABLE 1. Six-week stability (intraclass correlations) from time 1 to time 2 for primary and secondary prespecified biomarkers^a

Measure	Typically Developing	ASD	ASD <8.5 Years Old	ASD ≥8.5 Years Old	ASD IQ ≤75	ASD IQ >75
Resting state
Slope	0.539	0.594	0.575	0.606	0.621	0.588
Alpha	0.667	0.730	0.707	0.724	0.725	0.718
Gamma	0.453	0.555	0.561	0.452	0.476	0.542
Faces task
Upright faces P100 latency	0.687	0.680	0.799	0.592	0.895	0.667
Upright faces N170 latency	0.749	0.662	0.622	0.644	0.789	0.650
Visual evoked potential
VEP N1 amplitude	0.679	0.732	0.610	0.801	0.380	0.861
VEP P100 amplitude	0.743	0.700	0.752	0.570	0.820	0.693
Biological motion perception task
BMS N200 amplitude	0.097	0.025	0.103	0.092	—^b	0.026
BMS P3 amplitude	0.149	0.020	-0.109	0.211	—^b	0.023

^aThe values provided are for the included samples of typically developing children and children with autism spectrum disorder (ASD), and then for ASD subgroups by age and IQ. BMS=biological motion specificity effect; VEP=visual evoked potential.

^bIntraclass correlations not included because sample size was ≤10.

TABLE 1. Six-week stability (intraclass correlations) from time 1 to time 2 for primary and secondary prespecified biomarkers^a

Enlarge table

Group discrimination.

We first ran an independent-samples ANOVA with group as the between-subject variable, and we report partial eta-squared (η_p²) for effect size; if variances differed significantly between groups, we report Welch ANOVAs with omega-squared (ω²) for reporting effect size. We also include ANCOVAs with child age, number of trials, sex, and full-scale IQ as covariates. For BM and VEP, the contrast also included order and the group-by-order interaction.

Phenotypic correlations.

We assessed associations between biomarkers and child characteristics in the ASD group. Analyses are reported using Pearson’s correlations; we also verified biomarker-to-phenotype relations using partial correlations controlling for age and number of trials.

Results

Resting Assay

Acquisition.

Valid resting-state data were provided by 86% of the ASD sample and 92% of the typically developing sample, and inclusion rates did not differ significantly by site. Children with ASD who provided valid data had greater nonverbal and full-scale IQ scores and better social and communication skills than those who did not provide valid data (see Table S3 in the online supplement).

The ASD group provided fewer valid data trials than the typically developing group (Table 2). Among children in the ASD group, more valid trials were provided by those who were older, had higher IQ scores, had better communicative functioning, and had fewer autism behaviors (see Table S4 in the online supplement).

TABLE 2. Time 1 group discrimination for primary and secondary variables^a

	ASD		Typically Developing		Analysis
Measure	Mean	SD	Mean	SD	Analysis Type	Main Effect of Group
Resting state
Resting trials	143.75	25.1	161.61	9.4	Welch ANOVA	F_1,341=93.49, p<0.001, ω²=0.208
Resting slope	−1.279	0.144	−1.317	0.139	ANOVA	F_1,350=5.29, p=0.02, η_p²=0.015
					ANCOVA	F_1,346=1.23, p=0.27, η_p²=0.004
Resting alpha (µV2/Hz)	0.259	0.116	0.269	0.109	ANOVA	F_1,350=0.57, p=0.45, η_p²=0.002
					ANCOVA	F_1,346=3.49, p=0.06, η_p²=0.010
Resting gamma (µV2/Hz)	0.026	0.017	0.024	0.02	ANOVA	F_1,350=0.62, p=0.43, η_p²=0.002
					ANCOVA	F_1,346=0.44, p=0.51, η_p²=0.001
Faces task
Upright faces trials	46.38	13.9	54.31	10.5	Welch ANOVA	F_1,295=33.92, p<0.001, ω²=0.091
Upright faces P100 latency (ms)	121.61	17.1	117.22	12	Welch ANOVA	F_1,307=7.38, p=0.007, ω²=0.012
					ANCOVA	F_1,324=5.65, p=0.02, η_p²=0.017
Upright faces N170 latency (ms)	209.63	32.4	196.86	25.7	Welch ANOVA	F_1,284=15.36, p<0.001, ω²=0.042
					ANCOVA	F_1,324=7.60, p=0.006, η_p²=0.023
Visual evoked potential
VEP trials	152.52	36.5	172.83	21	Welch ANOVA	F_1,338=43.54, p<0.001, ω²=0.108
VEP N1 amplitude (µV)	−4.39	3.3	−4.53	3.5	ANOVA	F_1,331=0.13, p=0.72, η_p²=0.000
					ANCOVA	F_1,325=0.63, p=0.43, η_p²=0.004
VEP P100 amplitude (µV)	8.23	4.4	8.98	4.1	ANOVA	F_1,349=2.34, p=0.13, η_p²=0.006
					ANCOVA	F_1,343=0.39, p=0.53, η_p²=0.001

^aAnalyses are presented using unadjusted analysis of variance (ANOVA) or Welch ANOVA. Follow-up analyses use analysis of covariance (ANCOVA) with covariates for age, number of valid trials, sex, and full-scale IQ. For VEP, the follow-up model also includes order and the group-by-order interaction. ASD=autism spectrum disorder; VEP=visual evoked potential.

TABLE 2. Time 1 group discrimination for primary and secondary variables^a

Enlarge table

Six-week stability.

Intraclass correlations for the typically developing and ASD groups are presented in Table 1. Values were generally moderate for slope (Figure 1C) and alpha power, and low for gamma power.

Group discrimination.

When covariates were included, slope (Figure 1A,B), alpha power, and gamma power did not differ by group (Table 2). Exploratory analyses of the other frequency bands (delta, theta, and beta) did not show group differences (see Table S6 in the online supplement).

Phenotype correlations.

As shown in Table 3, a more negative (steeper) slope and lower gamma power were related to better face memory. Lower alpha power and lower gamma power were associated with higher IQ. Only alpha and full-scale IQ remained significant after covarying for age and number of trials.

TABLE 3. Time 1 correlations between the primary and secondary biomarkers for the resting-state EEG, faces ERP, and VEP experiments and child behaviors in the ASD group at time 1^a

Measure	Trials	Age	Verbal IQ	Nonverbal IQ	Full-Scale IQ	Face Memory
Resting state
Resting slope	−0.261**	−0.177**	−0.110	−0.050	−0.080	−0.163*
Resting alpha	−0.214**	−0.261**	−0.147*	−0.146*	−0.171**^b	−0.104
Resting gamma	−0.303**	−0.298**	−0.144*	−0.141*	−0.165*	−0.149*
Faces task
Upright faces P100 latency	−0.134	−0.185**	0.019	0.020	0.015	0.025
Upright faces N170 latency	−0.286**	−0.350**	−0.115	−0.107	−0.126	−0.141*
Visual evoked potential
VEP N1 amplitude	0.109	0.241**	0.093	0.013	0.049	0.123
VEP P100 amplitude	−0.039	−0.119	0.039	0.039	0.040	0.127*^b

^aFace memory was assessed using standard score on the memory for faces subtest of the NEPSY-II. ASD=autism spectrum disorder; ERP=event-related potential; VEP=visual evoked potential.

^bSignificant at p<0.05 when covarying for age and valid number of trials in the ASD group.

*p<0.05. **p<0.01.

TABLE 3. Time 1 correlations between the primary and secondary biomarkers for the resting-state EEG, faces ERP, and VEP experiments and child behaviors in the ASD group at time 1^a

Enlarge table

Faces Assay

Acquisition.

Valid data for the faces task were provided by 77% of the ASD sample and 97% of the typically developing sample. Sites differed at T1 as a result of very high rates of acquisition at one site (91%, compared with 64%–78% in the others). Children with ASD for whom data were available were more likely to be older, have higher verbal, nonverbal, and full-scale IQ, have fewer autism symptoms, and have better social and communication skills than those who did not provide data (see Table S3 in the online supplement).

Compared to the typically developing group, the ASD group had fewer trials for upright faces (Table 2). Among children in the ASD sample, more data were provided by those who were older, had higher IQ scores, and had fewer autism repetitive behaviors and pragmatic problems (see Table S4 in the online supplement).

Construct performance.

Construct performance was established by comparing the ERP response to upright faces versus upright houses (Figure 1D). As expected, the typically developing group exhibited a more negative N170 amplitude and faster latency response to faces compared to houses (F values >6.97, p values <0.01).

Six-week stability.

As shown in Table 1, all values were moderate to high in the typically developing and ASD groups for the N170 latency (Figure 1F) and P100 latency, as well as the amplitudes of these components (see Table S5 in the online supplement).

Group discrimination.

As hypothesized, the ASD group demonstrated significantly slower upright faces N170 latency (Figure 1E) and upright faces P100 latency compared to the typically developing group (Table 2). This pattern was consistent when covariates were included.

Phenotype correlations.

As shown in Table 3, faster N170 latency was related to better face memory (r=−0.141, p<0.05) (see also Figure 1F). The P1 latency was uncorrelated with any prespecified variables.

VEP Assay

Acquisition.

Valid data were provided by 85% of the ASD sample and 96% of the typically developing sample; rates of acquisition did not differ significantly by site or experiment order (see Table S2 in the online supplement). Children with ASD who provided data had higher IQ and better social and communication skills on average than those who did not provide data (see Table S3 in the online supplement).

Compared to the typically developing group, the ASD group had fewer data trials (“VEP Trials” Table 2). In the ASD group, children who were older, had higher IQ scores, and had fewer repetitive behaviors and pragmatic problems provided more valid trials (see Table S4 in the online supplement).

Six-week stability.

As shown in Table 1, intraclass correlations over 6 weeks were moderate to high (Figure 1I) except for the low value found for the N1 in children with ASD who had a full-scale IQ ≤75.

Group discrimination.

In contrast to our predictions, the P100 (Figure 1H) and N1 amplitudes did not differ between groups (Table 2). In adjusted analyses, the group-by-order interaction was significant (F=3.97, df=1, 343, p=0.047; η_p²=0.011) (see Table S7 in the online supplement). Within groups, the P100 was larger for typically developing children for whom the experiment was in third position in the battery compared with fourth position (F=4.05, df=1, 108, p=0.05; η_p²=0.036), but there was no difference by order within the ASD group (F=0.52, df=1, 231, p=0.47; η_p²=0.002).

There were no significant differences in the VEP N1 amplitude response by group.

Phenotype correlations.

A larger P100 was related to better face memory (Table 3). There were no behavioral correlates for the N1.

Biological Motion Assay

Acquisition.

Valid data for the BM assay were provided by 67% of the ASD sample and 88% of the typically developing sample (see Table S2 in the online supplement). Given the low rate of valid data availability in the ASD group, the findings for this group are presented in the online supplement.

Construct performance.

The BMS effect was not observed in the typically developing group. No significant stimulus effects were detected at the N200 amplitude (F=0.08, df=1, 103, p=0.77), and the P3 amplitude was greater to scrambled motion than to biological motion (F=4.79, df=1, 103, p=0.03; η_p²=0.044) (Figure 1J).

Six-week stability.

All BMS intraclass correlation values were low (Table 1; see also Figure 1L as an example).

Group discrimination and phenotype correlations.

There were no group differences or phenotypic relations for the BMS at any component (Figure 1K; see also Tables S8 and S9 in the online supplement).

Discussion

Acquisition Rates

We prespecified that successful acquisition across sites and across key demographic and clinical factors should be >70% in the ASD groups in order to be viable in future clinical trials. Values were acceptable for the resting-state, faces, and VEP assays across diagnostic groups but fell below this threshold for the BM assay in the ASD group. The primary data loss was at the stage of signal validation (number of attended trials; presence of target component), rather than at participant acquisition (ability to wear the net). Our fixed protocol demanded sustained attention (faces task, BM task), which could be addressed through use of briefer assays (e.g., fewer experimental trials) or alternate implementation of the biomarker battery (e.g., removing experiments or implementing them over multiple days).

Children with ASD who were older, had higher IQ, and had greater social and communication skills provided more valid EEG data. Similarly, children with ASD who were older, had higher IQ, and had fewer repetitive behaviors had more valid trials. In contrast, in the typically developing group, only age was related to provision of data. These results suggest a potential complication for future clinical trials, with reduction in potential enrollment or restricted range of child characteristics if EEG is required for inclusion. Further, treatments may alter skills associated with acquisition (e.g., increasing attention), and thus EEG improvement might result from target engagement or also from changes in signal (and lower artifact) available for analysis. Future efforts will need to consider how to disentangle child ability, amount of data, EEG signal-to-noise ratio, and treatment effects.

Notably, we fully manualized our process for site and individual recording quality control, and we used manualized criteria to review all biomarker values for validity. While these procedures are laborious, they reduce the likelihood of introducing bias into the process of acquisition and data derivation. We were also able to implement these steps with post-baccalaureate staff in a manner that is comparable to the training and reliability standards of diagnostic and intelligence assessments. Given that EEG signals directly assess perception and attention, ignoring acquisition processes may reduce the objectiveness of the measurement.

Construct Performance

Our resting-state, faces, and VEP assays evoked the predicted components and EEG component contrasts based on the literature. Overall, our results indicated quantitative variability in the EEG responses between groups. In contrast, the BM assay did not demonstrate construct validity by specified metrics, suggesting that it may not be viable for future clinical trials.

Six-Week Stability

Our protocol focused on stability across 6 weeks to parallel a short-term clinical trial (27). We note that published research from the ABC-CT feasibility study (28) indicates that slope intraclass correlations (ICCs) were higher over a shorter time frame (6 days, ICC=0.699) in comparison with those reported here. Increased reliability over a shorter period likely reflects less age-related or treatment changes.

The majority of biomarkers assessed showed moderate stability that was similar in the ASD and typically developing groups and similar to social-affective clinical measures (Autism Impact Measure social reciprocity subscale, ICC=0.68; Pervasive Developmental Disorder Behavior Inventory social approach behaviors subscale, ICC=0.78; Faja et al., unpublished data), social-cognitive measures (NEPSY-II memory for faces subtest, ICC=0.63), and eye-tracking measures (ICC=0.23–0.87 by variable; [29]) across the same time period. We also note that the sample was clinically stable (Clinical Global Impressions severity scale score, ICC=0.957; Faja et al., unpublished data).

Values were also similar within assay (upright faces P100 latency, N170 latency) and for the same components abstracted across different assays (P100 amplitude). The BMS may have performed poorly because it is a difference score, as the response to biological motion itself showed moderate stability (see Table S4 in the online supplement).

Group Discrimination

All prespecified biomarkers demonstrated group discrimination in smaller samples in previous research. In the present study, P1 and N170 latency to upright faces (at T1 and T2) exhibited predicted latency delays in children with ASD at both T1 and T2. Resting slope differed between groups (at T1 and T2); controlling for covariates resulted in significance at T2 but not at T1. VEP responses did not differ between groups. Across all markers, lack of or inconsistency in group differences suggests limited viability for our biomarkers for diagnostic use.

Phenotype Correlations

Several of our biomarkers demonstrated statistically significant relationships with intellectual function in children with ASD. EEG power across all bands was related to IQ (30). Multiple biomarkers (Table 3) were related to face memory, potentially reflecting reliance on multiple neural systems, including basic visual attention, memory, and face perception (31). Standardized cognitive and language tasks, the face memory task, and the EEG stimuli require attention and inhibition, and were all acquired in the lab in temporal proximity. Our measures of autism symptoms, in contrast, were parent report and reflect a broader survey of behaviors over extended periods of time.

Our a priori hypotheses regarding relationships of EEG variables to social function were not confirmed. Exploratory analyses indicated that greater P100 amplitude to upright faces was associated with greater social ability (Vineland Adaptive Behavior Scales, 3rd ed.) and reduced social-communicative symptoms (Social Responsiveness Scale, 2nd ed.). We did not observe this relationship for the P100 amplitude elicited by VEP or the BM task, suggesting that it may be specific to early-stage face processing.

Biomarker Viability

In terms of meeting basic acquisition and psychometric standards, three assays merit further investigation: the resting-state, faces, and VEP assays. Based on poor acquisition, lack of construct performance, and lack of phenotype correlations, the BM task does not demonstrate potential utility for clinical trials in this population as acquired.

The eventual use of the biomarker is of critical importance, as the impact of these performance metrics may differ based on use. We note that overlap in distributions among typically developing and ASD children suggest that none of these assays are likely to be viable diagnostic biomarkers for ASD. If used for an inclusion or stratification marker, higher intraclass correlations may be critical, as the biomarker needs to be stable enough to consistently identify the same individual (pretreatment). A shorter retest period would be needed to eliminate potential developmental effects or clinical change that could influence our 6-week measure. For a surrogate endpoint, it may be more important that the marker track clinical change or the trial team may use the test-retest value to adjust the target treatment change value. While further validation of these markers needs to occur in the contexts of use (32), Loth has also noted (33) the need for markers that might predict placebo response.

Several caveats apply to the study. First, our results may not generalize to other ages and neuroimaging methods. Second, as our study was focused on reproducibility, the reported results focus on a subset of potential analyses to be conducted on data-rich EEG signals. We actively share our protocols, experiments, and data to enable the broader scientific community to partner with us in these exploratory investigations. Lastly, the translation of EEG measures to preclinical models will also be critical to understanding the mechanisms of treatment response; within our battery, two of the tasks are broadly relevant, while the face and BM tasks are likely only applicable to primate models.

In summary, our results provide support for the viability of well-standardized EEG acquisition in school-age children with ASD, and experiments that assess face processing, basic visual processing, and resting-state brain activity demonstrate desirable measurement properties for exploration as biomarkers that may be of use in clinical trials.

Center for Child Health, Behavior, and Development and Seattle Children’s Research Institute, Seattle (Webb, Borland, Benton, Santhosh, Shic); Department of Psychiatry and Behavioral Sciences, University of Washington School of Medicine, Seattle (Webb, Bernier); Yale Child Study Center (Naples, Carlos, McAllister, Chawarska, McPartland), Yale Center for Clinical Investigation (Seow), and Department of Emergency Medicine (Dziura), Yale University, New Haven, Conn.; Department of Neurology, Boston Children’s Hospital, Boston (Levin); Department of Neurology, Harvard Medical School, Boston (Levin); Department of Psychiatry and Biobehavioral Sciences (Hellemann, Jeste, Senturk, Sugar) and Department of Biostatistics (Senturk, Sugar), University of California Los Angeles, Los Angeles; Department of Computer Science, University of Colorado, Colorado Springs (Atyabi); Duke Center for Autism and Brain Development (Dawson, Sabatos-DeVito) and Department of Psychiatry and Behavioral Sciences (Dawson), Duke University, Durham, N.C.; Department of Pediatrics, Harvard University, Boston (Faja, Nelson); Division of Developmental Medicine, Boston Children’s Hospital, Boston (Faja, Nelson); Department of Medical Social Sciences, Northwestern University, Chicago (Murias); Graduate School of Education, Harvard University, Boston (Nelson); Department of Pediatrics, University of Washington, Seattle (Shic).

Send correspondence to Dr. Webb ([email protected]) and Dr. McPartland ([email protected]).

Interim data were presented at the annual meeting of the International Society for Autism Research, Rotterdam, the Netherlands, May 9–12, 2018.

Support for the Autism Biomarkers Consortium for Clinical Trials was provided by NIMH grant U19 MH108206. NIH scientific partners and members of the Foundation for the National Institutes of Health (FNIH) Biomarkers Consortium served on the Steering Committee and Biomarkers Consortium Project Team and provided consultation on study design and analysis. A representative from Janssen served on the FNIH Biomarkers Consortium Project Team and provided in-kind support in terms of sharing experiences and preliminary results of the Janssen Autism Knowledge Engine (JAKE) study.

Dr. Webb has served as a consultant for Janssen Research and Development. Dr. Dawson has served on the scientific advisory boards of Akili Interactive, Hoffmann–La Roche, Janssen Research and Development, LabCorp, Tris Pharma, and Zynerba; she has served as a consultant for Apple, Axial Ventures, Gerson Lehrman Group, Guidepoint Global, and Teva Pharmaceutical; she serves as CEO of DASIO, LLC; she has received book royalties from Guilford, Oxford University Press, and Springer Nature Press; she has developed technology, data, and/or products that have been licensed to Apple or Cryo-Cell International, from which she and Duke University have benefited financially; and she holds a patent (10,912,801) and has patent applications (62,757,234, 25,628,402, and 62,757,226). Dr. Shic has served as a consultant for BlackThorn Therapeutics, Janssen Research and Development, and Hoffmann–La Roche. Dr. McPartland has received funding from Janssen Research and Development; he has served as consultant for Blackthorn Therapeutics, BridgeBio, Customer Value Partners, and Determined Health; he has served on the scientific advisory boards of Modern Clinics and Pastorus; and he receives royalties from Guilford, Lambert Press, and Springer. The other authors report no financial relationships with commercial interests.

Repository data: We refer in this article to a number of experiments as well as support documents detailing our standard operation procedures and manuals of operation for the ABC-CT feasibility phase and main study phase; these documents can be accessed by request from the principal investigator ([email protected]), and data are available via the NIMH Data Archive (identifier: 2288; https://nda.nih.gov/edit_collection.html?id=2288).

Important contributions were provided by members of the Autism Biomarkers Consortium for Clinical Trials (ABC-CT), including Madeline Aubertine, Cynthia Brandt, Shou-An A. Chang, Kelsey Dommer, Alyssa Gateman, Simone Hasselmo, Julie Holub, Toni Howell, Ann Harris, Alexander Hoslet, Kathryn Hutchins, Kelsey Jackson, Scott Johnson, Lily Katsovitch, Minah Kim, Beibin Li, Samantha Major, Samuel Marsan, Andriana S. Méndez Leal, Lisa Nanamaker, Leon Rozenblit, Laura Simone, Dylan Stahl, Cindy Voghell, Andrew Yuan. Consultation was provided by the EU Aims Longitudinal European Autism Project team, including Declan Murphy, Eva Loth, Emily J.H. Jones, and Luke Mason.

References

1. Ewen JB, Sweeney JA, Potter WZ: Conceptual, regulatory, and strategic imperatives in the early days of EEG-based biomarker validation for neurodevelopmental disabilities. Front Integr Neurosci 2019; 13:45Crossref, Medline, Google Scholar

2. Jeste SS, Frohlich J, Loo SK: Electrophysiological biomarkers of diagnosis and outcome in neurodevelopmental disorders. Curr Opin Neurol 2015; 28:110–116Crossref, Medline, Google Scholar

3. McPartland JC: Developing clinically practicable biomarkers for autism spectrum disorder. J Autism Dev Disord 2017; 47:2935–2937Crossref, Medline, Google Scholar

4. Sahin M, Jones SR, Sweeney JA, et al.: Discovering translational biomarkers in neurodevelopmental disorders. Nat Rev Drug Discov 2018; 18:235Google Scholar

5. Keil A, Debener S, Gratton G, et al.: Committee report: Publication guidelines and recommendations for studies using electroencephalography and magnetoencephalography. Psychophysiology 2014; 51:1–21Crossref, Medline, Google Scholar

6. Webb SJ, Bernier R, Henderson HA, et al.: Guidelines and best practices for electrophysiological data collection, analysis, and reporting in autism. J Autism Dev Disord 2015; 45:425–443Crossref, Medline, Google Scholar

7. McPartland JC, Bernier RA, Jeste SS, et al.: The Autism Biomarkers Consortium for Clinical Trials (ABC-CT): scientific context, study design, and progress toward biomarker qualification. Front Integr Neurosci 2020; 14:16Crossref, Medline, Google Scholar

8. Webb SJ, Neuhaus E, Faja S: Face perception and learning in autism spectrum disorders. Q J Exp Psychol (Hove) 2017; 70:970–986Crossref, Medline, Google Scholar

9. Bentin S, Allison T, Puce A, et al.: Electrophysiological studies of face perception in humans. J Cogn Neurosci 1996; 8:551–565Crossref, Medline, Google Scholar

10. Taylor MJ, Batty M, Itier RJ: The faces of development: a review of early face processing over childhood. J Cogn Neurosci 2004; 16:1426–1442Crossref, Medline, Google Scholar

11. de Haan M, Johnson MH, Halit H: Development of face-sensitive event-related potentials during infancy; in Infant EEG and Event-Related Potentials. Edited by de Haan M. Hove, UK, Psychology Press, 2007, pp 77–100Google Scholar

12. Webb SJ, Jones EJH, Merkle K, et al.: Developmental change in the ERP responses to familiar faces in toddlers with autism spectrum disorders versus typical development. Child Dev 2011; 82:1868–1886Crossref, Medline, Google Scholar

13. McPartland J, Dawson G, Webb SJ, et al.: Event-related brain potentials reveal anomalies in temporal processing of faces in autism spectrum disorder. J Child Psychol Psychiatry 2004; 45:1235–1245Crossref, Medline, Google Scholar

14. Kang E, Keifer CM, Levy EJ, et al.: Atypicality of the N170 event-related potential in autism spectrum disorder: a meta-analysis. Biol Psychiatry Cogn Neurosci Neuroimaging 2018; 3:657–666Medline, Google Scholar

15. Happe F, Cook JL, Bird G: The structure of social cognition: in(ter)dependence of sociocognitive processes. Annu Rev Psychol 2017; 68:243–267Crossref, Medline, Google Scholar

16. Kröger A, Bletsch A, Krick C, et al.: Visual event-related potentials to biological motion stimuli in autism spectrum disorders. Soc Cogn Affect Neurosci 2014; 9:1214–1222Crossref, Medline, Google Scholar

17. Varcin KJ, Nelson CA III: A developmental neuroscience approach to the search for biomarkers in autism spectrum disorder. Curr Opin Neurol 2016; 29:123–129Crossref, Medline, Google Scholar

18. Kovarski K, Malvy J, Khanna RK, et al.: Reduced visual evoked potential amplitude in autism spectrum disorder, a variability effect? Transl Psychiatry 2019; 9:341Crossref, Medline, Google Scholar

19. LeBlanc JJ, Nelson CA: Deletion and duplication of 16p11.2 are associated with opposing effects on visual evoked potential amplitude. Mol Autism 2016; 7:30Crossref, Medline, Google Scholar

20. Wang J, Barstein J, Ethridge LE, et al.: Resting state EEG abnormalities in autism spectrum disorders. J Neurodev Disord 2013; 5:24Crossref, Medline, Google Scholar

21. Webb SJ, Shic F, Murias M, et al.: Biomarker acquisition and quality control for multi-site studies: the Autism Biomarkers Consortium for Clinical Trials. Front Integr Neurosci 2020; 13:71Crossref, Medline, Google Scholar

22. Voytek B, Kramer MA, Case J, et al.: Age-related changes in 1/f neural electrophysiological noise. J Neurosci 2015; 35:13257–13265Crossref, Medline, Google Scholar

23. Palva JM, Zhigalov A, Hirvonen J, et al.: Neuronal long-range temporal correlations and avalanche dynamics are correlated with behavioral scaling laws. Proc Natl Acad Sci USA 2013; 110:3585–3590Crossref, Medline, Google Scholar

24. Loth E, Charman T, Mason L, et al.: The EU-AIMS Longitudinal European Autism Project (LEAP): design and methodologies to identify and validate stratification biomarkers for autism spectrum disorders. Mol Autism 2017; 8:24Crossref, Medline, Google Scholar

25. Di Russo F, Martínez A, Sereno MI, et al.: Cortical sources of the early components of the visual evoked potential. Hum Brain Mapp 2002; 15:95–111Crossref, Medline, Google Scholar

26. Cicchetti DV: Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol Assess 1994; 6:284–290. doi: 10.1037/1040-3590.6.4.284Crossref, Google Scholar

27. Rodrigues R, Lai MC, Beswick A, et al.: Practitioner review: Pharmacological treatment of attention‐deficit/hyperactivity disorder symptoms in children and youth with autism spectrum disorder: a systematic review and meta‐analysis. J Child Psychol Psychiatry 2021; 62:680–700Crossref, Medline, Google Scholar

28. Levin AR, Naples AJ, Scheffler AW, et al.: Day-to-day test-retest reliability of EEG profiles in children with autism spectrum disorder and typical development. Front Integr Neurosci 2020; 14:21Crossref, Medline, Google Scholar

29. Shic F, Naples AJ, Barney EC, et al.: The Autism Biomarkers Consortium for Clinical Trials: evaluation of a battery of candidate eye-tracking biomarkers for use in autism clinical trials. Mol Autism 2002; 1:1–7Google Scholar

30. Schmid RG, Tirsch WS, Scherb H: Correlation between spectral EEG parameters and intelligence test variables in school-age children. Clin Neurophysiol 2002; 113:1647–1656Crossref, Medline, Google Scholar

31. Webb SJ, Merkle K, Murias M, et al.: ERP responses differentiate inverted but not upright face processing in adults with ASD. Soc Cogn Affect Neurosci 2012; 7:578–587Crossref, Medline, Google Scholar

32. FDA-NIH Biomarker Working Group: BEST (Biomarkers, Endpoints, and Other Tools) Resource. Silver Spring, Md, US Food and Drug Administration, and Bethesda, Md, US National Institutes of Health, 2016–2021. https://www.ncbi.nlm.nih.gov/books/NBK326791Google Scholar

33. Loth E: Placebo effects and participant heterogeneity in clinical trials of autism spectrum disorder. Lancet Psychiatry 2022; 9:184–185Crossref, Medline, Google Scholar

Issue Highlights

Volume 180
Issue 1

January 01, 2023
Pages 41-49

Metrics

Keywords

PDF download

History

Received 12 May 2021

Revised 14 March 2022

Accepted 2 May 2022

Published online 24 August 2022

Published in print 1 January 2023

Sign In

Change Password

Your password must have 6 characters or more:

Password Changed Successfully

Create your account

Forget yout Password?

Forgot your Username?

The Autism Biomarkers Consortium for Clinical Trials: Initial Evaluation of a Battery of Candidate EEG Biomarkers

Abstract

Objective:

Methods:

Results:

Conclusions:

Methods

ABC-CT Protocol

Participant Characteristics

EEG Acquisition Standardization

EEG equipment.

EEG protocol.

Resting-state assay.

Faces assay.

VEP assay.

Biological motion perception assay.

Analytic Plan

Acquisition.

Construct performance.

Six-week stability.

Group discrimination.

Phenotypic correlations.

Results

Resting Assay

Acquisition.

Six-week stability.

Group discrimination.

Phenotype correlations.

Faces Assay

Acquisition.

Construct performance.

Six-week stability.

Group discrimination.

Phenotype correlations.

VEP Assay

Acquisition.

Six-week stability.

Group discrimination.

Phenotype correlations.

Biological Motion Assay

Acquisition.

Construct performance.

Six-week stability.

Group discrimination and phenotype correlations.

Discussion

Acquisition Rates

Construct Performance

Six-Week Stability

Group Discrimination

Phenotype Correlations

Biomarker Viability

Issue Highlights