Individuals with complex communication needs (CCN) comprised approximately 1.3% of the total United States population in 2013, or approximately 4 million people (Beukelman & Mirenda, 2013); however, this figure has likely increased along with the increasing incidence rates of autism spectrum disorder (ASD) and other disabilities impacting communication and language skills (Light & McNaughton, 2012). Augmentative and alternative communication (AAC) interventions for children with intellectual and developmental disabilities and complex communication needs increase functional communication skills (Drager et al., 2010), increase language and social competence (Light et al., 2003), and can increase natural speech in some individuals (Millar et al., 2006). The ability to effectively and efficiently communicate wants and needs and actively participate in social exchanges can improve quality of life and independence (Chan & Zoellick, 2011). Communication increases social closeness, or relationships between individuals characterized by positive exchanges, mutuality, sharing, and mutual enjoyment (Howes, 1983).

AAC includes unaided systems such as manual sign and gestures as well as aided systems such as the Picture Exchange Communication System (PECS; Bondy & Frost, 1994) and speech generating devices (SGD). Aided AAC systems include high tech speech generating aided communication systems (e.g. apps on Apple™ devices, GoTalk9+, AMDi Tech/Plus 32, ablenet® BIGmack; Logan, Iacono, & Trembath, 2017) and low tech communication systems (e.g. PECS, eye gaze board, symbols on a ring or in a notebook). Aided AAC specifically refers to systems requiring external or additional supports outside of the speaker’s body. AAC modalities refers to the different systems of AAC: aided or unaided, low or high tech, and the different access methods (e.g. eye gaze, touchscreen, head pointer). Unaided AAC systems such as gestures and manual sign require gross and fine motor skills (Fuller, Lloyd, & Stratton, 1997), functional use of hands and fingers and trained communication partners familiar with idiosyncratic gestures and sign languages (Mirenda, 2003). Aided AAC systems are most often selected for individuals with CCN (Mirenda, 2001) due to their accommodation of alternate access for individuals with fine motor impairments, intelligibility to unfamiliar listeners without training, and visual symbols that resemble their referents (Mirenda, 2001).

The growing number of reviews that summarize the extant AAC intervention literature has utility in informing clinical decisions and aiding policymakers in making research-based decisions about educational and medical policies. However, reviews of studies are subject to publication bias (Shea et al., 2007, 2017) and can include studies of poor methodological quality (Sharif et al., 2013). Synthesizing results across systematic reviews, literature reviews, and meta-analyses is difficult due to the use of different methodologies, conceptual frameworks, and, if used, effect size calculations (Shea et al., 2007, 2017). Moreover, methodological procedures, rigor, and purposes and findings of systematic reviews often differ, contributing to potential practitioner confusion in determining whether a particular intervention or practice has sufficient evidence to warrant adoption (Wells & Littell, 2009). A previous practitioner megaanalysis on special education interventions summarized 18 meta-analyses to indicate which interventions have the largest effect sizes. This paper was the first to indicate a need for synthesizing syntheses of research to inform practices, encourage selection of highly effective interventions for use with individuals with disabilities, and recommend discontinuation of use of ineffective practices (Forness et al., 1997).

Mega-review methodology originated in the healthcare field as a method by which researchers gather systematic reviews into a single synthesis of combined findings about an intervention or topic of interest (e.g., Dragioti et al., 2017; Seida et al., 2009). Researchers conduct mega-reviews, also called umbrella reviews, to collect, collate, and summarize the evidence base provided by systematic reviews of individual studies to present a cohesive synthesis of the extant literature, identify gaps and blind spots with a field’s published evidence base, and identify promising practices for researchers and practitioners (Ioannidis, 2009). Mega-reviews also provide an opportunity to examine the methodological quality of included reviews. Mega-review methodology applied to AAC interventions is very limited: to date, there is a single overview of review studies (Nam et al., 2018) examining five reviews on AAC for individuals with developmental disabilities. This mega-review was limited to reviews of single case research designs comparing AAC systems and did not assess review methodological quality. There has been an increase of published systematic reviews and meta-analysis in recent decades, thus a systematic review of reviews in the area of AAC implementation is warranted.

What Works Clearinghouse (WWC) standards for single-case research design studies provide rigorous methodological requirements for researchers that are used to determine study quality and establish evidence-based practices (Kratochwill et al., 2010). There are no similarly clear standards for conducting and reporting reviews of literature. The Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines focus on reporting of randomized control trial data using quantitative methodologies (Moher et al., 2009). There is no quality indicator measure to compare review quality across group and single case research designs. Due to the inconsistent reporting of quality of evidence on AAC interventions, researchers and clinicians are unable to make clear and conclusive recommendations to practitioners and families about best practices or research-based interventions (e.g., Ganz et al., 2017; Holyfield et al., 2017).

The Scottish Intercollegiate Guidelines Network (SIGN) checklist was developed to assess methodological quality of systematic reviews and meta-analyses of single case research (Manolov et al., 2017); however, SIGN does not address methodological quality of other design types. A MeaSurement Tool to Assess systematic Reviews (AMSTAR) was developed by Shea et al. (2007) and revised to include non-randomized studies (AMSTAR 2; Shea et al., 2017). AMSTAR 2 has been evaluated for inter-rater reliability (median Cohen’s kappa range: 0.51–0.71) and found to have moderate to substantial agreement (Lorenz et al., 2019; Shea et al., 2017). AMSTAR 2 was also found to have validity of appraising systematic reviews and determined to be easy to administer (Lorenz et al., 2019; Pieper et al., 2015) and had good inter-rater reliability with the risk of bias in systematic reviews tool (Pieper et al., 2019).

The purposes of this mega-review are to summarize published peer-reviewed literature reviews about aided AAC interventions that included children with developmental disabilities, to present an overview of the most frequently used aided AAC interventions, and to appraise each review’s quality using AMSTAR 2. The following research questions were asked: (a) what is the methodological quality of peer-reviewed published literature reviews on interventions for teaching communication skills using aided AAC to children with developmental disabilities and complex communication needs; (b) has the methodological quality of reviews of aided AAC interventions for children with developmental disabilities and complex communication needs changed over time; (c) how are aided AAC interventions described regarding participants, interventionists, and interventions; (d) what are the review-reported communications outcomes of aided AAC interventions and review-reported recommendations for future research; and (e) what intervention methodologies have been used to produce and report knowledge about aided AAC interventions for children with developmental disabilities and complex communication needs?

Method

A mega-review of the literature on interventions for individuals with CCN who use aided AAC was conducted to summarize the published literature and provide guidance to researchers on areas of further research, ways to increase methodological rigor, and any blind spots within the literature. Aided AAC was operationally defined as AAC requiring materials external to the body, such as icons and/or voice-output systems. This mega-review was conducted as a systematic review of literature reviews including systematic reviews, meta-analyses, and narrative reviews with systematic and replicable methods replacing individual studies. By completing a review of reviews, the authors aimed to summarize extant literature and identify areas commonly featured in aided AAC research and bring to light areas of need identified by authors reviewing individual studies. Unpublished and gray literature, including dissertations, book chapters, and reports, were excluded from this mega-review due to limiting searches to peer-reviewed journals.

Search Strategy

The electronic databases ERIC, PsycINFO, and Academic Search Premier were systematically searched by the first author using combinations of the following free term keywords: (a) “augmentative and alternative communication or AAC”, OR (b) “assistive technology”, OR (c) “aided communication”, AND (a) “literature review”, OR (b) “systematic review”, OR (c) “meta-analysis”. In this paper, meta-analyses referred to reviews that statistically combine data from multiple studies, assess effect sizes, and analyze research results (Uman, 2011) and literature review referred to a review, including systematic reviews, that reported completing systematic searches but did not explicitly declare a priori methods or research question(s) and also did not use meta-analysis procedures. The following restrictions were placed on the electronic searches: (a) peer-reviewed publications; (b) published between January 1, 1999 and August 1, 2020 to capture the last 20 years of research in this area; (c) published in English; and (d) no dissertations, books, book chapters, or government reports. Narrative and scoping reviews (e.g. Schlosser & Sigafoos, 2006) were excluded from the present mega-review due to AMSTAR 2 being developed for use with systematic reviews/meta-analyses. Scoping reviews were excluded due to their not being research question-driven, disparities in reporting replicable methods of the review process, and scoping reviews are often not intended to synthesize research evidence (Munn et al., 2018).

Initial searches returned 1215 articles. Titles and abstracts of each article were reviewed by the first author after removal of duplicates, and 185 met inclusion criteria. Hand searches of the most frequently appearing journals (i.e., Augmentative and Alternative Communication and Journal of Speech, Language, and Hearing Research) were completed by the first author in July 2018, looking at titles and keywords for “review” or “summary” and “AAC” or “assistive technology”. The journal Augmentative and Alternative Communication was searched from January 1999–August 1, 2020 and yielded 1 new article. A hand search of the Journal of Speech, Language, and Hearing Research was conducted from January 1999 and August 1, 2020; no new articles were identified. The first author also completed ancestral searches by reviewing the articles cited by reviews identified for inclusion and published in the last 20 years; no new articles were identified. Four reviews identified via expert nomination were included for full text analysis. During title and abstract screening, 15 scoping and qualitative reviews were excluded.

During full text coding, each article was reviewed by the first author against the following inclusion criteria: (a) literature review, systematic review, or meta-analysis; (b) replicable methods that included single case or group design research methodologies; (c) reported or addressed use of aided AAC; (d) reported communication outcomes, operationalized as a change in receptive or expressive communication skills (e.g. syntax, verbal operants, initiations, responses) for at least one participant per study; (e) included at least one participant diagnosed with a developmental disability; (f) reported at least one participant 0–18 years; and (g) evaluated the results of the studies or interventions using a narrative summary, table, or quantitative analysis.

A total of 185 reviews were identified that met inclusion criteria and were coded via full text review. During full text coding, 101 reviews were excluded. Reviews were excluded from analysis for the following reasons: (a) qualitative methodology (n = 8); (b) narrative reviews without systematic replicable methods (n = 11); (c) scoping reviews (n = 5); (d) participants’ principal diagnosis was sensory impairment (n = 6); (e) no communication outcomes reported (n = 36); (f) only adult participants (n = 5); (g) only unaided AAC used in interventions (n = 11); and (h) duplicates (n = 19). Reviews that did not synthesize intervention results (Binger & Light, 2008) or did not report intervention results were also excluded (e.g. Gevarter et al., 2013). Searches and application of inclusion and exclusion criteria are detailed in Fig. 1. A list of excluded reviews is available upon request. There were 84 reviews included in this mega-review. Population, intervention, comparator group, outcome, and study design (PICOS) information for this mega-review are depicted in Fig. 3, available in supplemental materials.

Fig. 1
figure 1

PRISMA Diagram for mega-review

Data Extraction

The first author created a coding sheet in Microsoft Excel™ to code each article selected for full text review (available upon request). Variables and coding categories were identified through close reading of each review, then counted via event recording by the first author for each occurrence. Primary coded variables were (a) type of designs; (b) number of included studies; (c) age of participants; (d) participant and interventionist demographics; (e) type of intervention; (f) intervention setting; (g) inter-rater agreement; (h) generalization; (i) maintenance; (j) outcomes and findings reported by review authors; (k) limitations; and (l) recommendations. Additional variables that were coded were: (a) participant race/ethnicity; (b) participant gender; (c) type of interventionist (e.g., clinician, teacher, or parent); (d) disability diagnoses; (e) intervention setting: and (f) type of AAC (e.g. PECS/PCS, SGD, mobile tablets as SGD. Specific types of SGD were coded but were aggregated due to a wide variety of devices and systems used over 20 years of technology development. Types of interfaces and complexity were not coded due to lack of reporting by review authors. Behavioral intervention was defined as the use of an intervention strategy based on principles of applied behavior analysis (e.g., forward chaining, prompt hierarchy, time delay). Due to intervention development and terminology evolution and inconsistent reporting of intervention strategies by review authors, specific intervention types were not coded. Variables were coded using key words and quantitative data reported by each review and hand-typed or copied and pasted into the Excel sheet. A complete coding form is available upon request.

Following coding for variables of interest, the first author coded each review using the AMSTAR 2 tool. Items not applicable to each review’s included designs were coded “not applicable” and not calculated in total points possible (e.g., a literature review was coded not applicable for having done appropriate statistical analyses). AMSTAR 2 results were compared across reviews by calculating percent of points possible: number of items marked “Yes” or Partial Yes” divided by the total number of applicable items × 100. Because the authors of AMSTAR 2 did not create guidance around cutoff levels for review quality, no reviews were excluded due to review quality.

Reliability

Search and inclusion reliability were independently conducted by the fourth author, an assistant professor in special education. Electronic database searches were completed using free term keywords and applying inclusion and exclusion criteria to titles and abstracts following a 1-h training via Zoom. Training procedures are available upon request and included modeling free term searches in each database, tracking number of hits per database, and applying inclusion and exclusion criteria to each title and abstract. Agreement across the three electronic databases averaged 93.3% (range: 87.5%–98%), calculated by the total number of agreements divided by the total number of agreements plus total number of disagreements × 100 for each database. Disagreements at the title/abstract level were discussed until the two authors reached consensus on inclusion/exclusion of reviews.

Data extraction inter-coder agreement (ICA) was completed by random number generation to select 37% of reviews (n = 31) from an alphabetical list of included references. The third author, a special education doctoral student, compared a copy of the completed coding sheet for each review against review full text and marked each Excel cell for disagreement in red. Data extraction ICA was 98.26%, calculated by dividing 396 agreements by 403 possible agreements × 100. AMSTAR 2 coding agreement was similarly completed by the third author reviewing scores for another 31 randomly selected articles. The third author used a copy of the completed data codes and marked disagreements in red. Initial independent AMSTAR 2 coding ICA was 89.72%, calculated by dividing 445 agreements by 496 possible agreements × 100. Disagreements on variable coding and AMSTAR 2 ratings were discussed between the first and third authors until consensus was reached.

Results

Included reviews were coded according to reported variables and results are summarized in narrative form. Results were calculated by frequency count of variables per review and reported by content area. Most of the reviews that met inclusion criteria were identified as systematic reviews (n = 36), followed by literature reviews (n = 26) and meta-analyses (n = 22). Research designs encompassed solely single case research designs (n = 43), single case and group designs (n = 31), single case and group designs in combination with qualitative methodology (n = 6), mixed methods (n = 2), and not explicitly reported (n = 2). Table 1 aggregates how many reviews reported variables of interest such as type of designs, ages of participants, and type of AAC used. Reviews and meta-analyses reporting on the same studies or multiple publications about a single set of searches were counted as individual separate reviews, thus variables of interest could have been coded and counted multiple times.

Table 1 Variables of Interest Reported by Reviews

Participants

The most commonly reported participant disability condition was autism spectrum disorder (ASD), reported by 62 reviews. Thirty-six reviews reported disabilities that fall under the disability category aggregate of other health impairments, including cerebral palsy and other reported conditions (e.g., epilepsy, cleft palate, behavior and attention disorders). Developmental disabilities (n = 34), intellectual disability (n = 33), and speech language impairment (n = 7) were the next most frequently reported participant disabilities. Reviews also reported that participants had multiple disability diagnoses or aggregated diagnoses as developmental disabilities. Participants in single-case research and group designs were mostly male, had a diagnosed disability impacting their functional communication, and were school-aged or 3–18 years old.

Due to variations in review reporting of ages and school levels, the authors created participant age categories of (a) birth through kindergarten or 0–5 years old (n = 50); (b) elementary school age of kindergarten through sixth grade or 6–10 years old; n = 62); (c) adolescents 11–17 years old or sixth grade through twelfth grade (n = 55); and (d) adults or ages 18 years and up (n = 38). Reviews reported participants by numerical age as well as age category (e.g., elementary or secondary) and often reported multiple age categories, thus the sum of these categories exceeds 84. In reporting participant demographics, four reviews included race/ethnicity data (Barton et al., 2017; Chung et al., 2012; Ganz et al., 2014a; Shire & Jones, 2015), 31 reviews reported participant gender, and type of disability was reported in all 84 reviews.

Interventionists

Most reviews did not report on (a) study interventionists other than the researchers; (b) participant demographics, especially data specific to race or ethnicity; (c) interventionist or researcher demographic information; or (d) the identity, expertise or training level, or role within the study of the interventionist. Fifteen reviews reported a teacher or paraprofessional as the interventionist, 13 reviews reported the researcher was the sole interventionist, and 9 reviews reported parent interventionists. One review reported the education level and race/ethnicity of communication partners (Shire & Jones, 2015).

Settings

Interventions took place in (a) a general education or inclusive classroom setting (n = 37); (b) other settings such as residential facility or hospital (n = 28); (c) a self-contained or special education classroom (n = 23); (d) participant’s home (n = 23); or (e) an outpatient clinic setting (n = 13).

AAC Modalities and Interventions

The most frequently reported AAC modalities were SGDs (n = 38), PECS or picture symbols (n = 34), other (n = 15), and specified iPadFootnote 1/iPod as an SGD (n = 7). The use of behavioral interventions or strategies, such as PECS (Tincani & Devis, 2011); prompting, shaping, and modified interrupted chain procedures (Alwell & Cobb, 2009); practitioner-implemented functional communication training focusing on single mands, intraverbals, or tacting (Alzrayer et al., 2014), was reported by most reviews (n = 65). Sixteen reviews reported interventions that were not explicitly based on principles of behavior, and seven reviews did not report enough information about the interventions of included studies to determine whether behavioral strategies were used.

Review authors determined some AAC interventions to have enough well-designed studies to be established as evidence-based practices. These included PECS (Tincani & Devis, 2011), the use of assistive technology for activities of daily living (Cumming & Draper Rodríguez, 2017), the use of high-tech AAC to teach social communication skills (Morin et al., 2018), computer-aided instruction (Barton et al., 2017), tablet-based video modeling (Hong et al., 2017), aided AAC modeling (Lynch et al., 2018), and inclusive education for increasing communication skills (Alquraini & Gut, 2012). Claims regarding evidence-based practices were made by reviews on the basis of moderate to strong intervention effects on dependent variables of interest and by quality of evidence analyses; however, reviews used varied effect size calculations and different evaluations of methodological rigor from one another.

Interventions with sufficient evidence and quality to meet the criteria of emerging evidence-based practices according to review authors included aided AAC input in which the communication partner/interventionist also uses the AAC system (Branson & Demchak, 2009; Ganz et al., 2012b; Sennott et al., 2016; Shire & Jones, 2015; Therrien et al., 2016), AAC-based interventions to reduce challenging behavior (Walker & Snell, 2013; Walker et al., 2018), tablets and mobile devices as AAC (Cumming & Draper Rodríguez, 2017; Kim & Kimm, 2017; Lorah et al., 2015; Stephenson & Limbrick, 2015), and using symbols and icons to represent words and phrases in aided AAC systems (Roche et al., 2014).

Reviews that compared different types of AAC (e.g., comparison of high- to low-tech systems) found that PECS compared to speech-generating devices did not have significant differences in communication outcomes (Ganz et al., 2014). Greater communication skills gains were found for individuals using aided compared to unaided AAC, and individuals with ASD made greater gains in communication skills than individuals with intellectual disabilities (Holyfield et al., 2017). A systematic review of studies comparing participant preference for SGDs compared to other modes of AAC found that 67% of participants who had developmental disabilities had some degree of preference for using SGD and 33% of participants preferred picture exchange (van der Meer et al., 2011).

Generalization, Maintenance, and Social Validity

Very few reviews reported maintenance or generalization data from the included studies, a criticism noted by many review authors about individual studies and which prompted calls to action for researchers to program for and report these data. Reviews that measured social validity (n = 5), generalization (n = 33), and maintenance (n = 30) were in the minority of the total included 84 reviews.

Experimental Designs

AAC interventions for individuals with intellectual and developmental disabilities who have complex communication needs were most frequently studied in single-case research designs (n = 38), using primarily multiple-baseline or multiple probe designs. Alternating treatment or multielement designs were used to compare contexts, communication partners, and most frequently used to compare investigate replacement of challenging behavior with functional communication responses.

Reviews including group designs investigated effects of PECS interventions (e.g., Allen et al., 2017; Alsayedhassan et al., 2016; Flippin et al., 2010; Ostryn et al., 2008; Preston & Carter, 2009) or comparisons of different types of AAC (e.g., Brunner & Seung, 2009; Lancioni et al., 2007; Branson & Demchak, 2009; Schlosser & Wendt, 2008). Summaries of type of AAC intervention reported outcomes and recommendations made by review authors are presented in Table 2, available in supplemental materials.

Table 2 Interventions, Outcomes, and Recommendations Reported in Each Included Review

Review Methodological Quality

Since 2001, when the first systematic review specific to AAC and assistive technology was published (Lancioni et al., 2001), review quality, as evaluated by AMSTAR 2, has been highly variable but has improved over time. There was considerable variation in review quality, but an overall increasing trend, as shown in Fig. 2, which indicates greater adherence to best practices in completing systematic reviews and meta-analyses over time.

Fig. 2
figure 2

AMSTAR 2 scores in chronological order by publication year. Each review’s AMSTAR 2% achieved is indicated by sections within each publication year bar

Areas of strength in methodological quality included explicitly stating the research question and reporting information on participants, intervention, comparison groups, and outcomes (n = 73), explicitly established the methods before conducting the review (n = 80), explained inclusion/exclusion criteria (n = 81), comprehensively searched at least two databases, conducted a reference search, and searches within 24 months of completion of the review (n = 80), and described included studies with sufficient detail (n = 78). Data on the percent of reviews that scored a “yes” for each AMSTAR 2 item are reported in Table 3, available in supplemental materials. Areas of need included providing a list of excluded studies and reasons for exclusion (n = 4), accounting for risk of bias in individual studies when interpreting the review results (n = 22), and providing a satisfactory explanation for heterogeneity in the results of the review (n = 21).

Table 3 Percent of Reviews Scoring Yes or Partial Yes for Each AMSTAR 2 Indicator

The reviews with the highest AMSTAR 2 scores were primarily systematic reviews and meta-analyses, which could reflect the validity of the tool to assess the methodological quality of group designs. These reviews were all published after the introduction of AMSTAR, and were all published in the last 10 years with one exception (Rispoli et al., 2010). The 11 highest scoring reviews scored 80% or greater of possible points on AMSTAR 2, and the four next top scoring reviews achieved 77% of items possible. Each included review’s AMSTAR 2 scores and total achieved are shown in Table 4, available in supplemental materials.

Table 4 AMSTAR 2 Scores by Each Included Review

Discussion

This mega-review synthesizes information reported by literature reviews, meta-analyses, systematic reviews, and one mega-review on aided AAC interventions for children with intellectual and developmental disabilities who have complex communication needs. Data from this overview of research literature indicate that review methodological quality is improving slightly over time. Most of the research used single-case research designs, utilized behavioral interventions, was conducted with participants using a speech-generating device, was conducted with children diagnosed with autism spectrum disorder ages 5–18 in general education settings, and is most frequently synthesized in systematic reviews of literature.

Thematic Issues Raised by Reviews as Areas of Need in Aided AAC Research

Assessing and reporting pre-intervention communication skills, particularly vocal communication skills (Walker et al., 2018), intervening on younger children rather than waiting for natural speech before beginning AAC interventions (Alper & Raharinirina, 2006), and training natural communication partners were most frequently mentioned as areas of future research by review authors. Researching the impact of different user interfaces was recommended in two reviews (Lorah et al., 2015; van der Meer et al., 2011), and investigating symbol arrays on communication outcomes was recommended in three reviews (Lynch et al., 2018; Morin et al., 2018; Roche et al., 2014). Rapid developments and changes in technology, user interfaces, symbol languages, and specifically advances in multi-use mobile technology were mentioned as reasons to research AAC devices other than dedicated SGDs. These review authors also recommended investigating user experiences when interacting with technology and exploring different designs and interfaces with various users to better meet their needs (Lorah et al., 2015).

Another area of further research recommended by review authors was investigating the effects of different types of speech output (Drager et al., 2010; Schlosser & Blischak, 2001). Comparisons of intelligibility of digitized and synthesized speech led to further questions about whether environmental noise impacts intelligibility of AAC voices, and recommendations that researchers examine the intelligibility and comprehension of AAC device voices by the user’s same age peers (Drager et al., 2010).

Researching comparative effectiveness of types of AAC, (e.g., aided and unaided AAC), was an area of future research recommended by authors of seven reviews (Branson & Demchak, 2009; Brunner & Seung, 2009; Cumming & Draper Rodriguez, 2017; Logan et al., 2017; Lorah et al., 2015; Preston & Carter, 2009; Therrien et al., 2016). While some reviews addressed comparing effectiveness of different types of AAC (e.g., Ganz et al., 2012b; Ganz et al., 2017; Holyfield et al., 2017; van der Meer et al., 2011), further research is warranted so that clinicians and educators have an evidence based rationale for selecting specific types of AAC for an individual’s needs and preferences. Recommendations included that researchers perform comparison studies of different AAC modalities (Brunner & Seung, 2009), mobile technology to more traditional AAC (Cumming & Draper Rodriguez, 2017), and different AAC programs and apps (Lorah et al., 2015).

Multiple reviews recommended further investigation into interventions aimed at training natural communication partners such as parents and teachers to communicate using AAC with individuals who have complex communication needs (Kent-Walsh et al., 2015; Logan et al., 2017; Mancil, 2006; Morin et al., 2018; Shire & Jones, 2015; Snell et al., 2006; Stauter et al., 2017; Tincani & Devis, 2011; Walker & Snell, 2013). Training communication partners other than the researcher on interventions and using them as natural change agents could lead to more socially valid intervention outcomes, more successful generalization, higher maintenance of communicative behaviors, and integrate AAC interventions into natural contexts.

Research on effective dosage, or amount of treatment, of AAC interventions was explicitly mentioned by one review (Sennott et al., 2016) with specific recommendations to examine variables relating to service delivery by speech-language pathologists in school settings. Sennott et al. (2016) recommended that researchers examine the impact of language interventions across the lifespan and how contextual and personal changes affect AAC intervention needs, dosage, and communication skills such as vocabulary.

Further research on the effects of AAC interventions on academic skills was recommended by 4 reviews (Ganz et al., 2014a; Ganz et al., 2017; Hong et al., 2017; Still et al., 2014). Other reviews recommended further research on the impact of technology aided instruction on literacy outcomes (Barker et al., 2012; Boyle et al., 2017; Machalicek et al., 2010; Stauter et al., 2017).

Investigating the impact of AAC intervention on independence and social interactions such as friendships was recommended in 6 reviews (Bryant et al., 2010; Chung et al., 2012; Cumming & Draper Rodríguez, 2017; Holyfield et al., 2017; Østvik et al., 2017; Therrien et al., 2016). While social interactions were identified as areas for future research, the reviews included specific calls to action to explore the impact of AAC interventions on peer interactions and supportive relationships for students with disabilities in middle and high school settings (Chung et al., 2012). Areas of future research specific to social relationships were developing indicators of social relationships and operationalizing definitions of friendship (Østvik et al., 2017; Therrien et al., 2016).

Integrating and increasing collaboration among parents, researchers, teachers, and service providers throughout the research process was recommended by 5 reviews (Alper & Raharinirina, 2006; Alquraini & Gut, 2012; Andzik et al., 2016; Logan et al., 2017; Pinto et al., 2009). These authors called for increasing input from stakeholders throughout all parts of the research process, which is a recommendation of participatory research practices (Bailey et al., 2015). Increasing collaboration and participation by stakeholders could also increase social validity and generalizability of empirical interventions to more natural settings (Alper & Raharinirina, 2006). Desideri and colleagues (2013) recommended transitioning research away from the decontextualized experiences of researchers to incorporate effective models for low incidence disabilities, specifically children with multiple disabilities, and increasing collaboration with individuals with disabilities, hearing their experiences, and designing research with participants to best meet their areas of need. They also recommend evaluating the cost effectiveness of assistive technology interventions to address the financial burden on the family.

Participant Demographics

Reviews reporting on included studies’ reporting of race and ethnicity data for participants were very few (n = 4). Reporting this information is crucial not only to answer the question of for whom does this intervention work, but also to assist researchers working to identify mediators and moderators of intervention effectiveness. This paucity of race/ethnicity reporting possibly reflects the need for greater participant diversity in special education research (Sinclair et al., 2018). Non-binary gender identifications were not reported in any of the reviews, further contributing to the oppression and erasure of the identities of individuals with multiple sites for marginalization (Duke, 2011), or intersectionality (Crenshaw et al., 1996). Research participant demographics do not proportionately represent national demographics or reflect school classroom populations, limiting generalizability and recommendations for diverse populations of students (Spear et al., 2013). Examining whether family poverty, urban or rural environments, race and ethnicity, and specific factors related to disability categories impact the effectiveness of interventions and best practices will be crucial as educational policy and practices shift (Singer et al., 2017).

Researchers can create richer, more relevant, and more robust AAC interventions to increase the field’s knowledge of different experiences with living with AAC and increase points of entry and topics for intervention to increase communicative competencies and quality of life for individuals with complex communication needs (Chan & Zoellick, 2011). Research on the experiences and outcomes of AAC interventions with linguistically diverse individuals with disabilities who use aided AAC is a growing area within the literature (Kulkarni & Parmar, 2017). Determining language preference for instruction and language output of aided AAC (Aguilar et al., 2016; Kunze et al., 2019) and incorporating cultural views of AAC users, their families, and stakeholders are areas requiring exploration (Kulkarni & Parmar, 2017) in the field of AAC interventions.

The predominance of single-case research designs in the field of special education affords AAC researchers the ability to report more specific and individualized information about participants to aid in direct and systematic replication of effective interventions and increase the known pool of students for whom specific interventions work (Horner et al., 2005). The cornerstone of the Individuals with Disabilities Educational Improvement Act (2004) and the individualized education program emphasize the “individual” as the driving force behind development of interventions for a student with disabilities. Understanding research as a practice in partnership with participants and understanding that researchers bring their own theoretical frameworks, implicit biases, and personal understandings of constructs including dis/ability, race, gender, sexual orientation, and appropriate behavioral outcomes (Arzubiaga et al., 2008), will help move research on AAC interventions forward.

Social Validity

One crucial area that the AMSTAR-2 does not address is the social validity of either the studies included in the reviews and the reviews themselves. One way to improve social and ecological validity is by increasing the participation of research subjects, their families, educators, and other community stakeholders in the development of research questions, selection of conceptual frameworks, types of interventions used, which outcomes are measured, and what data is reported and through which channels. While individual studies in the included reviews may have assessed and reported social validity, many included reviews did not report on or discuss social validity measures or findings. Without data on the acceptability of treatments and interventions, researchers and practitioners may be reluctant to select interventions that are not be acceptable to stakeholders, potentially leading to lack of contextual fit, lower rates of intervention treatment fidelity, and lower rates of treatment adherence (Spear et al., 2013).

For children who use AAC to communicate, assessing consumer satisfaction should involve the AAC user to the greatest extent possible. A familiar adult and same age peer could ask the AAC user for their input, providing the researcher with a richer analysis of the user’s experience told to different and ecologically valid communication partners. Directly asking the user about their experiences before, during, and after intervention, especially involving the user from the beginning stages of research development would increase targeting user-important skills and increase intervention relevance to the user, possibly increasing buy-in and motivation to adhere to and complete the intervention (Leko, 2014).

Generalization and Maintenance

Generalization and maintenance were identified by many review authors as areas of need, both for study authors to program from inception and for increased reporting out in manuscripts. Other researchers have analyzed and offered suggestions for increasing programming for generalization and collecting data on maintenance of skills at longer intervals after intervention ends (Schlosser & Lee, 2000) yet underutilization or underreporting of both persist. In AAC research, bringing functional communication skills under stimulus control may require additional programming during intervention or follow up interventions after the user has learned the communication skill to then teach discrimination of under which conditions to exhibit that target behavior. Specifically, researchers could program for intervention in the natural environment of the AAC user (Schlosser & Lee, 2000). Conducting an intervention session in the school cafeteria, in the classroom with the general education teacher as the interventionist, or on a field trip could decrease stimulus control of a specific learning setting and increase behavioral momentum of the target skill (Mace & Nevin, 2017). If changing intervention settings is not feasible, incorporating more communication partners such as the AAC user’s schoolmates and teachers could increase generalization and increase the social validity of using an AAC system across multiple communication partners.

Training the most frequent communication partners of individuals who use AAC would enhance both generalization and maintenance of the newly acquired skills. Generalization in the absence of reinforcement places the target behavior on extinction, resulting in sharp decreases in responding and potential increases in problem behavior (Mace & Nevin, 2017). Training communication partners on strategies such as sitting close enough to the AAC user to see the communication system and signal their availability as an audience, assessing and using the communication partner’s skills as a jumping off point for coaching and intervention on responsiveness, and training the communication partner to recognize and respond to the user’s communicative initiations could increase the frequency of the AAC user’s communication behaviors and opportunities to contact natural reinforcement (Calculator, 1988).

Communication Breakdown

Another aspect of programming for generalization and accounting for maintenance of acquired skills over time would involve teaching the AAC user what to do in the event of communication breakdown (Calculator, 1988). Speech-generating devices can be unreliable (Baxter et al., 2012) and can run out of battery or be unavailable due to being sent out for repair or updates (Cooper et al., 2009). AAC interventions incorporating backup strategies for the user to employ if the high-tech communication device breaks or is unavailable, or if the communication partner does not understand the communicative act by the user (Calculator, 1988) would increase communicative opportunities for the person who uses AAC. Teaching the AAC user to use picture symbols, gestures, or other means of aiding the communication partner in understanding the message (Halle et al., 2004) would increase generalization of the specific target behaviors across novel contexts and communication partners. This in turn could increase maintenance of the target skills over time since the user has multiple means of conveying their message to others, thus reinforcing the behavior with natural consequences outside of the clinical setting and with novel communication partners. Future research on increasing response variability and use of alternate methods of communication are warranted.

Maintenance

When planning maintenance probes in coordination with school calendars, planning for maintenance data collection further out than 1 to 2 weeks after an intervention ends would help researchers and clinicians make decisions about effective interventions for lasting behavior change. Just as teachers use maintenance data over fall and winter breaks to assess whether students require extended school year services to maintain skills gained during the school year, researchers could probe maintenance after students return from holiday breaks.

With increased use, ease of access, and prevalence of technological communication systems, researchers can check in with AAC users and stakeholders without much time and cost investment. Researchers can check in via email, video teleconferencing software, text message, or direct message through social media with links to online surveys or conduct a telehealth-type consultation for data collection. Surveys or interviews could also be used to inform future intervention development as well as assist researchers with decision rules to re-teach the target skill or modify the original intervention for re-teaching.

The use of videoconferencing software such as Zoom for Healthcare is an emerging area within the evidence base made all the more important by recent school and clinic closures due to the global pandemic. Telehealth intervention delivery can address chronic shortages of trained and skilled professionals, provide services to families living in remote areas or who cannot travel, and increase access to evidence based practices by both families and practitioners. Clinicians and researchers can use telehealth technology to provide direct services to individuals with complex communication needs, train parents/caregivers, and train practitioners to deliver evidence-based practices. With the increasing reliance on telehealth technology due to the COVID-19 global pandemic, rigorously designed research studies exploring distance technology for direct service provision and communication partner intervention delivery is both warranted and sorely needed. Research into reducing disparities in access and technology exacerbated by telehealth is also needed: reaching rural, remote, and underresourced individuals with CCN remains an area of need for further research and professional development for practitioners.

Vocabulary

AAC vocabulary is often selected by adults, particularly parents and professionals (Trembath et al., 2007), who carefully curate frequently used or socially appropriate vocabulary for the AAC user, potentially without considering what the user’s peers are actually saying to each other. Observing the AAC user’s same age peers and collecting language samples in a variety of contexts and with a variety of conversation partners (teachers, parents, friends) would yield more ecologically valid communication targets and vocabulary (Schwartz & Baer, 1991). Incorporating socially valid but not necessarily school appropriate vocabulary (Ashby et al., 2016) such as potty-themed jokes for a kindergartner, masturbation terms for a high schooler, or alcoholic beverages for an adult is often ignored by parents and professionals who may find discussion of these topics difficult and uncomfortable. These are topics the user may want to access to increase social interactions and participation with peers (Bryen, 2009; Smith, 2015).

Limitations and Future Directions

This mega-review has several limitations, primarily that reviews of reviews have the inherent weakness of relying on information provided about individual studies within the review. Authors of prior reviews may have poorly defined their inclusion and exclusion criteria, failed to correctly extract data from the articles, or may have inadequately analyzed and reported on the findings across the reviewed studies. Reviews were not excluded from analysis based on quality and summarized findings include poorly and well-designed studies. Well-designed and highly rigorous literature reviews should be positioned as potentially more inclusive of the literature and their evidence claims more heavily weighted. Researchers and practitioners benefit from having an assessment of the quality of the methods of reviews before making research-based decisions.

Additionally, this mega-review excluded grey literature, dissertations, articles written in a language other than English, and articles in journals and databases not accessible through those searched by the author. Articles identified during electronic database searches that were not located in full manuscript format through the university library systems, interlibrary loan, or Google Scholar® searches were also excluded. A further limitation of electronic searches was the use of free term keywords rather than each database’s recognized thesaurus search terms. This may have resulted in omission of reviews that met inclusion criteria due to differences in indexing keywords, although unlikely given the large number of documents acquired in the original electronic search and the final number of reviews that were found and included. Exact overlap in reviewed articles across included literature reviews was not assessed, therefore results could have been inflated by potential double counting of article variables or author-reported themes regarding effectiveness. A further limitation is the possibility that literature reviews and meta-analyses pertaining to aided AAC intervention with children with developmental disabilities were overlooked and not included in this review; however, the use of hand searches and ancestral searches minimized this risk.

Due to the preponderance of single-case research design studies completed in this area and few randomized control trial or group design studies analyzed in this mega-review, we were limited in our ability to compare interventions, AAC devices and programs, and dependent variables such as participant age, race, gender, disability, setting of intervention, interventionist, and dosage of intervention. Effect sizes reported by meta-analyses and systematic reviews were not comparable due to measuring different dependent variables, using different measurement systems, and reporting non-standardized effect sizes. We could not compare different designs or report intervention effects across reviews. Researchers conducting meta-analyses could use multilevel modeling and report design-comparable standardized effect sizes to facilitate comparisons across single case studies and group designs. Due to the inability to combine and compare effect sizes across designs and reviews, this mega-review only reported the conclusions made by review authors.

Finally, the inability to evaluate the articles for publication bias might be considered a limitation. Selection of studies for publication is possibly skewed toward studies with positive results, particularly large effect sizes, resulting in omission of nonsignificant results and studies with poor outcomes from inclusion in meta-analyses (Hedges, 1984; Shadish et al., 2014). While single case designs are increasingly reporting statistical effect sizes, decisions on whether to publish studies demonstrating strong functional relations and not publishing weak or no effects could result in publication bias (Shadish et al., 2014). Tests for publication bias assess whether studies with low precision have different effects than studies with high: this does not actually assess publication bias. Statistical tools cannot be used to evaluate publication bias for the reviews included in the present mega-review because the included reviews focused on different types of interventions and had different dependent variables that were impossible to aggregate using current statistical measures and raw data were not provided in the reviews, which precluded generating standardized data to calculate effect sizes.

Future Directions

There are many areas in which researchers can add to the knowledge base on aided AAC interventions for individuals with developmental disabilities, including comparing interventions, programming for generalization throughout intervention phases, assessing maintenance at longer intervals, using natural change agents and settings, and reporting between case effect sizes. Another future direction for the field is to compose an evidence and gap map to synthesize systematic reviews and primary intervention studies to summarize the entire field of AAC research (Saran & White, 2018).

Establishing evidence-based practices is another future direction for the field. Synthesizing well-designed and methodologically rigorous studies, applying criteria for evidence-based practices, and disseminating this information across researchers, clinicians, practitioners, and consumers is are areas of need. Many AAC interventions are emerging or promising practices, but further efforts to establish evidence-based practices will improve quality of life outcomes for consumer and clinician decision-making and increase communication access for individuals with complex communication needs. AAC as an intervention practice was recently established as an evidence based practice by the National Clearinghouse on Autism Evidence and Practice Review Team (Steinbrunner et al., 2020), but further investigation into specific AAC interventions for individuals with autism spectrum disorder and AAC as an evidence-based practice for individuals with other categories of human difference is warranted.

Conclusion

This mega-review highlights steps toward improving the quality of procedures and information provided by systematic literature reviews and meta-analyses and suggests several areas of future research for interventions for children with developmental disabilities and complex communication needs who use aided augmentative and alternative communication. Increasing generalization and maintenance programming before intervention begins, reporting out greater detail on participant demographics and skills, and ensuring that lasting, socially important behavior change takes place will improve quality of life for individuals who use AAC.