ABCA7 p.G215S as potential protective factor for Alzheimer's disease

e 0.80). Notably, our results are not driven by an enrichment of loss of function variants in ABCA7 , recently reported as main pathogenic factor underlying AD risk at this locus. In summary, our study con ﬁ rms the role of ABCA7 in AD and provides new insights that should address functional studies. (cid:1) 2016 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

a b s t r a c t Genome-wide association studies (GWASs) have been effective approaches to dissect common genetic variability underlying complex diseases in a systematic and unbiased way.Recently, GWASs have led to the discovery of over 20 susceptibility loci for Alzheimer's disease (AD).Despite the evidence showing the contribution of these loci to AD pathogenesis, their genetic architecture has not been extensively investigated, leaving the possibility that low frequency and rare coding variants may also occur and contribute to the risk of disease.We have used exome and genome sequencing data to analyze the single independent and joint effect of rare and low-frequency protein coding variants in 9 AD GWAS loci with the strongest effect sizes after APOE (BIN1, CLU, CR1, PICALM, MS4A6A, ABCA7, EPHA1, CD33, and CD2AP) in a cohort of 332 sporadic AD cases and 676 elderly controls of British and North-American ancestry.We identified coding variability in ABCA7 as contributing to AD risk.This locus harbors a low-frequency coding variant (p.G215S, rs72973581, minor allele frequency ¼ 4.3%) conferring a modest but statistically significant protection against AD (p-value ¼ 0.024, odds ratio ¼ 0.57, 95% confidence interval ¼ 0.41 e0.80).Notably, our results are not driven by an enrichment of loss of function variants in ABCA7, recently reported as main pathogenic factor underlying AD risk at this locus.In summary, our study confirms the role of ABCA7 in AD and provides new insights that should address functional studies.
Ó 2016 The Authors.Published by Elsevier Inc.This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

Introduction
Alzheimer's disease (AD) is the most common cause of progressive dementia in the elderly.Aging and genetic factors play a critical role for the disease development.Rare coding and fully penetrant mutations in APP, PSEN1, and PSEN2 explain part of the AD autosomal-dominant cases.On the other hand, APOE ε4 allele and rare coding variants in TREM2 represent the main risk factors for late-onset and apparently sporadic AD (Chartier-Harlin et al., 1994;Guerreiro et al., 2013).In the last 5 years, genome-wide association studies (GWASs) identified over 20 main risk loci influencing AD susceptibility (Harold et al., 2009;Hollingworth et al., 2011;Lambert et al., 2009Lambert et al., , 2013;;Naj et al., 2011;Seshadri et al., 2010).Among these, 9 have been replicated by at least 2 independent GWASs and present the strongest effect sizes after APOE (BIN1, CLU, CR1, PICALM, MS4A6A, ABCA7, EPHA1, CD33, and CD2AP).
GWAS arrays tag common, low penetrant, and generally noncoding variants that likely exert a subtle regulatory effect (0.8 < odds ratio [OR] < 1.5) on a trait, affecting gene expression, CpG islands methylation and splicing, in cis or trans (Ramasamy et al., 2014;Visscher et al., 2012).Whereas, low frequency (1% < minor allele frequency [MAF] < 5%) and rare variants (MAF < 1%) with a modest penetrance remain mostly undetected either because they are not in the array or because, even with the implementation of imputation, the detection of variants with MAF < 2% is not sufficiently accurate.As an illustrative example, APOE GWAS hit maps to an intronic region and it is likely driven by the APOE ε4 allele, which is a common coding haplotype (rs429358, p.C130R and rs7412, p.R176R, MAF ¼ 15%) in exon 4, that is not tagged by the custom genotyping arrays mostly used.
Therefore, we have used exome and genome sequencing data (1) to identify rare and low-frequency coding variants in BIN1, CLU, CR1, PICALM, MS4A6A, ABCA7, EPHA1, CD33, and CD2AP and (2) to investigate their single independent and combined effect on AD susceptibility.Both the single-variant and the gene-based association tests confirmed ABCA7 as susceptibility locus associated with AD.Importantly, although ABCA7 loss of function (LoF) mutations (indels, nonsense, and splice-site mutations) have been recently reported as main mechanism increasing AD risk at this locus (Steinberg et al., 2015), our results are not driven by such variants.Whereas, we report an enrichment for ABCA7 common and low-frequency coding variants with a potential protective effect, that is mainly responsible for our gene-based signal.Among these, ABCA7 p.G215S is the main low-frequency missense hit in the single-variant analysis in the discovery cohort.The potential protective role of this variant has been further confirmed in an independent European and North-American cohort.Our results show that ABCA7 p.G215S exerts a mild but statistically significant influence, lowering the risk for AD.Thus, confirming ABCA7 to be a good potential target to address functional studies.

Materials and methods
The discovery cohort was composed of 332 apparently sporadic AD cases and 676 elderly controls, neuropathologically and clinically confirmed, originating from the UK and North America.The mean age at disease onset was 71.66 years (range 41e94 years) for cases and the mean age of ascertainment was 78.15 years (range 60e102 years) for controls (Table 1).Most of the AD cases (77%) were late onset (>65 years at onset) (LOAD).
Among the cases and controls, 42% and 51% were female, respectively; 58% and 47% of the cases and controls carried the APOE ε4 allele, respectively.The APOE ε4 allele was significantly associated to the disease status in the National Institutes of Health (NIH) and Alzheimer's Disease Neuroimaging Initiative (ADNI) series (p-value ¼ 0.02 and 1.19 Â 10 À9 , respectively).The threshold call rate for inclusion of the subject in analysis was 95%.On this cohort, we performed (1) gene-based analysis (SNP-set Sequence Kernel Association Test [SKAT] and c-alpha tests) and (2) single-variant association analysis, targeting 23.5 kilobase pairs (Kbs) of coding sequence.Finally, we followed up, in an independent Caucasian data set, ABCA7 p. G215S, the only nominal significant low-frequency missense variant in the singlemarker association test in our discovery set (Fig. 1).
The follow-up data set was composed of 307 late-onset apparently sporadic AD cases from North America and Europe and 501 elderly Caucasian controls from North America (Coriell repositories), Europe, Australia, and Canada (Table 1).Written informed consent was obtained for each clinically assessed individual, and the study was approved by the appropriate institutional review boards.All samples had fully informed consent for retrieval and were authorized for ethically approved scientific investigation (UCLH Research Ethics Committee number 10/H0716/3, BYU IRB, Cardiff REC for Wales 08/MRE09/38þ5, REC Reference 04/Q2404/130, National Research Ethics Service).

Exome sequencing
DNA was extracted from blood or brain for cases and brain only for controls using standard protocols.Library preparation for next generation sequencing used DNA (between 1 mg and 3 mg) fragmented in a Covaris E210 (Covaris Inc).DNA was end-repaired by 5'phosphorylation, using the Klenow polymerase.A polyadenine tail was added to the 3'end of the phosphorylated fragment and ligated to Illumina adapters.After purification using an AMPure DNA Purification kit (Beckman Coulter, Inc), adapter-ligated products were amplified.The DNA library was then hybridized to an exome capture library (NimbleGen SeqCap EZ Exome v2.0, Roche Nimblegen Inc or TruSeq, Illumina Inc) and precipitated using streptavidin-coated magnetic beads (Dynal Magnetic Beads, Invitrogen).These exome libraries were polymerase chain reaction amplified and then DNA hybridized to paired-end flow cells using a cBot (Illumina, Inc) cluster generation system.Samples were sequenced on the Illumina HiSeq 2000 using 2 Â 100 paired-end reads cycles.

Whole genome sequencing
Genome sequencing was performed in 199 controls, from the Cache County Study on Memory in Aging.All samples were sequenced with the use of Illumina HiSeq technology.
Purified sequences were analyzed on an ABI 3730 DNA Analyzer (Applied Biosystems, CA, USA) and chromatograms were visualized in Sequencher software (version 4.2 Gene Codes Corporation, MI, USA).

Bioinformatics
Sequence alignment and variant calling were performed against the reference human genome (UCSC hg19).Alignment was performed with the use of CASAVA software, and variant calling was performed with the use of SAMtools (Li et al., 2009) and the Genome Analysis Toolkit (GATK) (McKenna et al., 2010).Paired-end sequence reads (2 Â 100 bp paired-end read cycles) were aligned using the Burrows-Wheeler aligner (Li and Durbin, 2009).Format conversion and indexing were performed with Picard (www.picard.sourceforge.net/index.shtml).GATK was used to recalibrate base quality scores, perform local realignments around indels, and to call and filter the variants (McKenna et al., 2010).VCFtools was used to annotate gene information for the remaining novel variants.We used ANNOVAR software to annotate the variants (Wang et al., 2010).Variants were checked against established databases (1000 Genomes Project and dbSNP v.134).Calling algorithms, pipelines, and reference panels were the same as the pooled data sets.The protein coding effects of variants were predicted using SIFT, Poly-phen2, and SeattleSeq Annotation (gvs.gs.washington.edu/SeattleSeqAnnotation).All variants within the coding regions of the 9 risk loci (ABCA7 ) have been collected and analyzed.Indels were excluded from the merged data set because they were not targeted in the ADNI subcohort (Fig. 1) (Further details are provided in the Supplementary Materials).

Statistical analysis
In the single-variant analysis, allele frequencies were calculated for each low frequency and rare coding variant in cases and controls, and Fisher's exact test on allelic association was performed.
To study the joint effect of the variants detected, we performed a gene-based analysis with SKAT and c-alpha test, and we analyzed together for each gene the whole spectrum of allelic variability (common, low frequency, rare, coding, and noncoding).
C-alpha test and SKAT are closely related, being both nonburden tests, analyzing and collapsing the effect of genetic variants of different frequency (common and rare), effect (protective, damaging, and neutral), and effect size (modest, moderate, and strong).SKAT can be considered an expansion of the c-alpha test because it overcomes some of its limits.Indeed, SKAT (1) can be applied also to the study of continuous traits and (2) does not need any permutation.
Low frequency and rare variants were defined as having a 1% < MAF < 5% and MAF <1%, respectively, either in cases or controls.
A p-value of 0.05 was set as a nominal significance threshold.Based on multiple testing correction, the thresholds for singlevariant and gene-based association tests are defined by p-value ¼ 1.25 Â 10 À3 (0.05/40 [total number of coding low frequency and rare variants detected in our study]) and 5.5 Â 10 À3 (0.05/9 genes), respectively.Furthermore, we excluded singletons from the singlevariant analysis because a variant observed only once is not largely informative about the overall distribution (Neale et al., 2011).However, we pooled the singletons together and analyzed their collective effect in the gene-based analysis (SKAT and c-alpha test).
In addition, we report the complete list of coding variants detected in these GWAS loci in the supplementary table (Supplementary Table 1).

Results
The discovery set consisted of a total of 332 sporadic and mainly late-onset AD cases and 676 elderly controls of British and North-American ancestry (Table 1).
A total of 289 single-nucleotide variants were identified.Among these, 128 (44.3%) were nonsynonymous, 72 (24.9%) synonymous, 83 (28.7%) were untranslated region (UTR) and 6 (2%) intronic variants.Among the missense variants, 99 (77.34%) were rare (MAF < 1%) and 72 of these (72.72%) were singletons (a variant observed only once either in cases or controls).Fifteen nonsynonymous variants (11.7%) were low frequency (1% < MAF < 5%) and 16 (12.5%)were common (MAF !5%).In addition, we report 14 novel coding variants (not reported in ExAC, released 13 January 2015, or dbSNP 137).None of the detected low frequency and rare coding variants clusters within common haplotypes (MAF !5%) and, therefore, could have been missed by GWASs and chip based finemapping approaches (Supplementary Tables 1 and 2).We report the presence of 2 or 3 low frequency and/or rare variants in the studied genes in the same individual, both for cases and controls (Supplementary Table 3a and b).Overall, the total variant frequency of the 9 GWAS loci in our study was in line with the one reported for the European-American cohort in the Exome Variant Server (http://evs.gs.washington.edu/EVS/).The only exception was represented by CR1, that showed a 2.7-fold higher relative frequency of total variants, compared to the Exome Variant Server database (Supplementary Table 4).
PICALM harbors the lowest burden of low frequency and rare coding variants (3.27 coding variants per kb of coding sequence).By contrast, CD33, presents the highest relative frequency of coding variants and the lowest relative frequency of damaging variants (9.14 and 0.91 coding and damaging variants per kb of coding sequence, respectively), suggesting that most coding variability in CD33 is likely nonfunctional (Supplementary Table 5).
BIN and ABCA7 display the highest relative proportion of damaging variants (3.92 [87.3% of its coding variability] and 3.72 [60% of its coding variability] damaging coding variants per kb of coding sequence, respectively), thus arguing for a potential functional impact of missense mutations at these loci (Supplementary Table 5).Moreover, ABCA7 was the only gene harboring nonsense mutations.
Most of the low frequency and rare coding variability identified within these loci exerts generally a relatively modest effect (mean OR ¼ 1.1) that is comparable to those observed for common and generally noncoding variants identified by GWASs (Supplementary Table 1).

Single coding variant association test
The main hits of the single variants association test map mainly to ABCA7 (Table 2).Particularly, we report ABCA7 p.G215S (rs72973581), that was the only low-frequency (MAF ¼ 4.3%) missense variant showing a trend toward significance in the singlemarker association test (p-value ¼ 0.02 and corrected p-value ¼ 0.8) in the discovery set and was statistically significant after Bonferroni correction (p-value ¼ 6 Â 10 À4 and corrected p-value ¼ 0.024) in the combined data sets (discovery set and follow-up data set).
Rs72973581 [A] results in a glycine to serine amino acid change at the position 215 of ATP-binding cassette subfamily A member 7 (ABCA7; G215S) and its frequency was 1.56-fold higher in controls compared to cases (MAF ¼ 4.66% and 7.24% for cases and controls, respectively), arguing for a protective effect (OR ¼ 0.6, 95% confidence interval [CI] ¼ 0.38e0.95).This variant was present in homozygosity in one control.The study possessed relatively low power to detect a significant association between cases and controls for low frequency and rare variants.Therefore, we have followed up ABCA7 p.G215S, carrying out Sanger sequencing in an independent data set composed of 307 Caucasian late-onset AD cases and 501 elderly Caucasian controls (p-value ¼ 0.012; OR ¼ 0.54, 95% CI 0.31e0.89).In this follow-up data set, we confirmed a higher frequency of the ABCA7 p.G215S variant in controls compared to cases (carrier frequency ¼ 13.5% vs. 7.8% [1.7-fold] and MAF ¼ 7% vs. 4.3% [1.6-fold], respectively).

LoF mutations in ABCA7
LoF mutations in ABCA7 have been recently reported as main mechanism explaining the GWAS signal and the increased susceptibility to AD.
Importantly, the enrichment for ABCA7 LoF mutations and novel variants in controls did not rely on the sequencing strategy (exome sequencing vs. genome sequencing).These variants have not been indeed mainly detected in the 199 BYU controls that underwent genome sequencing (Supplementary Table 8).
Moreover, 3 very rare indels have been identified in controls in the NIH-UCL cohort (p.1402delT, p.1638delCTT, and p.1749delC-TACTG).ABCA7 p.1749delCTACTG is a novel mutation, and ABCA7 p.1402delT was also present in one case.These indels have been excluded from the pooled data set because they are not targeted in the ADNI subcohort (Supplementary Table 9; Fig. 1).
Finally, 3 nonsense mutations (p.W749X, p.W903X, and p.R1754X) and one splice-site mutation, (c.4416þ2T>G) did not pass either the sample or variant quality control (QC) criteria and, therefore, have not been included in the study (Supplementary Table 10).

Gene-based association test
In addition to single-marker analysis, we carried out gene-wide analysis to combine the joint signal from multiple variants (coding variants and flanking UTRs) within a gene and to provide greater statistical power than that for single-marker tests.All the variants (nonsynonymous, synonymous, UTRs, and singletons) located within the studied genes and their exon-intron flanking regions were collapsed together and their combined effect was studied.ABCA7 was the main hit both in the SKAT and c-alpha test, nominally and statistically significant, respectively, after multiple testing correction (corrected p-value ¼ 0.6 and 5.3 Â 10 À3 , respectively) (Tables 3 and 4).Importantly, given the exclusion of indels in the merged discovery data set, the presence of only 2 nonsense singleton mutations in ABCA7 (p.Y1579X and p.E1974X, both detected in controls) and 4 putative splice-site mutations (rs3752229, rs2242437, c.231e12C>A, and rs182233998, the latter one nominally significant in controls), our findings are not influenced by a burden of LoF mutations in ABCA7.Considering the very rare frequency of these LoF variants, their detection with sufficient power would have required a very large sample size.By contrast, the top signals are represented mainly by common and lowfrequency coding variants with a higher frequency in controls compared to cases and with a modest to intermediate protective effect (0.329 < OR < 0.755) (Supplementary Table 6).

Discussion
We report the results of single-variant and gene-based association tests performed in BIN1, CLU, CR1, PICALM, MS4A6A, ABCA7, EPHA1, CD33, and CD2AP in a cohort composed of 332 apparently sporadic and mainly late-onset AD cases and 676 elderly Caucasian controls from North America and the UK.In the single-variant association test, we have analyzed the effect of low frequency and rare coding variants (MAF < 5%), aiming to identify potential functional variant(s) underlying the GWAS hit(s).In the gene-based analysis (SKAT and c-alpha test), we collapsed the full spectrum of variants identified in these loci to study their collective effect.
We do not report any pathogenic mutation in APP, PSEN1, and PSEN2 in our cohort.However, one of the controls was a heterozygous carrier of the protective variant APP p.A673T (MAF 7 Â 10 À4 in our cohort and MAF 5 Â 10 À4 among the European non-Finnish, ExAC database, released 13 January, 2015) (Jonsson et al., 2012).
TREM2 p.R47H, the second most common risk factor for sporadic AD, has been detected in 6 cases (1.8%) and 4 controls (0.59%) and, likely given our small sample size, with a MAF ¼ 0.2%, was not significantly associated to AD (p-value ¼ 0.09).
ABCA7 was the only significant hit in the c-alpha test and harbors a low-frequency coding variant (p.G215S, rs72973581), whose minor allele confers a modest (OR ¼ 0.57, 95% CI ¼ 0.41e0.80)but statistically significant protection (corrected p-value ¼0.024) against AD.Importantly, this single-nucleotide polymorphism (SNP) was not present in several GWAS or exome SNP arrays and does not cluster within common haplotypes identified by tagging SNPs, whereas it has been detected through ABCA7 direct sequencing in the present study.Therefore, rs72973581 would have stayed likely undetected using common fine-mapping genotyping arrays.In addition, it does not cluster in the risk haplotypes identified by GWAS main hits (rs3764650, rs115550680, rs3752246, and rs4147929), suggesting an independent signal and a likely different pathogenic mechanism of the major allele (Hollingworth et al., 2011;Liu et al., 2014;Naj et al., 2011;Reitz et al., 2013).Importantly, the ABCA7 p.G215S significant protective role against AD is supported by a targeted resequencing study of ABCA7 in a Belgian cohort, where rs72973581 (A) frequency was 1.34-fold higher in controls compared to cases (p-value ¼ 0.055) (Cuyvers et al., 2015).Notably, the main variant associated to LOAD in this Belgian cohort was a low-frequency intronic variant (rs78117248) that did not pass our QC filter.However, in line with our findings, Cuyvers et al. report an enrichment for common and low-frequency polymorphism with a modest protective role in ABCA7.Importantly, among the top 10 genetic variants identified in our study, 3 missense mutations (rs74176364, rs114782266, and rs117187003)  have been described associated also to autism spectrum disorder, strongly pointing toward a functional role of these amino acid changes and suggesting a possible shared pathogenic mechanisms underpinning neurodegenerative and neurodevelopmental diseases (He et al., 2014).
Interestingly, several lines of evidence reported that a significant decrease in ABCA7 levels is associated to AD.At this regard, different and likely not mutually exclusive mechanisms have been described to influence the protein level: (1) common and generally noncoding variants in regulatory regions; (2) alternative splicing; (3) increased CpG island methylation (Humphries et al., 2015;Vasquez et al., 2013).Recently, also LoF mutations in ABCA7 have been shown to significantly increase the susceptibility to AD in the Islandic population (Steinberg et al., 2015).This has been replicated in 2 different populations (Caucasian North American and Belgian) by 2 independent studies (Cuyvers et al., 2015;Vardarajan et al., 2015).Therefore, we report another potential mechanism, through which low-frequency protein coding variability in ABCA7 may influence AD risk.
ABCA7 is mainly expressed in leukocytes and in myelolymphatic tissues (thymus, spleen, and bone marrow) and microglia in the brain (http://web.stanford.edu/group/barres_lab/brain_rnaseq.html;http://www.uniprot.org/;Kim et al., 2008).ABCA7 encodes for ABCA7, a multi-pass protein, present on the cell, Golgi, and endosome membranes (http://www.uniprot.org/).In vitro and in vivo experiments have shown ABCA7 pivotal role in phagocytosis and a likely modest role in high-density lipoprotein biogenesis.In Abca7 À/À mice, macrophages and microglia display impaired phagocytosis and clearance of amyloid from the brain, which leads to cognitive impairment (Iwamoto et al., 2006;Tanaka et al., 2011).
Therefore, ABCA7, likewise TREM2 and CD33, may play an important role in regulating microglial uptake and clearance of amyloid-beta debris.
ABCA7 p.G215S clusters within the extracellular topological domain of ABCA7.Remarkably, at the homologous residue, the serine is the reference amino acid in ABCA7 in different mammals and in the homologous protein ABCA1 in humans.Thus, suggesting that this amino acid may confer some biologic advantage and may have been positively selected during the evolution (Supplementary Figs. 1 and 2).
Likewise other low-frequency and rare protective variants at the GWAS loci (Supplementary Table 11), ABCA7 p.G215 is a relatively conserved residue among different species (Supplementary Fig. 1) and this amino acid change (glycine to serine) may only slightly modify the protein activity (À5.86, 56, Gerd and Grantham score, respectively).Moreover, it has been reported as a tolerated change and benign, arguing against any possible LoF or significant impairment of ABCA7, that has been indeed associated to increased risk for AD (Steinberg et al., 2015).The biological effect of this substitution may therefore lead to a mild ABCA7 gain of function, possibly strengthening the interaction with a binding protein or regulating its expression.Although ABCA7 p.G215 has not been predicted to be a coding target for miRNA (https://www.umm.uniheidelberg.de),a possible posttranscriptional or posttranslational regulation should not be excluded.Importantly, the substitution of a glycine with a serine may imply an additional substrate for serinekinases or proteases.Moreover, in a similar way, ABCA1, whose LoF variants have been associated to AD (Kim et al., 2012;Nordestgaard et al., 2015), has been reported to be particularly enriched for low frequency and rare coding variants with an average 1.5-fold higher frequency in controls compared to LOAD cases and a modest protective effect in a Greek cohort (OR ¼ 0.96e0.38;Lupton et al., 2014;Supplementary Table 12).
Thus, understanding the effect of ABCA7 p.G215S has the potential of unraveling new pathogenic mechanisms underpinning AD and may provide a promising therapeutic target that would not significantly alter ABCA7 overall physiological function, which is critical for AD development.
Finally, we support the resequencing study of the GWAS loci by Vardarajan et al. (2015), confirming a burden of damaging variants in ABCA7 and BIN1 (Supplementary Table 5) and to a lesser extent in CD2AP, EPHA1, and MS4A6A (main hits in the gene-based analysis; Tables 3 and 4), highlighting their potential role as susceptibility loci for LOAD.
However, we could not replicate the main hits detected by Vardarajan et al. in the single-variant analysis, either because such variants have been targeted but not detected in our cohort (ABCA7 p.E1679X, EPHA1 p.P460L, and BIN1 p.K358R) or because the variants have been targeted but eliminated by the QC filter (CD2AP p.K633R).Thus, suggesting a possible lack of replication compared to the previous studies attributable to the different population but also different sequencing strategies, capture, and coverage.Nevertheless, EPHA1 and CD2AP harbor 2 of the main hits detected in the single-variant analysis in our cohort (rs11768549 and rs143297472, respectively) (Table 2), with rs11768549 already associated with the rapid progression of the disease in a cohort of Caucasian North American LOAD cases (Wang et al., 2015).
In summary, we support previous studies, suggesting that (1) ABCA7 significantly influences AD risk; (2) ABCA7 p.G215S is likely to reduce the susceptibility to AD; (3) GWAS hits are pleomorphic loci harboring a complex spectrum of variants synergistically contributing to the disease phenotype with different mechanisms, effects (damaging, protective, and neutral), and effect sizes (0 < OR < 4); and (4) gene-based approaches are effective methods to mine genetic data and to accurately filter potential candidate genes.

Table 1
Description of the different cohorts used in this study : cDNA, complementary DNA; CI, confidence interval; Corr, corrected p-value, p-value after Bonferroni correction (p-value* 40 [number of variants considered in the single-variant association test]); ExAC, Exome Aggregation Consortium; MA, minor allele; MAF, minor allele frequency; ExAC, Exome Aggregation Consortium; OR, odds ratio.

Table 3
Results from the c-alpha test performed Corrected p-value, p-value after Bonferroni correction (p-value * 9 [number of genes considered in the single gene-based analysis]).

Table 4
Results from the SKAT test performed Position is in hg19/GRCh37.Corrected p-value, p-value after Bonferroni correction (p-value * 9 [number of genes considered in the single gene-based analysis]).