Introduction

Searching for molecules related to human cognition is instrumental for understanding the biological mechanisms related to such complex traits as memory capacity and for identifying pathways possibly amenable to pharmacological interventions. Recent advances in the development of high-density genotyping platforms allow for high-resolution genome-wide association studies (GWAS) of genetically complex traits and have already led to a substantial increase in knowledge of the genetic underpinnings of physiological and pathological conditions of human cognition.1 Such studies can be performed using both pooled and individual DNA samples.2 Pooled GWAS followed by individual genotyping of the most significant variants of the DNA pools are a cost-effective way to perform genome-wide surveys in large cohorts. Although pooled GWAS may be a worthwhile and fast approach as a preliminary screen, they lack the ability of retrospectively stratifying the genotyped pooled cohort by secondary phenotypic traits and control variables.2

Here we performed a GWAS in individual DNA samples in a homogenous cohort of Swiss healthy young adults (n=1198) assessed for verbal episodic memory performance, as quantified by an unexpected delayed free-recall test of 30 previously learned words (see Materials and methods). Single-nucleotide polymorphisms (SNPs) surpassing genome-wide correction for multiple comparisons were analyzed in an independent sample of healthy young adults from Serbia (n=524), who were also characterized for verbal episodic memory performance. To further validate the genetic findings of the behavioral studies we analyzed data from studies on gene expression in human post-mortem brain tissue and also used functional magnetic resonance imaging (fMRI), which can detect genotype-dependent differences in brain activations during cognitive tasks.3 We show that variants of CTNNBL1 (encoding beta-catenin-like protein 1) are consistently associated with episodic memory performance, with mRNA expression levels of CTNNBL1 in the human cortex, and with activation in episodic memory-related brain regions.

Materials and methods

Memory testing

Zurich sample (GWAS subsample 1): We recruited 709 healthy, young Swiss university students or age-matched employees/trainees (521 females, 188 males). Age was 21.9±0.1 years (mean±s.e.). After complete description of the study to the subjects, written informed consent was obtained. The ethics committee of the Canton of Zurich, Switzerland, approved the study protocol. Subjects viewed six series of five semantically unrelated nouns presented at a rate of one word per second with the instruction to learn the words for immediate free recall after each series. In addition, subjects underwent an unexpected delayed free-recall test of the learned words after 5 min (episodic memory). The number of correctly recalled words (hits) was the relevant output. Attention and concentration were assessed with the d2 cancellation test4 and working memory with the digit span task (forward)5 in a subset of the Zurich sample. In the d2 test, items consist of the letters d and p with one to four dashes, arranged either individually or in pairs above and below the letter. Subjects were instructed to scan, in a timed manner, across each letter line to identify and delete each d with two dashes. In the digit span test (forward), increasing numbers of orally presented digits at a rate of one per second had to be repeated in the same order. The test started with three digits. In every other trial the number of digits was increased by one. When errors in two consecutive trials were made, the test was ended and the number of correctly recalled trials was counted.

In addition to the behavioral tasks, all subjects were tested with a standardized mental health questionnaire for current and lifetime major psychiatric diseases, smoking and the use of any medication. Subjects also provided information on possible familiar aggregation of major psychiatric diseases, and female participants also gave information on the use of oral contraceptives and the actual menstrual cycle. None of these control variables affected the results presented herein. Blood was drawn by using 2 × 9 ml EDTA tubes (Sarstedt, Germany). Saliva was collected with the Oragene DNA sample collection kit (DNA Genotek, ON, Canada). DNA isolation was done according to standard protocols.

Basel sample (GWAS subsample 2): We recruited 489 healthy, young Swiss university students or age-matched employees/trainees (324 females, 165 males). Age was 22.5±0.1 years. After complete description of the study to the subjects, written informed consent was obtained. The ethics committee of the Canton of Basel, Switzerland, approved the study protocol. Subjects viewed six series of five semantically unrelated nouns presented at a rate of one word per second with the instruction to learn the words for immediate free recall after each series. The number of correctly recalled words (hits) after 5 min was the relevant output. All subjects were tested with a standardized mental health questionnaire for current and lifetime major psychiatric diseases, smoking and the use of any medication. In addition, subjects provided information on possible familiar aggregation of major psychiatric diseases, and female participants also gave information on the use of oral contraceptives and the actual menstrual cycle. In addition to the word-based episodic memory task, 872 subjects from GWAS samples 1 and 2 performed a picture-based episodic memory task (see this task's description in the fMRI experiment section). Saliva was collected with the Oragene DNA sample collection kit (DNA Genotek). DNA isolation was done according to standard protocols.

Serbian sample (replication sample): Study participants were 524 healthy young subjects (353 female and 171 male students of the School of Medicine of the University of Belgrade, age 20.3±0.04 years) who agreed to participate in a study of the genetic basis of human cognition and gave informed consent. Free recall of a short story narrative (immediate and 30 min delayed recall of items from Story A of the Logical Memory test; Wechsler Memory Scale-revised)5 was used to assess verbal memory performance. Working memory performance was assessed with the Digit Ordering Task,6 verbal fluency with a category (fruits and vegetables) verbal fluency task7 and visuospatial abilities with the Hooper Visual Organization Test.8 Subjects also completed a questionnaire related to general health, living circumstances, parental education and subjective learning capacity.

GWAS statistics, population structure and imputation

GWAS and the replication study were run under the assumption of an additive model. False discovery rate (FDR) and correction for 582 514 comparisons were used to adjust for genome-wide multiple testing. The FDR and corrected significance levels were set to 5%. Golden Helix SNP and Variation Suite 7 (SVS7, version 7.3.1; Golden Helix, Bozeman, MT, USA) and SPSS 17 were used for statistical analyses and visualization of results.

Population stratification was assessed with EIGENSTRAT9 by analyzing all genome-wide, array-based autosomal SNPs passing quality control (QC) criteria. Principal component analysis was first applied to reduce genetic variation to a few dimensions. For principal component analysis, default parameters were used (that is, definition of ten principal components in five iterations, outlier criterion was 6 s.d.). We also applied the Genomic Control program that is implemented in the EIGENSTRAT package to compute the inflation factor λ10 before and after removal of the individuals identified as outliers. In the total GWAS sample, λ was low (λ=1.017), while EIGENSTRAT identified 125 participants deviating from a large genetically homogenous population cluster. After outlier removal, λ indicated the absence of population stratification in both subsamples (λZurich=1.001, λBasel=1.000; Supplementary Figure 1).

All SNPs in the CTNNBL1 region reported in the latest release of the 1000 Genomes project (www.1000genomes.org) and in the CEU sample in HapMap Phase II were imputed and analyzed with IMPUTE,11 which aligns SNPs between haplotype and genotype databases on the basis of base-pair position. The quality criterion for inclusion and subsequent analysis of an imputed SNP was set to proper_info 0.5.11

Array-based SNP genotyping, SNP filtering

Samples were processed as described in the Genome-Wide Human SNP Nsp/Sty 6.0. User Guide (Affymetrix, Santa Clara, CA, USA). Briefly, genomic DNA concentration was determined by using a Nano-Drop ND-1000 and adjusted to 50 ng μl−1 in water. A quantity of 250 ng of DNA was digested in parallel with 10 units of StyI and NspI restriction enzymes (New England Biolabs, Beverly, MA, USA) for 2 h at 37 °C. Enzyme-specific adaptor oligonucleotides were then ligated onto the digested ends with T4 DNA ligase for 3 h at 16 °C. After adjustment to 100 μl with water, 10 μl of the diluted ligation reactions were subjected to PCR. Three PCR reactions of 100 μl were performed for Sty-digested products and four PCR reactions for Nsp. PCR was performed with Titanium Taq DNA Polymerase (Clontech, Mountain View, CA, USA) in the presence of 4.5 μM PCR primer 002 (Affymetrix), 350 μM each dNTP (Clontech), 1 M G-C Melt (Clontech), and 1 × Titanium Taq PCR buffer (Clontech). Cycling parameters were as follows: initial denaturation at 94 °C for 3 min, amplification at 94 °C for 30 s, 60 °C for 45 s and extension at 68 °C for 15 s repeated a total of 30 times, final extension at 68 °C for 7 min. Reactions were then verified to migrate at an average size between 200 and 1100 bp using 2% Tris-borate-EDTA (TBE) gel electrophoresis. PCR products were combined and purified with the Filter Bottom Plate (Seahorse Bioscience, North Billerica, MA, USA) using Agencourt Magnetic Beads (Beckman Coulter, Fullerton, CA, USA). Purified PCR products were quantified on a Zenith 200rt microplate reader (Anthos-Labtec, Cambridge, UK). A quantity of 4–5 μg μl−1 were obtained on average for each sample. From this stage on, the SNP Nsp/Sty 5.0/6.0 Assay Kit (Affymetrix) was used. Around 250 μg of purified PCR products were fragmented using 0.5 units of DNAse I at 37 °C for 35 min. Fragmentation of the products to an average size less than 180 bp was verified using 4% TBE gel electrophoresis. Following fragmentation, the DNA was end labeled with 105 units of terminal deoxynucleotidyl transferase at 37 °C for 4 h. The labeled DNA was then hybridized onto Genome-Wide Human SNP 6.0 Array at 50 °C for 18 h at 60 r.p.m. The hybridized array was washed, stained and scanned according to the manufacturer's (Affymetrix) instructions using Affymetrix GeneChip Command Console (version 3.0.1.1214). Generation of SNP calls and Array QC were performed using the command line programs of the Affymetrix Power Tools package (version: apt-1.12.0). According to the manufacturer's recommendation, contrast QC was chosen as QC metric, using the default value of greater or equal than 0.4. Mean call rate for all samples averaged 98.5%. All samples passing QC criteria were subsequently genotyped using the Birdseed (v2) algorithm. A total of 909 600 SNPs were genotyped and imported into Golden Helix SNP and Variation Suite 7 (version 7.3.0). For association testing, 19 162 markers with fewer than two alleles, 38 218 markers with call rate <0.95, 220 877 markers with minor allele frequency <0.05 and 113 918 markers with Hardy–Weinberg equilibrium P<0.05 were skipped leaving a total of 582 514 markers to be analyzed.

Singleplex SNP genotyping

Singleplex genotyping for rs16986890 was done with Pyrosequencing on a PyroMarkID System (QIAGEN, Hilden, Germany). Forward primer: 5′-TCCTGGATCTTGGCCTTTACAT-3′; reverse primer (5′-biotinylated): 5′-GTGCAACCAACATATTTGCATTT-3′; sequencing primer: 5′-TCTATTGTATCTGAGGGTTC-3′. The chip-based SNP_A-1909171 (rs16986890) was also genotyped by Pyrosequencing in the Zurich sample. The level of convergence between array- and pyrosequencing-based genotype calls was 100%.

Cortical gene expression

Data are based on the survey of genetic human cortical gene expression published by Myers et al.12 Gene expression studies of 178 samples from the cerebral cortex of neurpathologically normal brains were done with the Illunima (San Diego, CA, USA) HumanRefseq-8 Expression BeadChip. For genome-wide genotyping, the Affymetrix GeneChip Human Mapping 500 K Array Set was used. The complete data files were downloaded from http://labs.med.miami.edu/myers/. CTNNBL1 transcript probe was GI_29570786 (NM_030877.3), and expression levels of GI_29570786 were used as dependent variable.

fMRI experiment

Participants

A total of 322 healthy young subjects (mean age 22.5±0.2 years, range 18–34 years; 193 females; 290 AA genotype carriers of SNP rs16986890; 29 AG genotype carriers of SNP rs16986890, 3 GG genotype carriers of SNP rs16986890) were included in the study. All subjects were right-handed, free of any lifetime neurological or psychiatric illness, and did not take any medication at the time of the experiment. The experiments were approved by the ethics committee of the University of Basel. Written informed consent was obtained from all subjects prior to participation.

Picture task

Stimuli consisted of 72 pictures that were selected from the International Affective Picture System (IAPS)13 as well as from in-house standardized picture sets that allowed us to equate the pictures for visual complexity and content (for example, human presence). On the basis of normative valence scores (from 1 to 9), pictures were assigned to emotionally negative (2.3±0.6), emotionally neutral (5.0±0.3) and emotionally positive (7.6±0.4) conditions, resulting in 24 pictures for each emotional valence. Four additional pictures showing neutral objects were used to control for primacy and recency effects in memory. Two of these pictures were presented in the beginning and two at the end of the picture task. They were not included in the analysis. In addition, 24 scrambled pictures were used. The background of the scrambled pictures contained the color information of all pictures used in the experiment (except primacy and recency pictures), overlaid with a crystal and distortion filter (Adobe Photoshop CS3, Adobe Systems Inc., San Jose, CA, USA). In the foreground, a mostly transparent geometrical object (rectangle or ellipse of different sizes and orientations) was shown. Pictures were presented in the scanner using MR-compatible LCD goggles (Visuastim XGA, Resonance Technology, Los Angeles, CA, USA). Eye correction was used when necessary.

Encoding phase

The pictures were presented for 2.5 s in a quasi-randomized order so that at maximum four pictures of the same category occurred consecutively. A fixation-cross appeared on the screen for 500 ms before each picture presentation. Trials were separated by a variable intertrial period of 9–12 s (jitter) that was equally distributed for each stimulus category. During the intertrial period, participants subjectively rated the picture showing scenes according to valence (negative, neutral, positive) and arousal (large, medium, small) on a three-point scale (Self-Assessment Manikin) by pressing a button with a finger of their dominant hand. For scrambled pictures, participants rated form (vertical, symmetric or horizontal) and size (large, medium, small) of the geometrical object in the foreground. The encoding phase of the picture had a total duration of 22 min. Participants were not told that they had to remember the pictures for later recall. Participants were instructed and trained on the picture task before being positioned in the scanner. Training consisted of presentation and rating of five pictures including scenes and scrambled pictures, which were not used during scanning. Free recall: 10 min after picture presentation, memory performance was tested using a free-recall task, which required participants to write down a short description of the previously seen pictures. Remembered primacy and recency pictures as well as training pictures were excluded from the analysis. No time limit was set for this task. Two trained investigators independently rated the descriptions for recall success (inter-rater reliability >99%).

Procedure

After receiving general information about the study and giving their informed consent, participants were instructed and then trained on the picture task they later performed in the scanner. After training, they were positioned in the scanner. The participants received earplugs and headphones to reduce scanner noise. Their head was fixated in the coil using small cushions, and they were told not to move their heads. Functional MR images were acquired during the performance of the picture task in two separate sessions (total scanning time 30 min). After finishing the tasks, participants left the scanner and were taken to a separate room for free recall of the pictures. Finally, participants filled out questionnaires, gave saliva for genotype analysis and were debriefed. The total length of the experimental procedure was 3 h. Participants received 25 Swiss Francs (CHF) per hour for participation.

fMRI data acquisition and processing

Measurements were performed on a Siemens (Munich, Germany) Magnetom Verio 3T whole-body MR unit equipped with a twelve-channel head coil. Functional time series were acquired with a single-shot echo-planar sequence using parallel imaging (GRAPPA). We used the following acquisition parameters: TE (echo time)=35 ms, FOV (field of view)=22 cm, acquisition matrix=80 × 80, interpolated to 128 × 128, voxel size: 2.75 × 2.75 × 4 mm3, GRAPPA acceleration factor R=2.0. Using a midsaggital scout image, 32 contiguous axial slices were placed along the anterior–posterior commissure plane covering the entire brain with a TR=3000 ms (α=82°). The first two acquisitions were discarded due to T1 saturation effects.

Preprocessing and data analysis was performed using SPM5 (Statistical Parametric Mapping, Wellcome Department of Cognitive Neurology, London, UK; http://www.fil.ion.ucl.ac.uk/spm/) implemented in Matlab 2008a (The Mathworks, Natick, MA, USA). Volumes were slice-time corrected to the first slice, realigned to the first acquired volume, normalized into standard stereotactic space (template provided by the Montreal Neurological Institute), and smoothed using a 8-mm full-width-at-half-maximum Gaussian kernel. A 128 s cut-off high-pass filter was added to the confound partition of the design matrix to account for low-frequency drifts, and a correction for intrinsic autocorrelations was included in the analysis. For each subject, evoked hemodynamic responses to event types were modeled with a delta (stick) function corresponding to presentation of each stimulus category (negative, positive, neutral and scrambled pictures, respectively) convolved with a canonical hemodynamic response function within the context of a general linear model. The pictures accounting for possible primacy and recency effects as well as button presses during valence and arousal ratings were modeled separately. In addition, six movement parameters from spatial realigning were included as regressors of no interest. The contrast between brain activity during encoding of pictures subsequently remembered versus pictures subsequently forgotten was calculated individually using a fixed-effects model (first-level analysis). The resulting contrast parameters were then used for genotype-dependent analyses in a random-effects model (second-level analysis). Specifically, we used a regression model to analyze gene–dose-dependent differences in brain activity (with the number of G-alleles as covariate) and two-sample t-tests to analyze genotype-dependent differences under the dominant model (GG vs [AG and AA]).

For genotype-independent analyses, we used a threshold of P<0.05, Family-Wise Error (FWE)-corrected for multiple comparisons in the whole brain. For genotype-dependent analyses, we used an uncorrected threshold of P<0.001 in a minimum number of 10 adjacent voxels (k=10) for an exploratory analysis of the whole brain.

Results

Genome-wide survey and replication study

All samples (n=1198) were processed on the Affymetrix Genome-Wide Human SNP Array 6.0 in one centralized microarray facility. After standard QC and correction for minor allele frequency and deviation from Hardy–Weinberg equilibrium (see Materials and methods), a total of 582 514 out of the 909 600 array SNPs were used for association analyses under an additive model. Analysis of genetic background in the GWAS sample showed that the level of population stratification was low (λ=1.017). Principal component analysis using all available autosomal array markers passing QC identified 125 participants deviating from one large population cluster (see Materials and methods, Supplementary Figure 1). Omission of these subjects from the analysis led to an ideal population structure (λ=1.000). Therefore, genome-wide association analyses were done with the remaining 1073 participants. CTNNBL1 marker SNP rs16986890 reached the highest level of statistical significance for association with verbal episodic memory and survived both FDR correction and correction for 582 514 multiple comparisons (Pnominal=7.7 × 10−8, PFDR=0.002, Pcorrected=0.046; Table 1; Figure 1; Supplementary Tables 1 and 4). The effect size (ES) of SNP rs16986890 was small and thereby within the expected range of small ESs observed in genetic studies of polygenic behavioral traits (Table 1). This SNP was additionally genotyped on a different, singleplex platform to prevent false interpretations due to possible array-related genotyping errors (see Materials and methods). The level of convergence between array- and singleplex-based genotype calls was 100%. Because the GWAS population consisted of two subsamples, each recruited independently in a different Swiss city (Zurich, n=625; Basel, n=448), we performed a post-hoc analysis to rule out possible recruitment location-related bias. The additive effect on episodic memory was observed in both subsamples (Table 1) and showed no significant SNP × sample interaction (P=0.9).

Table 1 Genotype-dependent performance in word-based episodic memory tasksa in the GWAS sample (Switzerland, n=1073) and the replication sample (Serbia, n=524)
Figure 1
figure 1

Genome-wide association results. (a) Genome-wide significances (Y axis, −log10P) are shown in chromosomal order for individually genotyped SNPs that were tested for association with word-based episodic memory performance in 1073 healthy Swiss young adults. (b) Zoom-in of the association results (Y axis, −log10P) to a region of chromosome 20q11.23 harboring the CTNNBL1 locus. Chromosomal positions were retrieved from the March 2006 UCSC genome browser assembly. (c) LD structure (r2-values) of the chromosomal region shown in panel b as calculated in the entire sample of 1073 healthy Swiss young adults. Known transcripts in this region are visualized in the lower part. The red arrow indicates the position of rs16986890, which is located in the first intron of CTNNBL1.

PowerPoint slide

The significance of association rapidly decreased with increasing distance from the CTNNBL1 locus and did not extend to nearby loci (Figure 1; Supplementary Table 1). To better characterize the profile of the association signal we used data from the 1000 Genomes project (www.1000genomes.org) and reinvestigated this region using imputation, which allowed for analysis of virtually all common SNPs in this region and offered a sevenfold increase in marker density over the 6.0 array SNPs. Imputation analysis confirmed the initially observed pattern of association with highly significant intragenic SNPs and SNPs adjacent to CTNNBL1 and rapid decrease in significance with increasing distance from the CTNNBL1 locus (Supplementary Figure 2).

To study possible pleiotropic effects of rs16986890 on cognition, we additionally analyzed this SNP's effects on immediate recall, working memory and attention (see Materials and methods; Supplementary Table 2). No significant associations with the outcome of these cognitive tasks were observed. These findings suggest that the CTNNBL1 genotype–dependent differences in episodic memory processes were not biased by genotype effects on attention and short-term memory.

Quantile–quantile plotting and analysis further supported the robustness of the initial finding in the GWAS sample. Over 99.999% of the P-values distributed in accordance with random expectation, suggesting that the GWAS results are most likely to be attributable to true genetic variation rather than to potential bias, such as genotyping errors or population stratification (Supplementary Figure 3). The significant association between CTNNBL1 SNP rs16986890 and episodic memory performance was replicated in a sample of 524 healthy young adults from Belgrade, Serbia (P=0.003; additive model and direction of effect as in the GWAS sample; Table 1). Also in this sample, rs16986890 was not associated with performance in tasks quantifying working memory performance, verbal fluency and visuospatial abilities (all P-values >0.3; see Materials and methods; Supplementary Table 3). In addition to SNP rs16986890, the nine most significant SNPs of the Swiss sample, which were not in tight linkage disequilibrium (LD) (that is, r2<0.8) with rs16986890, were tested for replication in the Serbian sample (Supplementary Table 5). Four out of ten tested SNPs (including rs16986890) were nominally significantly associated with episodic memory also in the Serbian sample (Puncorrected<0.05; Supplementary Table 5).

Genetic association with cortical gene expression

SNP rs16986890 is located in the 5′ end of CTNNBL1 exon 1 and is tightly linked to 5′ untranslated region variants (Figure 1, Supplementary Figure 1). To study the possible relation of CTNNBL1 polymorphisms with mRNA expression levels, we examined the association between CTNNBL1 SNPs and cortical expression of the CTNNBL1 transcript GI_29570786 (NM_030877.3) in the brains of 178 non-demented deceased subjects (see Materials and methods). A subset of CTNNBL1 SNPs reported in our genetic study of human memory performance is also represented on the lower marker density 500 k SNP Array set, which was used in the study of cortical gene expression. SNP rs16986890 was not analyzed due to insufficient minor allele frequency (<0.05) in this sample. We therefore analyzed SNP rs7363432, which was the closest, tightly linked SNP (expectation-maximization (EM) r2=0.89) with minor allele frequency 0.05 (154 AA carriers, 23 AG carriers, 1 GG carrier; Hardy–Weinberg equilibrium, P=0.88). Comparison between genotype groups revealed significantly elevated levels of CTNNBL1 mRNA in AG compared with AA carriers (312.7±11.8 vs 289.2±4.0, respectively; T=2.1, d.f.=175, P=0.037; due to low number of observations (n=1) GG genotypes were omitted from statistical analysis). Thus, in the post-mortem gene expression sample, RNA expression levels of CTNNBL1 were increased in minor allele carriers of rs7363432. Interestingly, SNP rs7363432 was among the most significant SNPs in the Swiss GWAS sample (episodic memory, n=1073, P=0.0000015; Supplementary Table 1), with minor allele carriers showing better memory performance.

Genetic variability within CTNNBL1 is linked to human brain function

The above-described behavioral genetics experiments used episodic memory tests, which consisted of delayed free recall of verbal information. In 872 Swiss participants (a subsample of the above-described GWAS sample), an additional picture-based episodic memory task was performed. In this sample, CTNNBL1 SNP rs16986890 was also associated with performance in this picture-based memory task, albeit with lower ES compared with the word-based task. Specifically, we observed a significant additive effect of rs16986890 with a positive correlation between the number of minor alleles and performance during free recall of pictures (P=0.027; ES: d=0.15, r2=0.006). In 322 subjects, this picture-based episodic memory task was performed during fMRI. This allowed us to investigate the impact of CTNNBL1 genetic variability on activity of memory-related brain regions. These regions were identified by comparing activity during encoding of subsequently remembered versus subsequently forgotten pictures (that is, subsequent memory analysis). The genotype-independent subsequent memory analysis revealed several memory-related brain regions (Supplementary Figure 4), including prefrontal and parietal regions as well as the hippocampus and parahippocampal gyrus—brain regions known to have an important role in episodic memory.14, 15

In the fMRI sample, there were 290 AA carriers, 29 AG carriers and 3 GG carriers of CTNNBL1 SNP rs16986890. As expected due to the lower number of subjects in the fMRI sample (n=322) than in the behavioral genetics sample (n=872), we did not observe significant (P>0.05) genotype-dependent differences on the behavioral level. Genotype-dependent, subsequent memory analyses of the fMRI data revealed significant, whole-brain FDR-corrected, gene dose-dependent (with increasing number of G-alleles) increases in activity in the parahippocampal gyrus (peak activation at [19 −30 −12]; Pwhole-brain FDR−corrected=0.04, Pnominal<0.001) and in the medial frontal gyrus/anterior cingulate (peak activations at [−14 55 0]; [−30 47 −12]; [22 44 −4]; Pwhole-brain FDR−corrected0.03, Pnominal<0.001) (Figure 2). There were no other significantly increased brain activations in this analysis and there were no supra-threshold activations when calculating the negative correlation (with decreasing number of G-alleles). Reanalysis under the dominant model (GG vs [AG and AA]) revealed similar findings.

Figure 2
figure 2

CTNNBL1 rs16986890 genotype-depended differences in memory-related brain activity (subsequent memory analysis, see main text) in 322 individuals. Displayed are gene dose-dependent (with increasing number of G-alleles) increases in activity in the parahippocampal gyrus (left figure, coronal section, peak activation at [19 −30 −12]; Pwhole-brain FDR−corrected=0.04, Pnominal< 0.001) and in the medial frontal gyrus/anterior cingulate (right figure, sagittal section, peak activation at [−14 55 0]; Pwhole-brain FDR−corrected=0.01, Pnominal<0.001). Activations were overlaid on sections of a T1-weighted magnetic resonance image of SPM5, displayed at an uncorrected threshold of P=0.001 and using color-coded t-values.

PowerPoint slide

Discussion

Here we present evidence suggesting that the gene encoding beta-catenin-like protein 1 (CTNNBL1) has a significant role in human episodic memory. Specifically, marker SNPs in the CTNNBL1 locus were associated with episodic memory performance after correction for multiple comparisons in a homogenous and carefully phenotyped sample of Swiss healthy young adults, and were replicated in an independent sample of healthy young adults recruited in Belgrade, Serbia. Furthermore, gene expression analysis in human post-mortem brain tissue showed that a CTNNBL1 marker associated with better episodic memory was also associated with higher mRNA levels of CTNNBL1 in the human cortex. In addition, fMRI experiments during episodic memory tasks revealed CTNNBL1 genotype-dependent differences in brain activations in regions typically involved in episodic memory processes. The significant marker SNPs are located either in intronic or in upstream, putatively regulatory regions of CTNNBL1, which also harbor the hypothetical LOC100287792.

Recent studies reporting associations of candidate genes with specific memory-related phenotypes observed medium to large ESs (for example, SNPs in BDNF,16 GRM3,17 ADRA2B18). The GWAS sample of 1073 participants, which was used for unbiased genome-wide screening in this study, was sufficiently powered also for the detection of small ES. Assuming a genetic marker with rather small ES (0.01r20.05),19 the power of our GWAS sample to detect associations at a genome-wide-corrected α-error probability (PFDR0.05) was 90%. The power of the replication sample (n=524) to detect associations with a similarly small ES at an α-error probability 0.05 was >95%. The actual ESs (r2) in this study varied between 0.017 and 0.038 (Table 1), hence they were within the expected range of small ESs observed in GWAS dealing with polygenic cognitive traits.

CTNNBL1, which was identified and cloned recently, spans over 178 kb on human chromosome 20q11.23 (UCSC Genome Browser; GRCh36/hg18 assembly) and contains 16 exons. The encoded protein (beta-catenin-like protein 1) is a widely expressed nuclear protein and is highly conserved in mammals (96% amino acid identity with the mouse ortholog). Its biological function is not well understood, however, it seems to be a pleiotropic protein involved in such important processes as apoptosis and antibody diversification, and is part of the spliceosomal complex.20 Moreover, beta-catenin-like protein 1 shares significant structural similarity with the armadillo repeat family of proteins, including Drosophila melanogaster armadillo protein and its mammalian ortholog beta-catenin. Beta-catenin is a pleitropic adherens junction protein, which regulates numerous cellular processes and also functions as a component of the Wnt-signaling pathway. Beta-catenin regulates numerous neuronal functions; for example, it is a mediator of dendritic development through enhancing dendritic arborization.21 In rodents, beta-catenin is required for memory consolidation.22 Interestingly, the results reported herein show that CTNNBL1 genotypes associated with better episodic memory performance were also associated with increased brain activation during encoding of information and with increased mRNA levels of CTNNBL1 in the human brain. It is tempting to speculate that CTNNBL1 acts on human memory via beta-catenin-related mechanisms. However, it is hitherto completely unclear, whether beta-catenin-like protein 1 shares any similar functions with beta-catenin, especially because both proteins show only structural, rather than sequence similarity.

Taken together, the present high-density GWAS followed by a replication study and supported by gene expression data and fMRI experiments suggests a role for the beta-catenin-like protein 1 in human episodic memory. These results demonstrate the usefulness of unbiased genome-wide approaches and their potential to identify important and novel molecular pathways related to human cognition.