Human longevity: 25 genetic loci associated in 389,166 UK biobank participants

We undertook a genome-wide association study (GWAS) of parental longevity in European descent UK Biobank participants. For combined mothers' and fathers' attained age, 10 loci were associated (p<5*10−8), including 8 previously identified for traits including survival, Alzheimer's and cardiovascular disease. Of these, 4 were also associated with longest 10% survival (mother's age ≥90 years, father's ≥87 years), with 2 additional associations including MC2R intronic variants (coding for the adrenocorticotropic hormone receptor). Mother's age at death was associated with 3 additional loci (2 linked to autoimmune conditions), and 8 for fathers only. An attained age genetic risk score associated with parental survival in the US Health and Retirement Study and the Wisconsin Longitudinal Study and with having a centenarian parent (n=1,181) in UK Biobank. The results suggest that human longevity is highly polygenic with prominent roles for loci likely involved in cellular senescence and inflammation, plus lipid metabolism and cardiovascular conditions. There may also be gender specific routes to longevity.


INTRODUCTION
For centuries there has been great interest in factors enabling some individuals to live to very old ages. Extreme longevity can be achieved in animal models by knocking out many specific genes [1]. In humans there is a growing number of inherited (germline) genetic variants linked to overall [2] and longer human lifespan [3,4] in conventional genome wide association studies (GWAS), and in studies weighted for the large numbers of previously reported variant associations with related diseases and traits [5,6].

AGING
Human longevity or lifespan has moderate heritability in twin studies, estimated at 20-30% [7], with health related behaviors, environmental exposures and chance likely explaining much of the remaining variability. GWAS have mostly studied general survival or compared longer lived individuals (e.g. aged 80+, 90+ or centenarians) to younger controls. An alternative approach to studying older individuals themselves is to study parental longevity. Biological offspring of longer lived parents inherit a combination of parental genetic variants, and tend to remain healthier and live longer than the offspring of shorter lived parents [8]. In the US Health and Retirement Study (HRS), we found that mortality in offspring declined progressively with later parental ages of death [9]. In addition, offspring of longer lived parents had progressively lower incidence of cardiovascular disease and cancers, plus reduced rates of cognitive impairment [10]. These findings were replicated in 186,151 non-adopted UK Biobank participants [11]. Statistical associations of mothers' and fathers' attained ages with offspring health outcomes in HRS were additive. These findings suggest that combined parental longevity is a quantitative trait of slower aging, likely with a significant genetic component.
We [12] and separately Joshi et al. [13] previously undertook a GWAS of parental longevity in the first one third UK Biobank (UKB) participants aged 55-70 [12]. We determined the percent of variance of parental age at death explained by the directly genotyped variants on the UKB arrays (n=845,997): the variance attributed was greatest for the sum of normalized mother's and father's age at death (10.2%, SD=1.26%) and smaller for mothers (6.08% SD=1.21%) and fathers (5.79% SD=1.23%) separately.
In the current analysis we aimed to extend our previous GWAS to include data from the newly available fully genotyped UKB cohort, essentially tripling our previous sample size. We aimed to identify all common genetic variants (prevalence ≥0.1%) associated with longer parental lifespan. We studied combined parents' attained ages but also tested associations with both parents having longest 10% survival, to establish whether variants were associated with long life rather than early death. We also examined mothers' and fathers' age of death separately. We also tested a combined genetic risk score of our findings in two US aging cohorts, the HRS and the Wisconsin Longitudinal Study (WLS). (Table 1) were 389,166 UK Biobank participants of European descent with data recorded on parents' current ages or parents' ages of death (n=181,048 had one or more living parents). GWAS analyses had >99% power to detect an allele of 1% minor allele frequency, accounting for 0.1% of the variance in the phenotype, after correction for multiple statistical testing (alpha=5x10 -8 ). Variants with minor allele frequencies of >0.1 were included.

GWAS of parental attained age
We examined associations with both parents' combined attained ages (including ages of living parents), using Martingale residuals from Cox's proportional hazards regression models, following the approach used by Joshi et al. [13]. In this analysis we found 10 loci associated at genome wide significance (p<5x10 -8 ) ( Associations with APOE related rs429358 (p=5x10 -22 ) variant were confirmed, as were associations with the CHRNA3/5 (nicotinic acetylcholine receptor) locus (rs1317286, 33.2% minor allele frequency, p=1.2x10 -26 ). Associations were also found for six other loci already implicated in disease specific GWAS; the strongest association was with rs55730499 (8% frequency, p=1.7x10 -18 ) in an intron of LPA gene, associated with lipoprotein A and LDL cholesterol levels, plus CAD [14]. Variants in the 9p21 region, which includes CDKN2B-AS1 (ANRIL) and CDKN2A/B, were also associated (lead SNP rs1556516, 49.8% frequency, p=4.7x10 -16 ); this locus is involved in cellular senescence and associated with myocardial infarction and peripheral vascular disease [14]. Genetic variants across the 9p21.3 locus have previously also been associated with type-2 diabetes and cancers [14] and in longevity/survival analyses (see below), but only the CAD-associated loci were associated (p<5x10 -8 ) with parent's lifespan (Figure 2; Supplementary Table 2).

A parents' attained age genetic risk score in independent cohorts
We calculated a genetic risk score (GRS) combining the 10 attained parental age associated variants, weighted by the estimated effect sizes. Genotyped samples of European descent from the Health and Retirement Study (HRS, n=12,940 in genetic analyses) and the Wisconsin Longitudinal Study (WLS, n=8,952 from 6,683 families) were too small to test individual variant associations: the sample size needed for 80% power to detect an association between the APOE variant and parents' lifespan at p<0.05 (R 2 =0.0005 in UK Biobank) is 15,598, but much larger samples were required for the smaller-effect variants discovered in UK Biobank. We therefore tested the parents' attained age GRS; the sample size needed for an 80% chance of detecting an association of this magnitude (R 2 =0.002 in UKB) with p<0.05 was 3,924. In Cox's proportional hazards survival models against both dead and alive parents attained ages in HRS (n=12,345, number of events=9,809), increasing GRS was associated with longer survival (mortality Hazard Ratio=0.961, 95% CIs=0.942 to 0.980, p=6x10 -5 , adjusted for offspring age, sex, and genetic principal components 1-5). In the WLS (n=7,437, number of events=6,683, errors clustered by family), increasing GRS was associated with longer survival (mortality Hazard Ratio=0.963, 95% CIs=0.938 to 0.988, p=4x10 -3 , adjusted for age at saliva collection, sex, and genetic principal components 1-5).

Functional implications and pathways analysis
We investigated effects of the parents' attained age SNPs on expression of genes in multiple tissues, such as whole blood [15], using the FUMA tools [16]. Expression of 53 genes were associated, including 8 of the 10 nearest genes to the variants identified above (Supplementary Table 5

& 6).
In MAGENTA gene-set analysis, only two pathways were significantly enriched after correction for multiple testing, both related to nicotinic acetylcholine receptor binding and downstream events (Supplementary Table  4).

Sensitivity analyses of combined parents' attained age analyses
We generated QQ plots for all GWAS (Supplementary Figure 2) and observed some genomic inflation (Lambda GC for parents' attained age = 1.199), consis-tent with human longevity being highly polygenic, but perhaps also representing some effect of relatedness in inflating results. The BOLT-LMM software used for analysis accounts for relatedness in the methods, but we investigated further; the 10 variants associated with attained parental age were tested in Cox proportional hazards (CoxPH) models against parents' attained age, to assess whether the associations observed with ranknormalized Martingale residuals were robust. These models also excluded related (≥3 rd -degree) participants, leaving a reduced sample size of n=326,864, including n=153,101 with censored data due to at least one parent still being alive. Five of the 10 loci remained genome wide significant (p<5x10 -8 ) in these models; APOE, CHRNA3, LPA, ANRIL, and ATAXN2/SH2B3 (Supplementary Table 3) with the remaining five directionally consistent and associated at p≤3.1*10 -6 .
We performed sensitivity analyses only including parents who had died ( Figure 1B; Table 2), as the ages of living parents at the UKB study baseline are clearly an incomplete measure of their longevity. This analysis included a smaller sample size of n=208,118 UKB participants with both parental ages of death known. The same five of the ten loci reported above remained genome-wide significant (APOE, CHRNA3, LPA, ANRIL, and SH2B3) with the remaining loci remaining nominally significant. Two additional variants not identified in the previous analysis emerged as candidates for follow-up: rs1627804 (32% frequency, p=4x10 -8 ), in an intron of BEND3, a transcriptional repressor which associates with the NoRC (nucleolar remodeling complex) complex and playing a key role in repressing ribosomal DNA transcription; and rs146254978 (2% frequency, p=4.6x10 -8 ), located in an intron of both FPGT and TNNI3K.

GWAS of both parents achieving longest 10% survival
We next tested binary associations with having both parents surviving to the longest 10% of lifespans, using sex-specific age-at-death cut-offs (father ≥87, mother ≥90, including living parents aged over these age cutoffs), to assess whether associations were attributable to longer lives rather than early deaths. This analysis confirmed associations ( Figure 1C; Table 1) with variants in the APOE, CHRNA3, LPA and CDKN2B-AS1, with a suggestive association with ATXN2 (p=3.4x10 -6 ). Two further loci just reached significance in this analysis: rs28926173 (4.7% frequency, p=2.3x10 -8 ) in an intron of gene MC2R, and rs139137459 (0.4% frequency, p= Lead SNP = SNP with lowest p-value in 1Mb region BP = Genomic position, build 37 (hg19), A1F = frequency of A1, the effect allele, P = BOLT LMM p-value (A1F, BETA, and SE are for p-value in bold), SE = Standard Error Beta = A1 effect on outcome. For attained age (Martingale residuals) a negative BETA = reduced hazard, i.e. increased attained age Implicated gene(s) = Variants in locus usually intronic, or exonic (indicated by underlining). If two genes separated by dots, the SNP is intergenic AGING 2.7x10 -8 ) in an intron of USP2-AS1 (near USP2). The MC2R variant was associated at p=6.4x10 -6 with parents' attained age, but the USP2-AS1 variant had a low population frequency (0.4%) and was not significant in the continuous trait analyses.

Genetic risk score associations with centenarian status and offspring health status in UKB
There were 691 UKB participants with a centenarian mother (the longest lived 0.1% of maternal deaths in the sample), and 747 with a father aged ≥98 years (the corresponding age cut-off of longest lived 0.1%); only five participants had both parents in the longest lived 0.1%. Although this relatively small group (0.3% of the full attained age sample) were included in the discovery analysis, we tested the GRS for association with having a centenarian parent (in the un-related UK Biobank participants, n=1,181), against controls whose (both) parents died before age 80 (n=66,280). Increasing attained age GRS was associated with increased odds of having a centenarian parent (OR per standard deviation of GRS=1.193, 95% CIs=1.130 to 1.260, p=3.0x10 -10 ).

GWAS of mothers' and fathers' age at death separately
We tested for parental gender differences by conducting separate GWAS for mothers' and fathers' attained ages (including living parents). Eleven additional genetic variants were identified, three only associated with mother's age at death, eight associated with father's age at death ( A previous meta-analysis comparing older individuals (age 80 or 90+) to younger controls reported a genome wide association at the 5q33.3 locus, but no significant associations were found for the other two loci (p>0.01). Various other variants, including in mitochondrial DNA, have been suggested as associated with longevity. In our GWAS we tested associations with mitochondrial variants, plus variants on the X and Y chromosomes, using the directly genotyped microarray data: none were significantly associated (p>1x10 -5 ) with the studied phenotypes.

DISCUSSION
In this analysis we aimed to extend our previously published GWAS study of parental longevity to include the newly available full UKB data, achieving an unprecedented sample size for longevity studies. We have confirmed associations with APOE and CHRNA3 variants, and additionally identified 8 novel genome wide significant variants for our principal phenotype of combined parents' attained age, plus several additional variants for our related longevity phenotypes. Overall, the findings suggest that human longevity is a polygenic trait, influenced by large numbers of modest effect variants. Many of the variants identified have been previously implicated in age-related diseases or traits, notably with coronary artery disease and lipid levels. Several loci are near genes involved in inflammation, an important pathway in the development of biological aging [21]. In additional analyses of mothers' and fathers' attained ages separately we identified 11 additional genetic variants, three only associated with mother's age at death and eight associated only with father's age at death.
Parental lifespan-associated variants included rs1556516, located in an intron of CDKN2B-AS1 (ANRIL), a long non-coding RNA; variants in this region, 9p21.3, have previously been associated with CAD, type-2 diabetes, and cancer [14]. ANRIL is known to regulate neighbor tumour suppressors CDKN2A (encoding proteins p14arf and p16) and CDKN2B (encoding protein p15) epigenetically through directing polycomb gene regulators to modify the chromatin state [22]. ANRIL therefore plays a key role in regulation of cell proliferation and senescence. It is interesting to note that removal of CDKN2a expressing senescent cells achieved significant rescue of aging phenotypes in mice [23]. Intriguingly, the variants in 9p21 associated with lifespan are predominantly related to arterial disease (coronary and peripheral arterial disease), rather than those influencing cancer or type II diabetes risk ( Figure  2). Decreased expression of specific ANRIL isoforms is reported to affect expression of ADIPOR1 (adiponectin receptor with a role in fatty acid catabolism and inflammation), VAMP3 (membrane protein with a role in phagocytosis) and C11orf10 (in proximity to genes associated with metabolic syndrome) [24]. The lifespanincreasing allele for the lead SNP (rs1556516-G) is known to increase expression of ANRIL [25]; atherosclerosis-related phenotypes are associated with lower expression of ANRIL, suggesting a causal mechanism of action for this SNP on lifespan. rs11065979 (chromosome 11) is located near the ATXN2 gene, which is known to be associated with autoimmune disease and blood traits [14]. The variant is also highly correlated (R 2 =0.91) with an exonic variant in SH2B3, previously linked to CAD, blood pressure, TNF-α, cell counts and colorectal cancer [14]. SH2B3 encodes the lymphocyte adapter protein (LNK), which plays a key role linking inflammation and hypertension, particularly in the kidney and blood vessels [26].
EPHX2 is a hydrolase involved in the degradation of vasoactive and anti-inflammatory acids, with possible function in LDL-cholesterol metabolism [27]. Exonic polymorphisms in EPHX2 are associated with coronary artery calcification [28]. PROX2 is a transcription factor of unknown function implicated in cardiac muscle and neuron differentiation by gene ontology [29]. Genetic variants in FURIN, a gene located in chromosome 15, were previously associated with blood pressure and are here associated with parents' attained age; the protein is a widely-expressed membrane-bound protease, and is essential for regulatory and effector T-cell function [30].
Other novel genes implicated include MC2R, where the rare allele is associated with 15.7% increased likelihood of both parents reaching the top 10% of survival. This gene encodes the adrenocorticotropic hormone receptor, involved in the secretion of cortisol. This pathway is key to the stress response, which is regarded as a pillar of the aging process [31]. Lack of this functioning receptor can result in chronic hormone-imbalance and various conditions, including subclinical Cushing syndrome, plus impaired blood sugar regulation and immune system function [32].
For four of the novel parents' attained age-associated variants it is the rare allele that is associated with increased lifespan; rs602633 near CLESR2 and PSRC1; rs28383322 near HLA-DRB1 and HRL-DQA1; rs7844965 in EPHX2; and rs61978928 in PROX2. This is unusual, as for the other genetic variants it is the common allele that confers the advantage (the rare variant tends to be associated with increased diseaserisk, such as CAD).
We have shown that a genetic risk score based on the 10 variants associated with combined parents' age of death is associated with parental survival in two independent US aging cohorts, suggesting that our overall findings are robust. In addition to the more common variants identified (i.e. minor allele frequency >1%) there were several rarer variants, such as rs139137459 in USP2-AS1 (0.4% frequency): these associations are less certain, as the GWAS significance level of p<5*10 -8 may not be sufficiently rigorous for testing low frequency variants. The identified SNPs previously linked to GWAS traits (such as a variant in IL6R) have a high prior probability and are especially likely to be robust.
This study is limited to European ancestry (white British) UK Biobank participants, and results may not be applicable to other populations. UK Biobank is a healthier volunteer study with a sample that was less likely to be obese, to smoke or to drink regularly, compared to the general population [33], perhaps reducing our power to detect variants associated with health behaviors, although the sample still contains many participants who were exposed to these factors. We were not able to verify reported parental ages or ages of death, perhaps introducing misclassification. Also, ages of birth of deceased parents were not recorded, so it is not possible to compare study and national cohort survival.
We are limited by the coverage of the available genotyping microarray, and have used data on imputed SNPs from the Haplotype Reference Consortium panel, with imputation using the UK10K reference panel to follow. As with all GWAS discovered loci, the identification of affected genes is not straightforward, but we have followed the convention of highlighting the nearest genes, which for 8 of the 10 attained age variants was confirmed by the eQTL data.
We studied the normal range of parental ages at death, as well as the top end of longevity (top 10% of survival), to establish whether associations were driven by early deaths. Our longest lived 10% analysis is similar in age (87+ years for fathers, 90+ for mothers) to those studied in many of the previous GWAS of longevity [3,4]. Analyses with bigger samples of those attaining extreme ages are needed, but unfortunately our sample with extreme surviving parents (e.g. centenarians) is too small for GWAS analyses.
In addition to these limitations, our analysis has several strengths. Our sample size yielded unprecedented power to detect variant associations: indeed the power (>99% to detect an allele of 1% minor allele frequency accounting for 0.1% of phenotype variance) is sufficient to suggest that we have identified all moderate to larger effect common genotyped or imputed variants in our studied population. Our analyses were undertaken in a single cohort: previous longevity GWAS combined data from diverse cohorts, which has been suggested as explaining difficulties in detecting and replicating longevity variants [34].

Future work
Replication and extension of the novel associations in other large cohorts and in different ancestries is needed, especially for variants narrowly achieving genome wide significance. The associations identified should be examined in future studies of age-related traits and disease. More work is needed to characterize the variants identified for biological functions; this could include epigenetic and gene expression studies, such as analysis of allele-specific expression imbalance in heterozygote individuals to determine the gene of action for genetic variants [35]. The variants identified in our analysis may provide useful information for designing targeted prevention or precision treatments to promote human longevity, perhaps particularly related to targets in inflammatory pathways and cardiovascular disease.

CONCLUSIONS
Our analysis of parental longevity involved the largest sample size thus far achieved, with ample power (>99%) to detect even modest associations. The results indicate that human longevity is a polygenetic trait, including likely prominent effects involving cell senescence, inflammation, autoimmunity and the stress response, plus lipid levels and vascular disease. The results suggest that the identified variants are involved in achieving long lifespans, and are not limited to causing early deaths. The results also suggest that there may be gender differences in routes to longevity, with autoimmune variants being particularly important in women. Overall these results are consistent with some previous analyses in humans and with certain key hallmarks or pillars of the biology of aging. The variants identified may be useful for identifying targeted prevention and precision treatments for extending human lifespans.

Parental age at death and longevity phenotypes
Participants were asked the age at which their parents had died (or their current age if still alive). Analyses were performed separately on mother's age at death and father's age at death, and also on a combined phenotype. To reduce the effect of higher ages at death of mothers (compared to the fathers) we first z-transformed the mothers and fathers age at deaths before combining the z-scores into a single summed phenotype. Offspring of parents who died prematurely were excluded because the cause of death of the AGING participant's parents was not asked, so we could not exclude accidental deaths explicitly. To determine the age at death considered to be too young to be part of the normal distribution of age-related mortality, we used previously described methods to define a normal distribution of ages at death for mothers and fathers separately, as described previously [9]). In brief, nonlinear least square regression models were used to fit a normal curve on age at death data from the modal (M) age at death to the oldest ages; the younger (left) half of the curve is then imposed, to avoid the severe skewing of the distribution caused by the excess of deaths below the modal age. The analysis identified the following early death cut-points (based on modal age at death minus 1 Standard Deviation): mothers 57 years, fathers 46 years.
In addition, combined parents' age (including alive parents) were derived from Cox's proportional hazards regression models using Martingale residuals, as described by Joshi et al. [13]. Events were defined as the both parents having died. We also defined a binary phenotype by determining the top 10% of age at death for mothers (≥90 years) and fathers (≥87 years) separately. Controls were participants where both parents died before the age of 80. We were able to increase the number of "long lived" parents by including alive participants beyond the cut-off.

UK Biobank
Between 2006 and 2010, 503,325 volunteers (aged 45 to 69 years old) were recruited from across the United Kingdom to the UK Biobank study [36]. Of these, 389,166 participants met the inclusion criteria for at least one analysis: participants with complete genetics data and either date of death of their parents or their current age if alive. Participants were excluded if they reported themselves as adopted, or whose parents died more than 1 standard deviation below the modal age of death (mothers <57 years, fathers <46 years). Several longevity phenotypes are defined below based on the age at death of the participant's parents.
Genetic data was available on 488,377 UK Biobank participants after genotype calling and quality control performed centrally by the UK Biobank team [37]. We selected 451,447 participants identified as 'white European' through self-report and verified through principal components analysis based on genotypes. Briefly, principal components were generated in the 1000 Genomes Cohort using high-confidence SNPs to obtain their individual loadings. These loadings were then used to project all of the UK Biobank samples into the same principal component space and individuals were then clustered using principal components 1 to 4. Related individuals were identified through kinship analysis, although these participants were included in genome-wide analysis using BOLT-LMM (see below), those related to the third-degree or closer were excluded in sensitivity analyses. Imputation of 39,235,157 genetic variants from the Haplotype Reference Consortium panel was performed using IMPUTE4 centrally by the UK Biobank team [37]. After filtering for variants with MAF ≥0.1%, missingness <1.5%, imputation quality >0.1 and with Hardy-Weinberg equilibrium (HWE) P>1x10 -6 within the white British participants 11,516,125 imputed autosomal variants were eligible for the analyses.
We also utilized data directly from the microarrays for variants on the X (n=19,381) and Y (n=284) chromosomes, and on the mitochondrial genome (n=135), which were unavailable in the imputed dataset.

Health and Retirement Study (HRS)
In HRS v2 data, phase 1-3 samples were combined (n=15,708) and SNPs were kept if no discordant genotype call among duplicate samples (2,315,518 SNPs). Related or duplicated participants and genotyping controls were removed, as well as those with a missing call rate equal to or greater than 2%. After the sample quality control, 15,454 samples were kept and 84% of them (n=12,940) were of European descent.
2,075,208 out of 2,315,518 SNPs passed the composite filters by the University of Washington (Hardy-Weinberg equilibrium p-value < 1e-4, missing call rate ≥ 2%, sex difference in allelic frequency ≥ 2% and sex difference in heterozygosity > 0.3) and had a minor allele frequency greater than zero. As recommended, SNPs with a minor allele frequency smaller than 2% further were removed. At the end, 1,437,608 SNPs were carried forward for SNP pruning. The purpose is to choose less correlated SNPs for principal components analysis.
The SNP pruning was performed following the procedure: a) consider a window of 50 SNPs, b) calculate LD between each pair of SNPs in the window, c) remove one of a pair of SNPs if the LD is greater than 0.5, d) shift the window 5 SNPs forward and repeat the procedure. 514,246 SNPs remained after the SNP pruning and were used to run principal components analysis. As a result, the top 10 principal components were used to adjust for ancestry in a regression model. Both SNP pruning and principal components analyses were performed in PLINK1.9 The samples in the genetic data were mapped to the phenotype data of HRS to retrieve parental longevity AGING phenotypes. We used the phenotype data from the most recent wave or visit of HRS where parental data was missing if the parent died before the last visit. To recover the data, we sequentially looked back at the last visit until the parental data was available. Father's and mother's data therefore could be from different visits and the participant's age varied with phenotypes. Mothers dying before 61 years and fathers dying before 46 years were excluded from the analysis, based on the same analysis of distribution of ages of death applied in UK Biobank. Parent's age was set to missing if he or she was still alive and younger than the child's age +10.
The genetic risk score analysis was conducted in a Cox regression model (against combined parents' attained age, alive or dead: derived by summing the ztransformed mother's and father's attained ages) with adjustment for age and sex of the participant and the top 10 principal components.

Wisconsin Longitudinal Study (WLS)
In the WLS saliva samples were collected by mail in 2006-7 and during in-person interviews in 2011 for those who did not mail back samples. Genotyping on the Illumina HumanOmniExpress array was completed at Johns Hopkins' Center for Inherited Disease Research (CIDR) and data cleaning and imputation was performed in collaboration with the Genetic Analysis Center at the University of Washington. After quality control, there are a total of 9,012 respondents and 688,596 SNPs available for analysis. The sample has 4,601 singletons and 2,263 families with more than one member genotyped. Only 66 respondents are not of European descent. Imputation of 33,549,349 genetic variants from the 1000 Genomes Project reference panel (Phase 3) was performed using IMPUTE2, with 686,143 study SNPs serving as the imputation basis. Of the 10 variants used to construct the GRS, two were directly genotyped and eight were imputed. Parent's age was constructed by subtracting the year of birth from the year of death. When the latter was not available, the year of the most recent interview was used instead and the parent was classified as not dead. Mothers dying before 56 years and fathers dying before 46 years were excluded from the analysis, based on the same modelling of distributions of ages of death applied in UK Biobank. Parent's age was also set to missing if the disagreement between sibling reports exceeded 10 years. The analysis phenotype was the standardized sum of both parents' ages and was set to missing if either element was not available.
The genetic risk score analysis was conducted in a Cox regression with censored data, adjusting for age at saliva collection, sex of the participant and the top 5 principal components. Standard errors were clustered at the family level. Limiting the sample to the primary WLS respondents had a negligible effect on the results.

Power Calculations
R package 'pwr' and function `pwr.f2.test` was utilized for power calculations of regression analysis. The numerator degrees of freedom was set to 1, significance level to 0.05, and power to 0.8.

Genome-Wide Association Study
We used BOLT-LMM v2.2 to model the associations between imputed variants (dosages) and each phenotype [38], which uses a linear mixed effects model approach. We looked at the results for variants with imputation quality >0.1, HWE p-values >1x10 -6 and minor allele frequencies >0.1% in the white/British subset used for all analyses. For variants on the X, Y and mitochondrial chromosomes only in the directly-genotyped data we used Plink (v1.9) [39] in linear (additive) or binary (fisher) models, as appropriate, adjusted for the same covariates as above including the first 5 principal components from FlashPCA.

Pathways Analysis and Functional Annotation
We utilized FUMA (Functional Mapping and Annotation of Genome-Wide Association Studies) to investigate the genetic variants associated with parents' attained age [16]. In particular, FUMA performs gene-set analysis (using MAGMA) to identify pathways enriched amongst the significant genes (weighted by the SNP-associations in proximity to them). In addition, FUMA searched eQTL databases to identify SNPs that significant alter the expression of genes in various tissues. /1. LF is supported by the Intramural Research Program of the National Institute on Aging, U.S. National Institutes of Health. Input from CK and GK was supported by the University of Connecticut Health Center. The Health and Retirement Study (HRS) is a longitudinal project sponsored by the National Institute on Aging (NIA U01AG009740) and the Social Security Administration. This research uses data from the Wisconsin Longitudinal Study (WLS) of the University of Wisconsin-Madison. Since 1991, the WLS has been supported principally by the National Institute on Aging (AG09775, AG21079 and AG33285), with additional support from the Vilas Estate Trust, the National Science Foundation, the Spencer Foundation and the Graduate School of the University of Wisconsin-Madison. A public use file of data from the WLS is available from the Wisconsin Longitudinal Study, University of Wisconsin-Madison, 1180 Observatory Drive, Madison, Wisconsin 53706 and at (http://www.ssc.wisc.edu/wlsresearch/data). This material is the result of work supported with resources at the William S. Middleton Memorial Veterans Hospital, Madison, Wisconsin.