Introduction

In 2020, approximately 10% of the world’s population was over 65 years of age [1]; and the proportion of older adults is growing at a faster rate than all other age groups. As aging represents the most profound risk factor for many chronic diseases, the prevalence of age-related diseases such as dementia and ischemic heart disease is also proportionately increasing [2]. The geroscience hypothesis posits that by targeting aging mechanisms, we can potentially reduce the risk of several age-related diseases and extend the health span (i.e., duration of life lived in good health) [3]. As such, increasing attention has been placed on the hallmarks of aging which may better capture underlying biological differences and thus an individual’s risk of disease.

One of the most studied and promising aging biomarkers is epigenetic age. It estimates biological age based on DNA methylation (DNAm) levels at multiple cytosine-phosphate-guanine dinucleotides (CpGs) across the genome [4,5,6,7,8,9]. While epigenetic age can be estimated from any tissue, when measured in the blood, it reflects system-wide biological age [4, 10]. Over the previous decade, a number of epigenetic clock algorithms have been developed [4,5,6,7,8,9]. The first-generation epigenetic clocks, namely, HorvathAge [5] and HannumAge [6], were generated to predict chronological age using the specific CpGs sites which are strongly associated with chronological age. HorvathAge [5] was developed on 353 CpGs sites across multiple tissue samples whereas HannumAge [6] was trained on 71 CpGs sites from blood samples. Second-generation clocks such as PhenoAge [7] and GrimAge [8, 11] were trained in blood samples by incorporating additional age-related clinical markers. PhenoAge [7] used 513 CpGs sites which are related to chronological age and nine phenotypic aging biomarkers including albumin, creatinine, and C-reactive protein levels whereas GrimAge [8] considered 1030 CpGs sites associated with smoking-pack-years and a selection of plasma proteins including adrenomedullin, beta-2-microglobulin, and cystatin-C. Further, using the same set of 1030 CpGs that underline the original GrimAge, GrimAge2 was recently developed by additionally incorporating two new DNAm-based plasma proteins—high sensitivity C-reactive protein, and hemoglobin A1C [11]. The second-generation clocks have been found to outperform the first-generation clocks in the risk prediction of age-related diseases and all-cause mortality [5,6,7,8, 11, 12]. For the first- and second-generation clocks, the residual taken from regressing epigenetic age on chronological age is referred to as epigenetic age acceleration (AA) [10]. A positive value of AA indicates an older epigenetic age relative to chronological age, which has been shown to be predictive of several age-related diseases [10]. More recently, a third generation, the Pace of Aging (DunedinPACE) was developed using 173 CpGs sites in blood samples [9]. In contrast to prior generations, DunedinPACE was built by incorporating longitudinal within-person physiological change together with methylation pattern and thus specifically designed to measure the rate of aging [9].

Other measures have also been developed to estimate biological aging in recent years. Another system-wide measure of biological aging is the deficit accumulation frailty index (FI), a multidimensional tool that measures the accumulation of health deficits across a range of physical, psychological, and mental processes [13]. Unlike AA, FI can capture the clinical state of age-related physiological decline in which a person is more susceptible to developing adverse health outcomes [14, 15]. There are also aging biomarkers that target specific organs. Brain age is estimated using neuroimaging data, and an older brain relative to chronological age, positive brain-predicted age difference (brain-PAD), indicates accelerated brain aging [16]. In recent years, evidence has found that accelerated brain aging was predictive of cognitive performance, and physical disability, particularly in older people [17,18,19]. In literature, few studies have investigated the associations between epigenetic clocks and frailty and/or brain age, showing inconsistent relationships between these biological aging measures [20,21,22]. Further, all these previous studies were investigated using only first or second generations of epigenetic clocks. Thus, it is yet unclear to what extent these other biological aging indices are strongly correlated with different generations of epigenetic AA, and thus, it remains unclear to what extent these capture different aspects of the aging process.

On average, females have a longer lifespan than males [23] but may carry a higher disease and disability burden [24]. Such discrepancies may reflect differences in biological phenotypes, genetic constitutions, and physiological traits between males and females [23, 25], In terms of social behaviors, there are recognized sex differences in health-related behaviors, and in healthcare seeking, including accessing social and familial support [26]. Given these disparities, it is important to understand how the relationships between AA and health status might differ between males and females.

This study aimed to determine the correlation between epigenetic aging, with other measures of biological aging (i.e., system-wide FI, and brain-PAD), in an initially healthy population of males and females aged ≥ 70 years. We additionally explored the extent to which these AA measures were associated with sociodemographic characteristics, clinical measures, and chronic conditions, separately in males and females.

Methods

Study population

This study sub-sampled 560 Australians from the ASPREE (ASPirin in Reducing Events in the Elderly) study, who provided blood samples at baseline [27]. Full details regarding the sampling procedure, inclusion and exclusion criteria, and study design of the ASPREE study have been published previously [27, 28]. Briefly, in Australia, participants aged ≥ 70 years were recruited through a partnership with general practices, from March 2010 through December 2014. The ASPREE exclusion criteria included a history of cardiovascular disease (CVD), atrial fibrillation, anemia, dementia, a score of < 78 on the Modified Mini-Mental State examination (3MS), uncontrolled high blood pressure (i.e., ≥ 180/ ≥ 105 mm Hg), or other major life-limiting health conditions likely to cause death within 5 years.

The ASPREE study complies with the Declaration of Helsinki and was approved by multiple Institutional Review Boards (www.aspree.org). All participants signed written informed consent on participation. The present sub-study was approved by the Monash University Human Research Ethics Committee (MHREC 30734).

Epigenetic age

Blood samples were processed within 4 h of the blood collection and stored at ultra-low − 80° temperatures. All samples have barcodes and are prepared for long-term storage at a purpose-built large biorepository near the ASPREE National Co-ordinating Centre in Melbourne (www.aspree.org).

DNA methylation (DNAm) from peripheral blood samples (buffy coat) collected at baseline was extracted using Qiagen DNeasy Blood and Tissue Kits (https://www.qiagen.com/au). Cytosine-phosphate-guanine (CpG) probe methylation was measured using the Illumina Infinium Methylation EPIC BeadChip (EPIC) and run at the Australian Genome Research Facility, Melbourne, Victoria (https://www.agrf.org.au/). The programming platform R version 4.1.3 with R package minfi was used for preprocessing data for use in Horvath’s online “New DNAm Age Calculator” (https://dnamage.genetics.ucla.edu/new). The preprocessNoob method was used to normalize the EPIC data [29]. According to the online age calculator’s expectation, the beta values from the EPIC data set were limited to the probes that the 450 K array could measure, and the beta values of missing probes that are not present on the 450 K array (33,124 probes) were filled with “NA” in the data set. The list of 33,124 missing probes has been described previously [30].

Next, the sample methylation data file (including beta measurements) and the sample annotation file, which included sample ID, age, sex, and tissue type, were uploaded to the online calculator. The default option (i.e., “Normalize Data”) and an additional option—“Advanced Analysis”—were selected before submitting the data. The output of the online age calculator consisted of HorvathAge [5], HannumAge [6], PhenoAge [7], GrimAge [8], and their AA measures (residuals from regressing epigenetic age on chronological age). A positive value of AA indicates an older epigenetic age relative to chronological age. Further, version-2 of GrimAge [11] and its AA were estimated through the new online software of Clock Foundation (https://dnamage.clockfoundation.org/). We also considered other DNAm estimates—telomere length [31] and DNAm-based components of original GrimAge and GrimAge2 [8, 11] (adrenomedullin (ADM), beta-2-microglobulin (B2M), cystatin-C, growth differentiation factor 15 (GDF-15), leptin, smoking-pack-years, plasminogen activation inhibitor 1 (PAI-1), tissue inhibitor of metalloproteinases 1 (TIMP-1), high sensitivity C-reactive protein (CRP), and hemoglobin A1C (A1C)). The Pace of Aging (DunedinPACE) is a measure of the ongoing rate of deterioration in system integrity. DunedinPACE was estimated using R software (version 4.1.3) and the codes described in Belsky et al. [9].

Brain age

ASPREE participants living in the Greater Melbourne region and without MRI contraindications (e.g., specific foreign bodies or electronic implants, claustrophobia) were invited to participate in the ASPREE-NEURO sub-study [32]. Brain scans were performed at the Monash Biomedical Imaging Facility, Melbourne, Australia, using a 3 Tesla Siemens Skyra scanner (Siemens, Erlangen, Germany). Raw 3D T1-weighted images were pre-processed using the MATLAB Statistical Parametric Mapping software package (University College London, London, UK) [33]. Full details regarding the pre-processing of this data have been described in detail previously [19]. Brain age was calculated using a trained, publicly accessible processing pipeline and prediction algorithm created by Cole et al. (BrainageR, version 2.1, https://github.com/james-cole/brainageR) [16].

The brain-predicted age difference (brain-PAD) refers to the difference between an individual’s estimated brain age and their chronological age. A positive brain-PAD value indicates that the individual’s brain is older than its chronological age.

Frailty index

A deficit accumulation frailty index (FI) of 67 items was constructed based on the method of Rockwood and Mitnitski [13]. The index included chronic disease diagnosis and indicators (24 items), physical deficits interfering in completing the activities of daily living (26 items), mental and psychosocial deficits (11 items), and measures of physical functional engagement and cognitive function (6 items), as described in detail previously [34]. Each item was scored from 0 (deficit absent) to 1 (deficit present). The FI was derived by summing the number of deficits noted for each person and dividing this by the total number of possible deficits. Higher scores indicate greater frailty.

Sociodemographic characteristics

Sociodemographic characteristics included in this study were chronological age, sex (male or female), years of education (< 12 years or ≥ 12 years), living situation (at home alone or with family/others), and socioeconomic status (SES) (very low, low, middle, high, very high). SES was estimated through the Socio-Economic Indices for Areas-Index of Relative Socioeconomic Advantage and Disadvantage (SEIFA-IRSAD) based on the information from the 2011 Australian Census using the residential postcodes of participants [35]. Smoking was categorized into never, former, or current. Alcohol consumption was defined as never, former, current-low risk (i.e., ≤ 10 standard drinks per week and ≤ 4 standard drinks on any 1 day), or current-high risk (i.e., > 10 standard drinks per week or > 4 standard drinks on any 1 day) [36].

Clinical measures, chronic conditions, and self-reported outcomes

Clinical measures included systolic blood pressure (SBP) and diastolic blood pressure (DBP) in mmHg, dominant hand grip strength in kilograms, and gait speed in meters/seconds. Full detailed procedures for taking measurements have been previously reported [37]. In brief, for SBP and DBP, the average scores after three measurements were used. The grip strength of both hands was assessed three times using an isometric handheld dynamometer (Jamar; Lafayette Instruments). The mean score for each participant’s self-reported dominant hand was used in the analysis. Gait speed was assessed as the time spent walking 3 m at a habitual pace on a flat surface from a standing start. Gait speed was measured twice, and the mean score was used in the analysis.

Self-reported outcomes were assessed through the SF-12 version-2 questionnaire [38]. Self-rated health ranged from 1 (excellent) to 5 (poor). Physical (PCS) and mental component scores (MCS) of health-related quality of life (HRQoL) were generated using a norm-based scoring method [38]. Higher scores indicate better health-related quality of life.

Chronic conditions investigated included hypertension (systolic blood pressure ≥ 140 mmHg or diastolic blood pressure ≥ 90 mmHg and/or treatment for high blood pressure), diabetes (fasting glucose ≥ 126 mg/dL or treatment or self-report), dyslipidemia (cholesterol-lowering medications or low-density lipoprotein, LDL > 160 mg/dL (> 4.1 mmol/L), or serum cholesterol ≥ 212 mg/dL (≥ 5.5 mmol/L)), obesity (body mass index ≥ 30 kg/m2) [39], chronic kidney disease (estimated glomerular filtration rate (eGFR) < 60 ml/min per 1.73 m2 or spot urinary albumin: creatinine ratio (UACR) ≥ 3 mg/mmol with eGFR ≥ 60 ml/min per 1.73 m2) [40], and depression (Center for Epidemiological Studies-Depression CES-D-10 score ≥ 8) [41].

Statistical analysis

First, the correlations between different measures of biological age (i.e., HorvathAge, HannumAge, PhenoAge, GrimAge, GrimAge2, DunedinPACE, brain age, and FI) and chronological age were examined. Next, Pearson’s correlation, with pairwise deletion, was used to investigate the correlation between the different biological aging measures. Pearson’s correlation, t-tests, or one-way ANOVA, as appropriate, were also used to investigate the correlations between the AA measures and other DNAm-based estimates and sociodemographic characteristics, clinical measures, and chronic diseases, separately in males and females. Analyses were undertaken in STATA statistical software version 17.0, and the figures and graphs were plotted using the programming platform R version 4.1.3.

Results

Participants’ characteristics

The characteristics of the 560 study participants are shown in Table 1. Approximately, half were female (50.7%), and the chronological age ranged from 70.0 to 92.0 years, with a mean of 74.5 years. The estimated brain age (mean, 72.1 ± 6.9 years), HorvathAge (mean, 70.6 ± 6.1 years), GrimAge (mean, 70.1 ± 5.0 years), and GrimAge2 (mean, 74.1 ± 5.3 years) were similar to the mean chronological age, whereas HannumAge (mean, 59.5 ± 6.1 years) and PhenoAge (mean, 58.9 ± 7.7 years) were younger than the chronological age. DunedinPACE ranged from 0.69 to 1.30, with a mean of 0.97, and the mean FI was 0.10. Compared to females, males had significantly higher epigenetic aging whereas females had higher deficit accumulation frailty scores than males. Brain age was identical between males and females (Table 1).

Table 1 The characteristics of study participants

Correlation between different measures of biological age, and chronological age

Epigenetic age, brain age, and FI were positively correlated with chronological age (Fig. 1). The strongest correlation was found for GrimAge (r = 0.67), while DunedinPACE (r = 0.16) and FI (r = 0.14) had the weakest correlations with chronological age.

Fig. 1
figure 1

Correlations between estimated biological age (y-axis) and chronological age (x-axis). A HorvathAge vs. chronological age (n = 560); B HannumAge vs. chronological age (n = 560); C PhenoAge vs. chronological age (n = 560); D GrimAge vs. chronological age (n = 560); E GrimAge2 vs. chronological age (n = 560); F DunedinPACE vs. chronological age (n = 560); G Brain age vs. chronological age (n = 400); H deficit accumulation frailty index vs. chronological age (n = 560)

Correlation between AA, brain-PAD, and FI

All AA measures were positively correlated with one another (r ranged from 0.08 to 0.57), but DunedinPACE showed weak correlations, particularly with 1st generation HorvathAA and HannumAA (Fig. 2). GrimAA and Grim2AA were strongly correlated with each other (r = 0.95). Brain-PAD was not correlated with any of the AA measures, or FI. FI had the strongest correlation with DunedinPACE (r = 0.19) and Grim2AA (r = 0.15), and significant but weak correlations with HannumAA, PhenoAA, and GrimAA (r ≤ 0.12). Similar correlations were also found in males and females (Supplementary Tables 1 and 2).

Fig. 2
figure 2

Pearson’s correlation matrix on epigenetic age acceleration (HorvathAA, HannumAA, PhenoAA, GrimAA, Grim2AA, DunedinPACE), brain-PAD, and deficit accumulation frailty index. *p-value < 0.05; **p-value < 0.01; ***p-value < 0.001

AA variations according to the sociodemographic characteristics

In males (Supplementary Table 3), we observed an association between smoking and both versions of GrimAA (p-value < 0.001) and DunedinPACE only (p-value = 0.003). In females, however (Supplementary Table 4), we observed associations across all the sociodemographic characteristics. In particular, less than 12 years of education was associated with accelerated DunedinPACE; living alone was linked to an accelerated HorvathAA, low SES, current smoking, and former alcohol consumption were associated with higher in both versions of GrimAA (p-values < 0.05).

Correlation between AA, clinical measures, and self-reported health outcomes

Correlations between AA measures and clinical measures/self-reported health outcomes were also more commonly found in females than males (Fig. 3). In males, correlations were observed between SBP and HannumAA (r = 0.13), between gait speed and Grim2AA (r =  − 0.12), and between PCS and both versions of GrimAA (r =  − 0.15). In females, all clinical measures and self-reported health status, except for MCS, were correlated with AA measures (r ranged from − 0.23 to 0.24) with DunedinPACE having the greatest number of significant correlations.

Fig. 3
figure 3

Correlation matrix between epigenetic age acceleration measures and systolic blood pressure (SBP), diastolic blood pressure (DBP), grip strength, gait speed, self-rated health (higher scores indicating worse health), and physical component score (PCS) and mental component score (MCS) of the SF-12, separately in males and females. *p-value < 0.05; **p-value < 0.01; ***p-value < 0.001

AA variations according to the chronic condition status

Hypertension, diabetes, and chronic kidney disease were associated with accelerated GrimAA (both versions) and DunedinPACE in males (Supplementary Table 5). In contrast, only obesity and depression were associated with accelerated GrimAA (both versions) in females, and obesity was also associated with accelerated DunedinPACE.

Other DNAm-based estimates and their associations with sociodemographic characteristics, clinical measures, self-reported health outcomes, and the chronic condition status

There were also sex differences in the associations between other DNA-based estimates (telomere length, plasma proteins, and smoking-pack-years) and sociodemographic factors and health markers (Supplementary Tables 6 to 15). The correlations between telomere length and clinical measures except for DBP and MCS were found only in females (Supplementary Table 8). Regarding the chronic conditions, only chronic kidney disease was associated with shorter telomere length in females, and no associations were found in males (Supplementary Table 9). Correlations between either plasma proteins or smoking-pack-years and clinical measures were commonly found in both males and females (Supplementary Tables 12 and 13). At least one plasma protein was associated with diabetes, dyslipidemia, obesity, and chronic kidney disease in both males and females, but associations were more commonly found in males than females (Supplementary Tables 14 and 15).

Discussion

In this study, we examined the extent to which different measures of epigenetic aging were correlated with a system-wide measure of biological aging (FI) and an organ-specific biological aging marker (brain-PAD), separately in males and females. We observed that FI had weak but significant positive correlations with AA in both sexes, whereas brain-PAD was not correlated with any of the other biological aging measures. This suggests that these system-wide measures of biological aging are capturing somewhat overlapping, yet distinct aspects of the aging process. We also found that epigenetic aging was correlated with participants’ sociodemographic characteristics and health status, but this varied between males and females. Together, these findings indicate that markers of biological aging may be useful clinical indicators of overall physical, cognitive, and psychosocial deficits even in relatively healthy older people who have reached 70 years without major illness, including CVD and dementia. Further, we also found that there were differences between males and females in the relationships between other DNAm estimates (i.e., telomere length, plasma proteins, and smoking-pack-years) and health status.

Not surprisingly, in our present study, the first-generation AA measures were most strongly correlated with one another, and the most recent of these measures, DunedinPACE, was most significantly associated with the second-generation AA measures where the correlation with Grim2AA was the highest. Our finding is supported by previous research [5, 6, 8, 9] and aligns with the general purpose for which these epigenetic clock algorithms were developed. The first-generation epigenetic clocks were generated to predict chronological age [5, 6] whereas the second-generation clocks, like GrimAge [8, 11], go beyond just estimating chronological age and additionally aim to predict the risk of developing health outcomes. To reflect that purpose, GrimAge algorithms were constructed by incorporating additional age-related clinical and laboratory measures as well as lifestyle factors such as smoking [8, 11]. Thus, clear evidence has been found that second-generation clocks outperform first-generation clocks in the risk prediction of age-related diseases and death [8]. The latest generation clock—DunedinPACE—was specifically designed to capture the gradual, progressive deterioration simultaneously affecting different organ systems with aging [9]. This noticeably contrasts with first-generation clocks which were designed and trained to predict chronological age (HorvathAge [5] and HannumAge [6]). Moreover, prior studies have reported significant but weak correlations between AA and system-wide FI [21, 42]. However, as ours is the first to investigate correlations with GrimAge2 and DunedinPACE, we added new evidence to the field that DunedinPACE and GrimAge2 are more strongly correlated with FI rather than other AAs. This is not surprising given both DunedinPACE and FI have a multidimensional nature. Like DunedinPACE, the FI defines biological aging through the accumulation of age-related health deficits across a range of functions (physical, psychological, and social). Noticeably, our study also demonstrated that the correlations of GrimAge2 with DunedinPACE and FI were slightly stronger than those of the original GrimAge. This finding could be due to the additional inclusion of two commonly used biomarkers (i.e., DNAm-based high-sensitivity CRP, and hemoglobin A1C) in the GrimAge2 development [11]. Further, our finding of GrimAge2 also somewhat reflects the fact that GrimAge2 outperforms the original GrimAge in terms of mortality and morbidity risk prediction [11].

Our finding that males had accelerated epigenetic age in comparison to females is consistent with prior evidence demonstrating that, for individuals with the same chronological age, males tend to be biologically older than females [43]. This also aligns with the established longer life expectancy of females compared to males [23] which could be driven by a complex interaction of biological factors, including genetic influences and sex hormones, as well as behavioral factors [44]. Males may be more likely to engage in risky behaviors and unhealthy lifestyles such as smoking and heavy drinking [45, 46] and have an age-related decline in testosterone concentrations, which could lead to a greater risk of fatal illnesses including heart disease and cancer [47, 48]. There is also evidence that females are more willing to engage with health-related information and are more likely to seek medical treatment for their illness [49] which together could help initiate early detection and intervention. In this manner, females may have less fatal outcomes. In contrast, females in our study had higher deficit accumulation FI scores than males, which captures the overall disease burden. This finding also supports the conclusion of a prior meta-analysis among 37,426 participants, showing that females of all ages have a poorer health status in terms of FI, compared to males [50]. Thus, our finding from this relatively healthy older cohort might also reflect the male–female health-survival paradox, in that females live longer and are more likely to experience disabilities and disease burdens [46, 51].

Several studies have investigated whether AA measures are associated with participants’ sociodemographic characteristics and markers of their health status [52,53,54]. These studies have generally, but not consistently, shown that lower SES status and/or risk factors for chronic diseases, such as high blood pressure and smoking, are associated with accelerated aging. However, very few studies have considered whether these associations may vary between males and females. Our study provides an important new contribution to this field, demonstrating clear evidence of sex differences. In females, several significant correlations were observed between AA and health risk factors such as low SES and living alone, as well as markers of physical functioning (e.g., grip strength and gait speed), SBP, DBP, and self-rated overall and physical health whereas only associations between AA, especially Grim2AA, and SBP, gait speed, and self-rated physical health were found in males. Thus, our cross-sectional findings highlight that accelerated epigenetic aging could be linked with physical health deterioration in both males and females. Hence, a further longitudinal study is suggested to enhance this novel relationship and to examine whether accelerated epigenetic aging could be predictive of physical health deterioration in later life. Moreover, significant associations were found between newer accelerated AA measures and hypertension, diabetes, and chronic kidney disease in males only whereas the associations of AA measures with obesity and depression were observed only in females. Although the exact reasons for these differences are still unclear, biological disparities and previously noted sex differences in the prevalence of health conditions [55, 56] may have contributed to our results. Overall, it highlights the importance of considering separately males and females when investigating causal inference of our observed relationships in future longitudinal studies.

Our study had both strengths and limitations. The strength of this study includes investigating several epigenetic clocks including the most recent ones—GrimAge2 and DunedinPACE and their correlations with the system-wide FI, and brain-PAD, among community-based individuals. Our sex-balanced sample (females 50.7%) allowed us to investigate the relationships between AA and markers of health in males and females separately. We acknowledged that our community-based sample comprises relatively healthy older Australians without dementia or CVD, and about two-thirds had high SES. Thus, this could limit the generalizability of our findings. However, the sample had a good distribution in terms of education level (e.g., 41.8% had < 12 years of education) and ranged in age from 70 to 92 years. The findings add new evidence about the utility of biological aging measures even in relatively healthy older individuals.

In conclusion, our study showed that, in relatively healthy older individuals, epigenetic age was correlated with the system-wide deficit accumulation FI, but not with the neuroimaging-based brain aging biomarker (brain-PAD). We also showed that despite having similar chronological ages, females in our study had lower epigenetic aging, but higher FI, than males. Further, this study provides novel evidence showing sex differences in the relationships between AA measures and markers of health status. Thus, our results highlight the importance of considering sex differences when investigating biological aging and its utility as a marker of health or disease risk.