A tool for translating polygenic scores onto the absolute scale using summary statistics

Pain, Oliver; Gillett, Alexandra C.; Austin, Jehannine C.; Folkersen, Lasse; Lewis, Cathryn M.

doi:10.1038/s41431-021-01028-z

Download PDF

Article
Open access
Published: 04 January 2022

A tool for translating polygenic scores onto the absolute scale using summary statistics

European Journal of Human Genetics volume 30, pages 339–348 (2022)Cite this article

6846 Accesses
14 Citations
16 Altmetric
Metrics details

Subjects

Abstract

There is growing interest in the clinical application of polygenic scores as their predictive utility increases for a range of health-related phenotypes. However, providing polygenic score predictions on the absolute scale is an important step for their safe interpretation. We have developed a method to convert polygenic scores to the absolute scale for binary and normally distributed phenotypes. This method uses summary statistics, requiring only the area-under-the-ROC curve (AUC) or variance explained (R²) by the polygenic score, and the prevalence of binary phenotypes, or mean and standard deviation of normally distributed phenotypes. Polygenic scores are converted using normal distribution theory. We also evaluate methods for estimating polygenic score AUC/R² from genome-wide association study (GWAS) summary statistics alone. We validate the absolute risk conversion and AUC/R² estimation using data for eight binary and three continuous phenotypes in the UK Biobank sample. When the AUC/R² of the polygenic score is known, the observed and estimated absolute values were highly concordant. Estimates of AUC/R² from the lassosum pseudovalidation method were most similar to the observed AUC/R² values, though estimated values deviated substantially from the observed for autoimmune disorders. This study enables accurate interpretation of polygenic scores using only summary statistics, providing a useful tool for educational and clinical purposes. Furthermore, we have created interactive webtools implementing the conversion to the absolute (https://opain.github.io/GenoPred/PRS_to_Abs_tool.html). Several further barriers must be addressed before clinical implementation of polygenic scores, such as ensuring target individuals are well represented by the GWAS sample.

Improving reporting standards for polygenic scores in risk prediction studies

Article 10 March 2021

Hannah Wand, Samuel A. Lambert, … Genevieve L. Wojcik

Tutorial: a guide to performing polygenic risk score analyses

Article 24 July 2020

Shing Wan Choi, Timothy Shin-Heng Mak & Paul F. O’Reilly

Assessing agreement between different polygenic risk scores in the UK Biobank

Article Open access 27 July 2022

Lei Clifton, Jennifer A. Collister, … David J. Hunter

Introduction

A substantial proportion of individual differences in health and disease are explained by genetic variation [1]. Genome-wide association studies (GWAS) have successfully identified thousands of genetic loci associated with a broad range of phenotypes, from anthropometric traits such as height [2], to psychiatric disorders such as major depression [3]. One application of GWAS results is the estimation of genetic risk/propensity for a given phenotype using polygenic scores. Polygenic scores are calculated as the GWAS effect size-weighted sum of alleles carried by an individual [4]. A range of methods exist for processing GWAS summary statistics prior to polygenic scoring to account for the linkage disequilibrium (LD) between variants and improve the predictive utility of polygenic scores [5].

Polygenic scores capture only part of the genetic liability to a disease or trait, but the proportion explained increases with larger GWAS sample sizes and with improvements in polygenic scoring methodology. For example, individuals in the top 8% of cardiovascular disease (CAD) polygenic scores have a three-fold increased risk of developing CAD compared to the general population [6]. The implementation of polygenic scores within a clinical setting are being increasingly discussed and investigated [7, 8], but several barriers exist before these scores can become an established part of healthcare, including the low variance of risk explained and issues with interpretation.

For correct interpretation of a polygenic score, the score must first be standardised according to an ancestry-matched reference, so that the score is transformed to units of standard deviations (SD) from the mean. This standardised score, referred to as a polygenic Z-score, can then be used to calculate the relative risk of disease for an individual, based on their polygenic score. Whilst relative risk estimates are of interest, they are challenging to interpret as they do not consider the predictive utility of the polygenic score or the prevalence of the outcome in the general population. It is well established that differences in risk are most accurately perceived by non-experts when presenting risk on the absolute scale, i.e., the probability an individual will develop the outcome [9, 10]. The absolute risk conferred by a given relative risk is determined by the predictive utility of the polygenic score and the population prevalence of the phenotype. For example, an individual’s polygenic Z-score for schizophrenia may be 1.96, indicating their polygenic score is higher than 97.5% of an ancestry-matched population. However, the absolute risk, or probability of the individual developing schizophrenia, is unknown, as we must account for the predictive utility of the polygenic score and the population prevalence of schizophrenia.

When individual-level data are available for the polygenic score and the outcome of interest, it is possible to calculate the absolute risk conferred by a given polygenic score by measuring the proportion of cases within polygenic scores quantiles. This approach is often used in research to put variance explained estimates into perspective [6], and it is also implemented by 23andMe to help interpretation of results by their customers [11]. However, individual-level data for the phenotype of interest within a representative sample is often unavailable. Therefore, a summary statistic-based approach, requiring only information describing the predictive utility of the polygenic score and the prevalence of the outcome, would greatly improve the availability of interpretable results from polygenic scores.

Within a homogenous population, polygenic scores are normally distributed due to the central limit theorem, and therefore normal distribution theory can be used to define polygenic scores based on the distribution of the phenotype and predictive utility of the polygenic score. Using this approach, our study develops a summary statistic-based tool for converting polygenic scores into absolute estimates for both binary and normally distributed phenotypes. For binary phenotypes, we calculate absolute risk based on the predictive utility of the polygenic score and the population prevalence of the phenotype. For normally distributed phenotypes, we calculate predictions in absolute terms based on the predictive utility of the polygenic scores and the population mean and SD of the phenotype. In addition, we compare several approaches that estimate the predictive utility of polygenic scores using GWAS summary statistics alone, as this value is often unknown.

Another important factor reported to help accurately interpret differences in risk is the use of simple visual aids [9]. Therefore, this study also develops interactive webtools for converting an individual’s polygenic scores to the absolute scale with corresponding graphics.

Materials and methods

UK Biobank (UKB)

UKB is a prospective cohort study that recruited >500,000 individuals aged between 40 and 69 years across the United Kingdom [12]. The UKB received ethical approval from the North West—Haydock Research Ethics Committee (reference 16/NW/0274).

Phenotype data

Eleven UKB phenotypes with well-powered and independent GWAS summary statistics were selected to represent a range of genetic architectures (heritability/polygenicity). Eight phenotypes were binary: depression, type 2 diabetes, coronary artery disease (CAD), inflammatory bowel disease (IBD), rheumatoid arthritis (RheuArth), multiple sclerosis (MultiScler), breast cancer, and prostate cancer. Three phenotypes were continuous: intelligence, height, and body mass index (BMI). Further information regarding phenotype definitions can be found in the Supplementary Material.

Analysis was performed on a subset of ~50,000 UKB participants for each outcome to the reduce computational burden of subsequent analyses whilst maintaining sufficient statistical power. For each continuous trait (Intelligence, Height, BMI), a random sample ~50,000 individuals were selected. For disease traits, all available cases were included, except for high prevalence disease traits, Depression and CAD, where a random sample of 25,000 cases was selected. Controls were randomly selected to obtain a total sample size of 50,000. This sampling procedure was performed once for each phenotype. Sample sizes for each phenotype after genotype data quality control are shown in Table 1.

Table 1 Comparison between observed and estimated values on the absolute scale across polygenic score quantiles for binary and normally distributed phenotypes.

Full size table

Genetic data

UKB released imputed dosage data for 488,377 individuals and ~96 million variants, generated using IMPUTE4 software [12] with the Haplotype Reference Consortium reference panel [13] and the UK10K Consortium reference panel [14]. This study retained individuals that were of European ancestry based on 4-means clustering on the first 2 principal components provided by the UKB, had congruent genetic and self-reported sex, passed UKB test for excessive heterozygosity or missingness (variable ID = “het.missing.outliers”), and removed related individuals (>3rd degree relative, KING threshold >0.044) using relatedness kinship (KING) estimates provided by the UKB [12]. The imputed dosages were converted from BGEN v1.2 format to PLINK1 binaries (.bed/.bim/.fam) hard-call format using QCTOOL v2 (see URLs) without specifying a hard-call threshold (i.e. threshold = 0) to retain the best guess for all genotypes.

Polygenic scoring

Polygenic scores were derived within a reference-standardised framework, whereby polygenic scores are derived using a common set of genetic variants, LD estimates, and allele frequency estimates [5]. This approach is well suited for the clinical setting and is also good practice for research purposes.

SNP-level QC

HapMap 3 variants from the LD-score regression website (see Web Resources) were extracted from UKB, inserting any HapMap 3 variants that were not available in UKB as missing genotypes (as required for reference MAF imputation by the PLINK allelic scoring function) [15]. No other SNP-level QC was performed.

Individual-level QC

Individuals of European ancestry were retained for polygenic score analysis. In addition to the 4-means clustering approach described above, we determined whether UKB participants were of European ancestry using 1000 Genomes Phase 3 projected principal components of population structure, retaining only those within three SD from the mean for the top 100 principal components. This process will also remove individuals who are outliers due to technical genotyping or imputation errors.

GWAS summary statistics

Publicly available GWAS summary statistics were identified for the phenotypes as defined above, or similar phenotypes (descriptive statistics in Table S1) [2, 16,17,18,19,20,21,22,23,24,25]. We excluded GWAS with documented sample overlap with UKB. Quality control of GWAS summary statistics is described in the Supplementary Material.

Reference genotype datasets

Genotype-based scoring in UKB was reference-standardised using the European subset of 1000 Genomes Phase 3 (N = 503) as the reference. This means the GWAS summary statistics were processed for polygenic scoring using only reference genotype data to estimate LD and allele frequencies, and the resulting polygenic scores in UKB were scaled and centred according to the mean and SD of the score in the reference. The reference-standardised approach has been described in further detail previously [5].

Polygenic scoring methodology

Polygenic scoring was carried out using DBSLMM [26], which models LD between genetic variants and applies shrinkage parameters to avoid overfitting. DBSLMM is a computationally scalable method that performs similarly to other leading polygenic scoring methods [5]. For comparison with the DBSLMM polygenic score results, a threshold and clump (pT + clump) approach was also used.

pT + clump was performed using an R² threshold of 0.1 and window of 250 kb. Within the MHC region (28–34 Mb on chromosome 6), the pT + clump method retains only the single most significant variant due to long range and complex LD in this region. Ten p value thresholds were used to select variants: 1 × 10⁻⁸, 1 × 10⁻⁶, 1 × 10⁻⁴, 1 × 10⁻², 0.1, 0.2, 0.3, 0.4, 0.5 and 1. The pT + clump approach was implemented using PLINK v1.9 [15].

After preparation of GWAS summary statistics, polygenic scores were calculated using PLINK with reference MAF imputation of missing data. All scores were standardised (scaled and centred) based on the mean and SD of polygenic scores in the reference sample (European subset of 1000 Genomes Phase 3).

Converting from relative to absolute scale

Converting to the absolute scale here requires updating the distribution parameters for a phenotype, given that we observe a polygenic score within a specified range (determined by quantiles). We develop methodology to achieve this for both binary and normally distributed phenotypes, using only the distribution of the phenotype and the predictive utility of the polygenic score.

Broadly, our approach defines the population distribution for the polygenic score using a measure of its predictive utility within normal distribution theory. The polygenic score quantiles are then estimated and, using these and the phenotype and polygenic score distributions, we derived the required, updated distribution parameters for the phenotype using conditional probability rules.

For binary phenotypes, such as major depression and MultiScler, polygenic scores can be modelled as a mixture of two normal distributions, using the population prevalence of the phenotype, and the predictive utility of the polygenic scores, often indicated by the area-under-the-ROC-curve (AUC). Once this distribution has been defined, the quantiles of the polygenic scores, and the proportion of cases within each quantile is estimated. Quantile estimation for the mixture distribution requires using a root-finding algorithm; here we use the “uniroot” function in the “base” R package [27]. A full derivation of the formulae for conversion to the absolute scale is available in the Supplementary Material.

For normally distributed continuous phenotypes, such as height and IQ, polygenic scores are defined as part of a bivariate normal distribution with the phenotype, using the mean and SD of the phenotype in the general population, and the predictive utility of the polygenic scores, often indicated as the variance explained (R²). Once this distribution has been defined, the quantiles of the polygenic scores, and the phenotype mean and SD within each quantile is estimated. The mean and SD of the phenotype within each polygenic score quantile are estimated using the “mtmvnorm” function in the “tmvtnorm” R package [28]. A full derivation of the formulae for the conversion to the absolute scale is available in the Supplementary Material.

Both conversions use some assumption of normality when defining the distribution of the polygenic score, which is underpinned by the central limit theorem. We note that polygenic scores derived using the pT + clump approach will often include only a small number of genetic variants when using a stringent p value threshold and may therefore not fit a normal distribution. To determine whether the conversions are biased in this scenario, we compared absolute estimates to those observed when using pT + clump polygenic scores based on the most stringent p-values threshold available, whilst retaining at least five genetic variants.

Estimating predictive utility of polygenic scores

The pseudovalidate function of the “lassosum” R package [29] estimates the correlation (R) between the polygenic score and GWAS phenotype to identify the optimal lassosum hyper parameters (s and lambda). We have previously shown lassosum has a similar predictive utility to the DBSLMM polygenic score [5]. For continuous phenotypes, the variance explained by the polygenic scores is R². For binary phenotypes, the AUC is obtained from the correlation via calculation of the Cohen’s d, accounting for the GWAS sampling ratio [30, 31]. The conversion of R to Cohen’s d is described in the Supplementary Material.

In the Supplementary Material, we explore two other approaches for estimating the predictive utility of polygenic scores, including AVENGEME [32] and G-WIZ [33].

Validation procedure

Conversion to absolute scale

Conversion to the absolute scale for binary and continuous phenotypes was validated in UKB. For binary phenotypes, the conversion was validated by comparing the observed number of affected individuals within each polygenic score quantile to the estimated values. For continuous phenotypes, the conversion was validated by comparing the observed phenotype mean and SD within each polygenic score quantile to estimated values. When validating the conversion to the absolute scale, the observed AUC/R² of the polygenic score, and observed sampling ratio or phenotype mean and SD were used to estimate the polygenic score distribution. We also compared observed and estimated values for Height stratified by sex.

Estimation of polygenic score AUC/R ²

To validate the AUC and R² estimates derived from lassosum, we compared the estimated values to those observed in UKB. We also used the estimated AUC/R² values when converting polygenic score into absolute terms, to determine the extent to which differences between observed and estimated AUC/R² values influenced the results.

Development of interactive visualisation tool

We developed an interactive webtool for converting and visualising standardised polygenic scores to the absolute scale for binary and normally distributed phenotypes. The webtools were developed using the “shiny” package in R, and are hosted on the shinyapps.io website (see URLs). For the shiny app implementation of the absolute scale conversion, the polygenic score distribution is split into 1000 quantiles to increase the precision of the results.

Databases reporting the predictive utility of polygenic scores, such as the Polygenic Score Catalog [34] (see URLs), store a range of effect size metrics for polygenic scores of binary outcomes, including AUC, R² on the observed scale, R² on the liability scale, odds ratio per standard deviation, and Cohen’s d. To enhance the utility of the interactive webtool for binary outcomes, we allow the predictive utility of the polygenic scores to be specified using any of these metrics. Details for the conversion between odds ratio per standard deviation and Cohen’s d are provided in the Supplementary Material. Other conversions have been previously documented: AUC [31], R² on the observed scale [30], R² on the liability scale [35].

Results

Validating conversion to absolute terms

We validated the approach for converting a polygenic score into absolute terms by comparing the observed and estimated distributions of phenotypes within DBSLMM polygenic score quantiles. When the AUC/R² of the polygenic score is known, the absolute estimates were highly concordant with observed values (Table 1 and Figs. 1 and 2).

**Fig. 1: Comparison of observed and estimated probability of being a case across 20 DBSLMM polygenic score quantiles.**

**Fig. 2: Comparison of observed and estimated phenotype mean and standard deviation across 20 DBSLMM polygenic score quantiles.**

For binary phenotypes (Table 1 and Fig. 1), the mean absolute difference between the observed and estimated proportion of cases was between 12.6% for MultiScler and 1.5% for both Depression and CAD. The concordance between observed and estimated values decreased as the number of cases within UKB decreased, reflecting increased error in the observed values.

For the three continuous phenotypes (Table 1 and Fig. 2), the mean absolute difference between observed and estimated means was <0.3%. The mean absolute difference between observed and estimated phenotype standard deviations across polygenic score quantiles were 1.3% for Intelligence, 1.5% for Height, and 6% for BMI. The reduced concordance between observed and estimated values for BMI reflects the increased skewness of this phenotype in UKB.

To demonstrate the flexibility of the conversion to model absolute risk within stratified populations, we compared observed and estimated absolute values for Height within males and females separately. Again, the concordance between observed and expected values was high (Fig. S1). Given large sex differences in Height, the polygenic score R² increased when stratified by sex, and correspondingly the difference in mean Height across polygenic score quantiles was larger, and the standard deviation of Height within polygenic score quantiles was smaller.

The observed and estimated absolute values were also highly concordant when using pT + clump polygenic scores defined using stringent p value thresholds (Figs. S2 and S3). Some discrepancy between observed and estimated values was present for Depression due to the low predictive utility of the polygenic score when using the stringent p value threshold to select variants.

We have provided examples using these conversions to interpret polygenic scores for an individual (Figs. 3 and 4). For binary phenotypes, we use the example schizophrenia, with a population prevalence of 1% and a polygenic score AUC of ~0.67 [36]. If an individual has a polygenic Z-score of 1.96, they are in the 97.5th percentile of schizophrenia polygenic scores. However, given the modest AUC of the polygenic score, only 2.7% of individuals with that polygenic score will develop schizophrenia (Fig. 3). For continuous phenotypes, we use the example of intelligence quotient (IQ), with a population mean of 100 and standard deviation of 15, for which the educational attainment polygenic score has been reported to explain 10% of the variance [37]. If an individual has a polygenic Z-score of −1.96, they are in the 2.5th percentile of educational attainment polygenic scores. The mean IQ of individuals with this polygenic score is 90.7, with 95% prediction intervals from 62.8 to 118.6 (Fig. 4).

**Fig. 3: Shiny app implementing absolute scale conversion for binary phenotypes.**

**Fig. 4: Shiny app implementing absolute scale conversion for normally distributed phenotypes.**

To illustrate the full range of predictive utility of polygenic scores on the absolute scale, we have simulated results based on a range of polygenic score AUC and prevalence values for binary phenotypes, and range of polygenic scores R² values for continuous phenotypes (Figs. S4 and S5).

Validating polygenic scores AUC/R ² estimation

The lassosum estimates of AUC/R² were concordant with the observed AUC/R² of PRScs polygenic scores for most phenotypes (Table 2). The absolute difference between estimated and observed AUC values was less than 0.04 for five of the eight binary phenotypes. The absolute difference between estimated and observed R² values were 0.007, 0.015, and 0.046 for BMI, Intelligence and Height, respectively. The lassosum estimates were most discordant for the three autoimmune disorders included in this study. The estimated AUC was substantially higher than the observed for IBD (AUC diff = 0.115) and MultiScler (AUC diff = 0.128). The analysis did not complete for RheuArth, resulting in an AUC of 0.5 being returned by the analysis.

Table 2 Comparison between polygenic score AUC/R² observed in UKB and estimated using lassosum.

Full size table

To determine the extent to which using estimated AUC/R² influences the absolute estimates, we compared absolute estimates derived using lassosum estimated AUC/R², to the observed absolute values (Table 3). The results for RheuArth were highly discordant due to the incomplete analysis estimating the AUC. After excluding RheuArth, the concordance between observed and estimated absolute values remained high when using estimated AUC/R² values. For IBD and MultiScler, the phenotypes with discordant estimates of polygenic score AUC, the mean absolute percentage difference between observed-estimated proportion of cases was 38.6% and 43.7%, respectively. Discrepancies were particularly pronounced in the upper tail of the polygenic score distribution. The ratio between estimated and observed proportions of cases in the top polygenic score quantile (top 5%) was 1.83 and 2.16 for IBD and MultiScler, respectively.

Table 3 Comparison between observed and estimated case probabilities across polygenic score quantiles for binary phenotypes.

Full size table

Discussion

This study has derived and evaluated methods for converting polygenic scores for both binary and normally distributed phenotypes from a relative value, of where the polygenic score lies on the distribution, into a value on the absolute scale. For a disorder, the conversion provides an estimate of the proportion of cases within a polygenic score quantile, and for a normally distributed trait, the conversion gives the trait mean and SD within a polygenic score quantile. Comparison of absolute estimates with observed values within UKB show the method is highly accurate when the AUC (for disorders) or R² (for a trait) of the polygenic score is known. Furthermore, we show that lassosum pseudovalidate function can provide accurate estimates of AUC/R² in most instances, though inaccuracies in AUC/R² estimation can substantially bias absolute estimates in the extremes of the polygenic score distribution.

Interpretation of polygenic scores is one of the greatest barriers to their safe application to the clinical setting, and also poses a problem when polygenic scores are delivered to the individuals via direct-to-consumer genetics testing companies. Our study provides a summary statistic-based approach for converting polygenic scores into absolute terms, enabling their accurate interpretation. Although previous research has developed summary statistic-based approaches for converting polygenic scores to the absolute scale for binary outcomes [38], our study additionally provides a conversion method for normally distributed outcomes and a user-friendly webtool to improve accessibility of the methods. Furthermore, where the predictive utility of polygenic scores is unknown, we show that lassosum pseudovalidate may be used although the results should be interpreted with caution.

The predictive utility of polygenic scores is under a great deal of scrutiny. A major criticism is that they are only predictive at the group-level, i.e. for research, but are poor predictors at the individual-level. Our approach of converting polygenic scores to the absolute scale helps understand this distinction as it converts the group-level metric of predictive utility (AUC or R²) into an absolute value for an individual (risk of disorder or trait value with prediction intervals). Indeed, the absolute estimates do not vary substantially across the polygenic score distribution except at the extremes, which reflects the modest AUC/R² of the current polygenic scores. The predictive utility of polygenic scores will increase with more powerful GWAS and better resolution of the causal variant. However, these scores will always be probabilistic, and will never give deterministic predictions as genetic variation only explains part of the phenotypic variance. Polygenic scores can be combined with other risk factors to increase the accuracy of prediction, though the value of prediction also depends partly on the actions or interventions available to address the predicted outcome.

Our approach focuses on converting polygenic scores onto the absolute scale given parameters describing the distribution of the outcome and PRS predictive utility in a representative sample. Therefore, our approach can be extended to account for other factors by modifying these input parameters. We demonstrate this flexibility by converting polygenic scores to the absolute scale for Height whilst accounting for sex differences by specifying the sex-specific mean and SD of Height, and the sex-specific PRS variance explained. This same approach could be used to model other factors modifying risk, such as smoking status for CAD. Of course, this summary statistic-based approach is limited by the availability of relevant summary statistics.

We have developed an interactive webtool implementing these methods to convert polygenic scores into absolute risk or trait estimates, also providing visual aids to promote the accurate interpretation of differences in risk. The webtool accepts any of the most used polygenic score effect size metrics, facilitating the use of resources such as the Polygenic Score Catalog [34], which collate polygenic score data and corresponding prediction metrics. We also plan to implement these methods on Impute.me, a popular non-profit website providing polygenic scores for users who upload genotype data from a direct-to-consumer genetic testing company.

There are several limitations of this study. Firstly, this summary statistic-based approach relies on the input parameters being representative of the target individual’s demographic. This study focuses on UKB participants of European ancestry, and we define the distribution of the phenotype using the observed distribution within UKB. However, specifying the distribution of a phenotype and the AUC/R² of the polygenic score for a population that is representative of the target individual may be challenging. For example, assuming the phenotype distribution and polygenic score AUC/R² found in a European population will give biased results in non-European populations where differences exist in the phenotype distribution and polygenic score AUC/R² [39]. Most GWAS are performed in European populations, and the variance explained by polygenic scores is typically higher in Europeans than non-Europeans [40]. Secondly, although the lassosum pseudovalidate approach works well in most instances, it can provide inaccurate results, particularly for rare outcomes, which could lead to inaccurate absolute estimates. Further development of methods that can estimates the AUC/R² of polygenic scores more reliably without a validation sample would be useful. Alternatively, the predictive utility of the polygenic score can be specified based on resources such as the Polygenic Score Catalog [34]. To support this effort, future polygenic score risk prediction studies should follow suggested reporting standards [41]. Thirdly, for continuous phenotypes, the approach is tailored to normally distributed phenotypes. Further development of methods to account for non-normally distributed phenotypes may be useful. Finally, for binary traits or disease, we calculate life-time risks that are based on pre-specified prevalence, which do not account for the risk period an individual has already lived through or other risk factors.

In summary, this study has provided an approach for converting polygenic scores into absolute risk and predictions based on GWAS summary statistics. It establishes a framework for appropriate and accurate interpretation of polygenic scores by patients, consumers, and healthcare professionals.

URLs

Interactive webtool/shiny apps: https://opain.github.io/GenoPred/PRS_to_Abs_tool.html
LDSC HapMap 3 SNP-list: https://data.broadinstitute.org/alkesgroup/LDSCORE/w_hm3.snplist.bz2
Impute.me: https://impute.me/
GenoPred website: https://opain.github.io/GenoPred
The Polygenic Score Catalog: https://www.pgscatalog.org/
QCTOOL v2: https://www.well.ox.ac.uk/~gav/qctool_v2/index.html

Data availability

The data that support the findings of this study are available from the UK Biobank Access Management System (https://bbams.ndph.ox.ac.uk/ams/), but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. For further information on how to access UKB data, see https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access.

References

Polderman TJC, Benyamin B, de Leeuw CA, Sullivan PF, van Bochoven A, Visscher PM, et al. Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nat Genet. 2015;47:702–9. http://www.ncbi.nlm.nih.gov/pubmed/25985137.
Article CAS Google Scholar
Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46:1173.
Article CAS Google Scholar
Howard DM, Adams MJ, Clarke T-K, Hafferty JD, Gibson J, Shirali M, et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci. 2019;22:343–52.
Article CAS Google Scholar
Choi SW, Mak TS-H, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. 2020;15:1–14.
Pain O, Glanville KP, Hagenaars SP, Selzam S, Fürtjes AE, Gaspar HA, et al. Evaluation of polygenic prediction methodology within a reference-standardized framework. PLoS Genet. 2021;17:e1009021.
Article CAS Google Scholar
Khera AV, Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50:1219–24.
Article CAS Google Scholar
Lewis CM, Vassos E. Polygenic risk scores: from research tools to clinical instruments. Genome Med. 2020;12:1–11.
Article Google Scholar
Wray NR, Lin T, Austin J, McGrath JJ, Hickie IB, Murray GK, et al. From basic science to clinical application of polygenic risk scores: a primer. JAMA Psychiatry. 2021;78:101–9.
Article Google Scholar
Zipkin DA, Umscheid CA, Keating NL, Allen E, Aung K, Beyth R, et al. Evidence-based risk communication: a systematic review. Ann Intern Med. 2014;161:270–80.
Article Google Scholar
Gigerenzer G, Gaissmaier W, Kurz-Milcke E, Schwartz LM, Woloshin S. Helping doctors and patients make sense of health statistics. Psychol Sci Public Interest. 2007;8:53–96.
Article Google Scholar
Furlotte NA, Kleinman A, Smith R, Hinds D. White paper 23-12: estimating complex phenotype prevelance using predictive models. 2015.
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–9.
Article CAS Google Scholar
McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet. 2016;48:1279–83.
UK10K Consortium. The UK10K project identifies rare variants in health and disease. Nature. 2015;526:82–90.
Article Google Scholar
Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:1.
Article Google Scholar
Wray NR, Sullivan PF. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. bioRxiv. 2017. https://doi.org/10.1038/s41588-018-0090-3.
Rietveld CA, Medland SE, Derringer J, Yang J, Esko T, Martin NW, et al. GWAS of 126,559 individuals identifies genetic variants associated with educational attainment. Science. 2013;340:1235488.
Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518:197–206.
Article CAS Google Scholar
Scott RA, Scott LJ, Mägi R, Marullo L, Gaulton KJ, Kaakinen M, et al. An expanded genome-wide association study of type 2 diabetes in Europeans. Diabetes. 2017;66:2888–902.
Article CAS Google Scholar
Nikpay M, Goel A, Won H-H, Hall LM, Willenborg C, Kanoni S, et al. A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 2015;47:1121.
Article CAS Google Scholar
Liu JZ, Van Sommeren S, Huang H, Ng SC, Alberts R, Takahashi A, et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet. 2015;47:979–86.
Article CAS Google Scholar
Sawcer S, Hellenthal G, Pirinen M, Spencer CCA, Patsopoulos NA, Moutsianas L, et al. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature. 2011;476:214.
Article CAS Google Scholar
Okada Y, Wu D, Trynka G, Raj T, Terao C, Ikari K, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 2014;506:376–81.
Article CAS Google Scholar
Michailidou K, Lindström S, Dennis J, Beesley J, Hui S, Kar S, et al. Association analysis identifies 65 new breast cancer risk loci. Nature. 2017;551:92–4.
Article Google Scholar
Schumacher FR, Al Olama AA, Berndt SI, Benlloch S, Ahmed M, Saunders EJ, et al. Association analyses of more than 140,000 men identify 63 new prostate cancer susceptibility loci. Nat Genet. 2018;50:928–36.
Article CAS Google Scholar
Yang S, Zhou X. Accurate and scalable construction of polygenic scores in large biobank data sets. Am J Hum Genet. 2020;106:679–93.
Article CAS Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna, Austria: R Core Team; 2015. http://www.r-project.org.
Wilhelm S, Manjunath GB. tmvtnorm: Truncated Multivariate Normal and Student t Distribution. 2015.
Mak TSH, Porsch RM, Choi SW, Zhou X, Sham PC. Polygenic scores via penalized regression on summary statistics. Genet Epidemiol. 2017;41:469–80.
Article Google Scholar
Aaron B, Kromrey JD, Ferron J. Equating “r”-based and “d”-based effect size indices: problems with a commonly recommended formula. ERIC Clearinghouse. 1998.
Rice ME, Harris GT. Comparing effect sizes in follow-up studies: ROC Area, Cohen’s d, and r. Law Hum Behav. 2005;29:615.
Article Google Scholar
Palla L, Dudbridge F. A fast method that uses polygenic scores to estimate the variance explained by genome-wide marker panels and the proportion of variants affecting a trait. Am J Hum Genet. 2015;97:250–9.
Article CAS Google Scholar
Patron J, Serra-Cayuela A, Han B, Li C, Wishart DS. Assessing the performance of genome-wide association studies for predicting disease risk. PLoS ONE. 2019;14:e0220215.
Article CAS Google Scholar
Lambert SA, Gil L, Jupp S, Ritchie SC, Xu Y, Buniello A, et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat Genet. 2021;53:1–6.
Lee SH, Goddard ME, Wray NR, Visscher PM. A better coefficient of determination for genetic profile analysis. Genet Epidemiol. 2012;36:214–24.
Article Google Scholar
Pardiñas AF, Holmans P, Pocklington AJ, Escott-Price V, Ripke S, Carrera N, et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat Genet. 2018;50:381.
Article Google Scholar
Allegrini AG, Selzam S, Rimfeld K, von Stumm S, Pingault J-B, Plomin R. Genomic prediction of cognitive traits in childhood and adolescence. Mol Psychiatry. 2019;24:819–27.
Article CAS Google Scholar
Lello L, Raben TG, Yong SY, Tellier LCAM, Hsu SDH.Genomic prediction of 16 complex disease risks including heart attack, diabetes, breast and prostate cancer. Sci Rep.2019;9:1–16.
Google Scholar
Wood TR, Owens N. Using synthetic datasets to bridge the gap between the promise and reality of basing health-related decisions on common single nucleotide polymorphisms. F1000Research. 2020;8:2147.
Article Google Scholar
Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51:584–91.
Article CAS Google Scholar
Wand H, Lambert SA, Tamburro C, Iacocca MA, O’Sullivan JW, Sillari C, et al. Improving reporting standards for polygenic scores in risk prediction studies. Nature. 2021;591:211–9.
Article CAS Google Scholar

Download references

Acknowledgements

The authors acknowledge use of the research computing facility at King’s College London, Rosalind (https://rosalind.kcl.ac.uk), which is delivered in partnership with the NIHR Maudsley BRC, and part-funded by capital equipment grants from the Maudsley Charity (award 980) and Guy’s & St. Thomas’ Charity (TR130505). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care. UKB: This research was conducted under UK Biobank application 18177.

Funding

This paper represents independent research funded by the UK Medical Research Council (MR/N015746/1), and the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London.

Author information

These authors contributed equally: Oliver Pain, Alexandra C. Gillett.

Authors and Affiliations

Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Oliver Pain, Alexandra C. Gillett & Cathryn M. Lewis
NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, SE5 8AF, UK
Oliver Pain & Cathryn M. Lewis
Department of Psychiatry and Medical Genetics, University of British Columbia, Vancouver, BC, Canada
Jehannine C. Austin
Danish National Genome Center, Copenhagen, Denmark
Lasse Folkersen
Department of Medical and Molecular Genetics, Faculty of Life Sciences and Medicine, King’s College London, London, UK
Cathryn M. Lewis

Authors

Oliver Pain
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra C. Gillett
View author publications
You can also search for this author in PubMed Google Scholar
Jehannine C. Austin
View author publications
You can also search for this author in PubMed Google Scholar
Lasse Folkersen
View author publications
You can also search for this author in PubMed Google Scholar
Cathryn M. Lewis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

OP and ACG worked together to conceive the study design, perform analyses, develop methodology and draft the manuscript. CML provided extensive feedback regarding the study design, methodology, and manuscript. JCA and LF provided feedback regarding the manuscript and webtool.

Corresponding author

Correspondence to Oliver Pain.

Ethics declarations

Competing interests

CML sits on the Myriad Neuroscience Scientific Advisory Board. The other authors declare no competing interests.

Ethical approval

This study used data from the UKB, which received ethical approval from the North West—Haydock Research Ethics Committee (reference 16/NW/0274).

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Text and Figures

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pain, O., Gillett, A.C., Austin, J.C. et al. A tool for translating polygenic scores onto the absolute scale using summary statistics. Eur J Hum Genet 30, 339–348 (2022). https://doi.org/10.1038/s41431-021-01028-z

Download citation

Received: 16 June 2021
Revised: 16 September 2021
Accepted: 09 December 2021
Published: 04 January 2022
Issue Date: March 2022
DOI: https://doi.org/10.1038/s41431-021-01028-z

This article is cited by

Principles and methods for transferring polygenic risk scores across global populations
- Linda Kachuri
- Nilanjan Chatterjee
- Tian Ge
Nature Reviews Genetics (2024)
Polygenic predisposition, sleep duration, and depression: evidence from a prospective population-based cohort
- Odessa S. Hamilton
- Andrew Steptoe
- Olesya Ajnakina
Translational Psychiatry (2023)
A simulation study for multifactorial genetic disorders to quantify the impact of polygenic risk scores on critical illness insurance
- Jinbo Zhao
- Michael Salter-Townshend
- Adrian O’Hagan
European Actuarial Journal (2023)
Patient and provider perspectives on polygenic risk scores: implications for clinical reporting and utilization
- Anna C. F. Lewis
- Emma F. Perez
- Elizabeth W. Karlson
Genome Medicine (2022)
Good genotype-phenotype relationships in rare disease are hard to find
- Alisdair McNeill
European Journal of Human Genetics (2022)

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

UK Biobank (UKB)

Phenotype data

Genetic data

Polygenic scoring

SNP-level QC

Individual-level QC

GWAS summary statistics

Reference genotype datasets

Polygenic scoring methodology

Converting from relative to absolute scale

Estimating predictive utility of polygenic scores

Validation procedure

Conversion to absolute scale

Estimation of polygenic score AUC/R 2

Development of interactive visualisation tool

Results

Validating conversion to absolute terms

Validating polygenic scores AUC/R 2 estimation

Discussion

URLs

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links

Estimation of polygenic score AUC/R ²

Validating polygenic scores AUC/R ² estimation