Predicting ExWAS findings from GWAS data: a shorter path to causal genes

Liang, Kevin Y. H.; Farjoun, Yossi; Forgetta, Vincenzo; Chen, Yiheng; Yoshiji, Satoshi; Lu, Tianyuan; Richards, J. Brent

doi:10.1007/s00439-023-02548-y

Predicting ExWAS findings from GWAS data: a shorter path to causal genes

Original Investigation
Published: 02 April 2023

Volume 142, pages 749–758, (2023)
Cite this article

Human Genetics Aims and scope Submit manuscript

Kevin Y. H. Liang^1,2,
Yossi Farjoun^1,7,8,9,
Vincenzo Forgetta^1,7,
Yiheng Chen^1,3,
Satoshi Yoshiji^1,3,10,11,
Tianyuan Lu^1,2,7 &
…
J. Brent Richards ORCID: orcid.org/0000-0002-3746-9086^{1,2,3,4,5,6,7}

1153 Accesses
2 Citations
10 Altmetric
Explore all metrics

Abstract

GWAS has identified thousands of loci associated with disease, yet the causal genes within these loci remain largely unknown. Identifying these causal genes would enable deeper understanding of the disease and assist in genetics-based drug development. Exome-wide association studies (ExWAS) are more expensive but can pinpoint causal genes offering high-yield drug targets, yet suffer from a high false-negative rate. Several algorithms have been developed to prioritize genes at GWAS loci, such as the Effector Index (Ei), Locus-2-Gene (L2G), Polygenic Prioritization score (PoPs), and Activity-by-Contact score (ABC) and it is not known if these algorithms can predict ExWAS findings from GWAS data. However, if this were the case, thousands of associated GWAS loci could potentially be resolved to causal genes. Here, we quantified the performance of these algorithms by evaluating their ability to identify ExWAS significant genes for nine traits. We found that Ei, L2G, and PoPs can identify ExWAS significant genes with high areas under the precision recall curve (Ei: 0.52, L2G: 0.37, PoPs: 0.18, ABC: 0.14). Furthermore, we found that for every unit increase in the normalized scores, there was an associated 1.3–4.6-fold increase in the odds of a gene reaching exome-wide significance (Ei: 4.6, L2G: 2.5, PoPs: 2.1, ABC: 1.3). Overall, we found that Ei, L2G, and PoPs can anticipate ExWAS findings from widely available GWAS results. These techniques are therefore promising when well-powered ExWAS data are not readily available and can be used to anticipate ExWAS findings, allowing for prioritization of genes at GWAS loci.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci

Article 28 October 2021

An effector index to predict target genes at GWAS loci

Article 11 February 2022

Genepanel.iobio - an easy to use web tool for generating disease- and phenotype-associated gene lists

Article Open access 11 December 2019

Data availability

Source code can be accessed through Github upon publication.

References

Auer PL, Lettre G (2015) Rare variant association studies: considerations, challenges and opportunities. Genome Med 7:16. https://doi.org/10.1186/s13073-015-0138-2
Article PubMed PubMed Central Google Scholar
Backman JD, Li AH, Marcketta A et al (2021) Exome sequencing and analysis of 454,787 UK Biobank participants. Nature. https://doi.org/10.1038/s41586-021-04103-z
Article PubMed PubMed Central Google Scholar
Boyle EA, Li YI, Pritchard JK (2017) An expanded view of complex traits: from polygenic to omnigenic. Cell 169:1177–1186. https://doi.org/10.1016/j.cell.2017.05.038
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan BK, Loh P-R, Finucane HK et al (2015) LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 47:291–295. https://doi.org/10.1038/ng.3211
Article CAS PubMed PubMed Central Google Scholar
Butcher SP (2003) Target discovery and validation in the post-genomic era. Neurochem Res 28:367–371. https://doi.org/10.1023/A:1022349805831
Article CAS PubMed Google Scholar
Carvalho-Silva D, Pierleoni A, Pignatelli M et al (2019) Open targets platform: new developments and updates two years on. Nucleic Acids Res 47:D1056–D1065. https://doi.org/10.1093/nar/gky1133
Article CAS PubMed Google Scholar
Curtis D (2019) A weighted burden test using logistic regression for integrated analysis of sequence variants, copy number variants and polygenic risk score. Eur J Hum Genet 27:114–124. https://doi.org/10.1038/s41431-018-0272-6
Article CAS PubMed Google Scholar
de Leeuw CA, Mooij JM, Heskes T, Posthuma D (2015) MAGMA: generalized gene-set analysis of GWAS data. PLOS Comput Biol 11:4219. https://doi.org/10.1371/journal.pcbi.1004219
Article CAS Google Scholar
Dunn OJ (1961) Multiple comparisons among means. J Am Stat Assoc 56:52–64
Article Google Scholar
Edwards SL, Beesley J, French JD, Dunning AM (2013) Beyond GWASs: illuminating the dark road from association to function. Am J Hum Genet 93:779–797. https://doi.org/10.1016/j.ajhg.2013.10.012
Article CAS PubMed PubMed Central Google Scholar
Forgetta V, Jiang L, Vulpescu NA et al (2022) An effector index to predict target genes at GWAS loci. Hum Genet. https://doi.org/10.1007/s00439-022-02434-z
Article PubMed Google Scholar
Fulco CP, Nasser J, Jones TR et al (2019) Activity-by-contact model of enhancer–promoter regulation from thousands of CRISPR perturbations. Nat Genet 51:1664–1669. https://doi.org/10.1038/s41588-019-0538-0
Article CAS PubMed PubMed Central Google Scholar
Gazal S, Weissbrod O, Hormozdiari F et al (2022) Combining SNP-to-gene linking strategies to identify disease genes and assess disease omnigenicity. Nat Genet. https://doi.org/10.1038/s41588-022-01087-y
Article PubMed PubMed Central Google Scholar
Ghoussaini M, Mountjoy E, Carmona M et al (2021) Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic Acids Res 49:D1311–D1320. https://doi.org/10.1093/nar/gkaa840
Article CAS PubMed Google Scholar
Hrdlickova B, de Almeida RC, Borek Z, Withoff S (2014) Genetic variation in the non-coding genome: involvement of micro-RNAs and long non-coding RNAs in disease. Biochim Biophys Acta BBA 1842:1910–1922. https://doi.org/10.1016/j.bbadis.2014.03.011
Article CAS PubMed Google Scholar
Karczewski KJ, Solomonson M, Chao KR et al (2022) Systematic single-variant and gene-based association testing of thousands of phenotypes in 426,370 UK Biobank exomes. Medrxiv. https://doi.org/10.1101/2021.06.19.21259117
Article Google Scholar
Kemp JP, Morris JA, Medina-Gomez C et al (2017) Identification of 153 new loci associated with heel bone mineral density and functional involvement of GPC6 in osteoporosis. Nat Genet 49:1468–1475. https://doi.org/10.1038/ng.3949
Article CAS PubMed PubMed Central Google Scholar
King EA, Davis JW, Degner JF (2019) Are drug targets with genetic support twice as likely to be approved? Revised estimates of the impact of genetic support for drug mechanisms on the probability of drug approval. PLOS Genet 15:e1008489. https://doi.org/10.1371/journal.pgen.1008489
Article CAS PubMed PubMed Central Google Scholar
Lee S, Emond MJ, Bamshad MJ et al (2012) Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. Am J Hum Genet 91:224–237. https://doi.org/10.1016/j.ajhg.2012.06.007
Article CAS PubMed PubMed Central Google Scholar
Lindsay MA (2003) Target discovery. Nat Rev Drug Discov 2:831–838. https://doi.org/10.1038/nrd1202
Article CAS PubMed Google Scholar
Mahajan A, Taliun D, Thurner M et al (2018) Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat Genet 50:1505–1513. https://doi.org/10.1038/s41588-018-0241-6
Article CAS PubMed PubMed Central Google Scholar
Mirza AH, Kaur S, Brorsson CA, Pociot F (2014) Effects of GWAS-associated genetic variants on lncRNAs within IBD and T1D candidate loci. PLoS ONE 9:e105723. https://doi.org/10.1371/journal.pone.0105723
Article CAS PubMed PubMed Central Google Scholar
Morris JA, Kemp JP, Youlten SE et al (2019) An atlas of genetic influences on osteoporosis in humans and mice. Nat Genet 51:258–266. https://doi.org/10.1038/s41588-018-0302-x
Article CAS PubMed Google Scholar
Mountjoy E, Schmidt EM, Carmona M et al (2021) An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat Genet 53:1527–1533. https://doi.org/10.1038/s41588-021-00945-5
Article CAS PubMed PubMed Central Google Scholar
Nasser J, Bergman DT, Fulco CP et al (2021) Genome-wide enhancer maps link risk variants to disease genes. Nature 593:238–243. https://doi.org/10.1038/s41586-021-03446-x
Article CAS PubMed PubMed Central Google Scholar
Nelson MR, Tipney H, Painter JL et al (2015) The support of human genetic evidence for approved drug indications. Nat Genet 47:856–860. https://doi.org/10.1038/ng.3314
Article CAS PubMed Google Scholar
Nicolae DL, Gamazon E, Zhang W et al (2010) Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet 6:e1000888. https://doi.org/10.1371/journal.pgen.1000888
Article CAS PubMed PubMed Central Google Scholar
Ochoa D, Karim M, Ghoussaini M et al (2022) Human genetics evidence supports two-thirds of the 2021 FDA-approved drugs. Nat Rev Drug Discov. https://doi.org/10.1038/d41573-022-00120-3
Article PubMed Google Scholar
Paul SM, Mytelka DS, Dunwiddie CT et al (2010) How to improve R&D productivity: the pharmaceutical industry’s grand challenge. Nat Rev Drug Discov 9:203–214. https://doi.org/10.1038/nrd3078
Article CAS PubMed Google Scholar
Purcell S, Neale B, Todd-Brown K et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575. https://doi.org/10.1086/519795
Article CAS PubMed PubMed Central Google Scholar
Schriml LM, Mitraka E, Munro J et al (2019) Human Disease Ontology 2018 update: classification, content and workflow expansion. Nucleic Acids Res. https://doi.org/10.1093/nar/gky1032
Article PubMed Google Scholar
Seyhan AA (2019) Lost in translation: the valley of death across preclinical and clinical divide identification of problems and overcoming obstacles. Transl Med Commun. https://doi.org/10.1186/s41231-019-0050-7
Article Google Scholar
Stranger BE, Nica AC, Forrest MS et al (2007) Population genomics of human gene expression. Nat Genet 39:1217–1224. https://doi.org/10.1038/ng2142
Article CAS PubMed PubMed Central Google Scholar
Wang Q, Dhindsa RS, Carss K et al (2021) Rare variant contribution to human disease in 281,104 UK Biobank exomes. Nature 597:527–532. https://doi.org/10.1038/s41586-021-03855-y
Article CAS PubMed PubMed Central Google Scholar
Weeks EM, Ulirsch JC, Cheng NY et al (2020) Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases. MedRxiv. https://doi.org/10.1101/2020.09.08.20190561
Article Google Scholar
Wishart DS, Feunang YD, Guo AC et al (2018) DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. https://doi.org/10.1093/nar/gkx1037
Article PubMed PubMed Central Google Scholar
Xu Y, Li Z (2020) CRISPR-Cas systems: overview, innovations and applications in human disease research and gene therapy. Comput Struct Biotechnol J 18:2401–2415. https://doi.org/10.1016/j.csbj.2020.08.031
Article CAS PubMed PubMed Central Google Scholar
Zhang F, Lupski JR (2015) Non-coding genetic variants in human disease. Hum Mol Genet 24:R102–R110. https://doi.org/10.1093/hmg/ddv259
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

The Richards research group is supported by the Canadian Institutes of Health Research (CIHR: 365825; 409511, 100558, 169303), the McGill Interdisciplinary Initiative in Infection and Immunity (MI4), the Lady Davis Institute of the Jewish General Hospital, the Jewish General Hospital Foundation, the Canadian Foundation for Innovation, the NIH Foundation, Cancer Research UK, Genome Québec, the Public Health Agency of Canada, McGill University, Cancer Research UK [grant umber C18281/A29019] and the Fonds de Recherche Québec Santé (FRQS). JBR is supported by a FRQS Mérite Clinical Research Scholarship. Support from Calcul Québec and Compute Canada is acknowledged. TwinsUK is funded by the Welcome Trust, Medical Research Council, European Union, the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. These funding agencies had no role in the design, implementation or interpretation of this study.

Author information

Authors and Affiliations

Lady Davis Institute for Medical Research, Jewish General Hospital, Montréal, QC, H3T 1E2, Canada
Kevin Y. H. Liang, Yossi Farjoun, Vincenzo Forgetta, Yiheng Chen, Satoshi Yoshiji, Tianyuan Lu & J. Brent Richards
Quantitative Life Sciences Program, McGill University, Montréal, QC, H3A 0G4, Canada
Kevin Y. H. Liang, Tianyuan Lu & J. Brent Richards
Department of Human Genetics, McGill University, Montréal, QC, H3A 0G4, Canada
Yiheng Chen, Satoshi Yoshiji & J. Brent Richards
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, QC, H3A 0G4, Canada
J. Brent Richards
Department of Medicine, McGill University, Montréal, QC, H3A 0G4, Canada
J. Brent Richards
Department of Twin Research, King’s College London, London, UK
J. Brent Richards
5 Prime Sciences Incorporated, Montréal, Canada
Yossi Farjoun, Vincenzo Forgetta, Tianyuan Lu & J. Brent Richards
Broad Institute, Cambridge, MA, 02142, USA
Yossi Farjoun
Fulcrum Genomics LLC, Boulder, CO, 80302, USA
Yossi Farjoun
Kyoto-McGill International Collaborative School in Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Satoshi Yoshiji
Japan Society for the Promotion of Science, Tokyo, Japan
Satoshi Yoshiji

Authors

Kevin Y. H. Liang
View author publications
You can also search for this author in PubMed Google Scholar
Yossi Farjoun
View author publications
You can also search for this author in PubMed Google Scholar
Vincenzo Forgetta
View author publications
You can also search for this author in PubMed Google Scholar
Yiheng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Yoshiji
View author publications
You can also search for this author in PubMed Google Scholar
Tianyuan Lu
View author publications
You can also search for this author in PubMed Google Scholar
J. Brent Richards
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception and design: KL and JBR. Data analyses: KL, YF, and VF. Manuscript writing: KL, YF, YC, SY, and JBR. Supervision: JBR. Interpretation of data: all authors. All authors were involved in the preparation and revision of the manuscript.

Corresponding author

Correspondence to J. Brent Richards.

Ethics declarations

Conflict of interest

JBR’s institution has received investigator-initiated grant funding from Eli Lilly, GlaxoSmithKline and Biogen for projects unrelated to this research. JBR is the CEO of 5 Prime Sciences (www.5primesciences.com), which provides research services for biotech, pharma and venture capital companies for projects unrelated to this research. VF, YF, and TL are employees of 5 Prime Sciences. Authors KYHL, YC, SY declares that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

This article does not contain any studies with human participants.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 721 KB)

Supplementary file2 (XLSX 51 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Liang, K.Y.H., Farjoun, Y., Forgetta, V. et al. Predicting ExWAS findings from GWAS data: a shorter path to causal genes. Hum Genet 142, 749–758 (2023). https://doi.org/10.1007/s00439-023-02548-y

Download citation

Received: 02 November 2022
Accepted: 22 March 2023
Published: 02 April 2023
Issue Date: June 2023
DOI: https://doi.org/10.1007/s00439-023-02548-y

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Predicting ExWAS findings from GWAS data: a shorter path to causal genes

Abstract

Access this article

Similar content being viewed by others

An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci

An effector index to predict target genes at GWAS loci

Genepanel.iobio - an easy to use web tool for generating disease- and phenotype-associated gene lists

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 721 KB)

Supplementary file2 (XLSX 51 KB)

Rights and permissions

About this article

Cite this article

Navigation

Predicting ExWAS findings from GWAS data: a shorter path to causal genes

Abstract

Access this article

Similar content being viewed by others

An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci

An effector index to predict target genes at GWAS loci

Genepanel.iobio - an easy to use web tool for generating disease- and phenotype-associated gene lists

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 721 KB)

Supplementary file2 (XLSX 51 KB)

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation