Abstract
Downstream analysis of omics data requires interpreting many molecular components in light of current biological knowledge. Most tools currently used for functional enrichment analysis in proteomics are either borrowed directly from genomics workflows or adapted from them to accommodate proteomics data. As proteomics data analytics and molecular annotation coverage continue to evolve, one can expect enhanced databases to emerge, with less redundant ontologies spanning many branches of the tree of life. The methodology described here shows, in practical steps, how to perform overrepresentation analysis (ORA), functional class scoring (FCS), and pathway-topology (PT) analysis using a preexisting neurological proteomics dataset.
Acknowledgments
HH is supported by a grant from Highlands & Islands Enterprise.
Copyright information
© 2021 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Fernandes, M., Husi, H. (2021). ORA, FCS, and PT Strategies in Functional Enrichment Analysis. In: Cecconi, D. (eds) Proteomics Data Analysis. Methods in Molecular Biology, vol 2361. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1641-3_10
DOI: https://doi.org/10.1007/978-1-0716-1641-3_10
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1640-6
Online ISBN: 978-1-0716-1641-3
eBook Packages: Springer Protocols