Abstract
As practitioners, we aim to provide a consolidated introduction of tidy data science along with routine packages for relational data representation and interpretation, with the focus on analytics related to human genetic interactions. We describe three showcases (also made available at https://23verse.github.io/gini), all done so via the R one-liner, in this chapter defined as a sequential pipeline of elementary functions chained together achieving a complex task. We guide the readers through step-by-step instructions on (case 1) performing network module analysis of genetic interactions, followed by visualization and interpretation; (case 2) implementing a practical strategy of how to identify and interpret tissue-specific genetic interactions; and (case 3) carrying out interaction-based tissue clustering and differential interaction analysis. All showcases demonstrate simplistic beauty and efficient nature of this analytics. We anticipate that mastering a dozen of one-liners to efficiently interpret genetic interactions is very timely now; opportunities for computational translational research are arising for data scientists to harness therapeutic potential of human genetic interaction data that are ever-increasingly available.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Domingo J, Baeza-Centurion P, Lehner B (2019) The causes and consequences of genetic interactions (epistasis). Annu Rev Genomics Hum Genet 20:433–460. https://doi.org/10.1146/annurev-genom-083118-014857
Costanzo M, Kuzmin E, van Leeuwen J et al (2019) Global genetic networks and the genotype-to-phenotype relationship. Cell 177:85–100. https://doi.org/10.1016/j.cell.2019.01.033
Norman TM, Horlbeck MA, Replogle JM et al (2019) Exploring genetic interaction manifolds constructed from rich single-cell phenotypes. Science 365:786–793. https://doi.org/10.1126/science.aax4438
Huang A, Garraway LA, Ashworth A, Weber B (2020) Synthetic lethality as an engine for cancer drug target discovery. Nat Rev Drug Discov 19:23–37. https://doi.org/10.1038/s41573-019-0046-z
Oughtred R, Stark C, Breitkreutz BJ et al (2019) The BioGRID interaction database: 2019 update. Nucleic Acids Res 47:D529–D541. https://doi.org/10.1093/nar/gky1079
Huber W, Carey VJ, Gentleman R et al (2015) Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods 12:115–121. https://doi.org/10.1038/nmeth.3252
Wickham Rstudio H (2014) Tidy data. J Stat Softw 59:1–23. https://doi.org/10.18637/jss.v059.i10
Wickham H, Averick M, Bryan J et al (2019) Welcome to the tidyverse. J Open Source Softw 4:1686. https://doi.org/10.21105/joss.01686
Csardi G, Nepusz T (2006) The igraph software package for complex network research. Int J Complex Syst 1695:1695
The Genotype Tissue Expression Consortium (2019) The GTEx Consortium atlas of genetic regulatory effects across human tissues. bioRxiv. https://doi.org/10.1101/787903
Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008:P10008. https://doi.org/10.1088/1742-5468/2008/10/P10008
Kamada T (1989) An algorithm for drawing general undirected graphs. Inf Process Lett 31:7–15. https://doi.org/10.1142/9789814434478_0005
Witten TA, Sander LM (1981) Diffusion-limited aggregation, a kinetic critical phenomenon. Phys Rev Lett 47:1400–1403. https://doi.org/10.1103/PhysRevLett.47.1400
Fang H, Knezevic B, Burnham KL, Knight JC (2016) XGR software for enhanced interpretation of genomic summary data, illustrated by application to immunological traits. Genome Med 8:129. https://doi.org/10.1186/s13073-016-0384-y
Fang H, Gough J (2014) The “dnet” approach promotes emerging research on cancer patient survival. Genome Med 6:64. https://doi.org/10.1186/s13073-014-0064-8
Ritchie ME, Phipson B, Wu D et al (2015) Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43:e47. https://doi.org/10.1093/nar/gkv007
Acknowledgement
HF is supported by the Program for Professor of Special Appointment (Eastern Scholar) at Shanghai Institutions of Higher Learning.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Jiang, L., Fang, H. (2021). Genetic Interaction Network Interpretation: A Tidy Data Science Perspective. In: Wong, KC. (eds) Epistasis. Methods in Molecular Biology, vol 2212. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-0947-7_22
Download citation
DOI: https://doi.org/10.1007/978-1-0716-0947-7_22
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-0946-0
Online ISBN: 978-1-0716-0947-7
eBook Packages: Springer Protocols