ABSTRACT
Uncovering genotype-phenotype relationships is a fundamental challenge in genomics. Gene prioritization is an important step for this endeavor to make a short manageable list from a list of thousands of genes coming from high-throughput studies. Network propagation methods are promising and state of the art methods for gene prioritization based on the premise that functionally-related genes tend to be close to each other in the biological networks.
In this study, we present PhenoGeneRanker, an improved version of a recently developed network propagation method called Random Walk with Restart on Multiplex Heterogeneous Networks (RWR-MH). PhenoGeneRanker allows multi-layer gene and disease networks. It also calculates empirical p-values of gene ranking using random stratified sampling of genes based on their connectivity degree in the network.
We ran PhenoGeneRanker using multi-omics datasets of rice to effectively prioritize the cold tolerance-related genes. We observed that top genes selected by PhenoGeneRanker were enriched in cold tolerance-related Gene Ontology (GO) terms whereas bottom ranked genes were enriched in general GO terms only. We also observed that top-ranked genes exhibited significant p-values suggesting that their rankings were independent of their degree in the network.
CCS CONCEPTS • Bioinformatics • Biological networks • System biology • Computational genomics
Availability and implementation The source code is available on GitHub at https://github.com/bozdaglab/PhenoGeneRanker under Creative Commons Attribution 4.0 license
Contact cdursun{at}mcw.edu or serdar.bozdag{at}marquette.edu
Footnotes
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).