Abstract
Molecular interaction databases can be used to study the evolution of molecular pathways across species. Querying such pathways is a challenging computational problem, and recent efforts have been limited to simple queries (paths) or simple networks (forests). In this paper, we significantly extend the class of pathways that can be efficiently queried to trees and graphs of bounded treewidth. Our algorithm identifies non-exact (homeomorphic) matches by exploiting the color-coding technique of Alon et al. We implement a tool for tree queries, called QNet, and test its retrieval properties in simulations and on real network data. We show that QNet processes queries with up to 9 proteins in seconds on current networks and outperforms sequence-based searches. We also use QNet to perform the first large-scale cross-species comparison of protein complexes, by querying known yeast complexes against a fly protein interaction network. This comparison points to strong conservation between the two species and underscores the importance of our tool in mining protein interaction networks.
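To make the color-coding idea concrete, the following is a minimal sketch of the basic technique of Alon et al. applied to the simplest case: deciding whether an undirected graph contains a simple path on k vertices. It is not QNet's implementation, which handles tree queries, homeomorphic matches and scoring. The function names, the adjacency-dictionary input, and the `trials` and `seed` parameters are assumptions made for illustration only.

```python
import random


def colorful_path_exists(adj, k, coloring):
    """Return True if some path on k vertices uses k distinct colors.

    adj maps each vertex to an iterable of neighbours; coloring maps each
    vertex to a color in range(k). Color sets are stored as bitmasks.
    """
    # reachable[v] holds the color sets of colorful paths that end at v.
    reachable = {v: {1 << coloring[v]} for v in adj}
    for _ in range(k - 1):
        extended = {v: set() for v in adj}
        for u in adj:
            for mask in reachable[u]:
                for v in adj[u]:
                    bit = 1 << coloring[v]
                    # Extend only with a color not yet on the path, which
                    # also guarantees that the path stays simple.
                    if not mask & bit:
                        extended[v].add(mask | bit)
        reachable = extended
    return any(reachable[v] for v in adj)


def has_simple_path(adj, k, trials=200, seed=0):
    """Randomized test for a simple path on k vertices.

    A True answer is always correct. A False answer can be wrong, with a
    probability that shrinks as the number of trials grows.
    """
    rng = random.Random(seed)
    for _ in range(trials):
        coloring = {v: rng.randrange(k) for v in adj}
        if colorful_path_exists(adj, k, coloring):
            return True
    return False
```

A fixed path on k vertices receives k distinct colors with probability k!/k^k in a single random coloring, so repeating the coloring drives the chance of missing an existing path toward zero, while the dynamic program over color sets avoids enumerating all vertex subsets.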
References
Alon, N., Yuster, R., Zwick, U.: Color-coding. Journal of the ACM 42(4), 844–856 (1995)
Ashburner, M., et al.: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genetics 25, 25–29 (2000)
Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B. 57, 289–300 (1995)
Berg, J., Lassig, M., Wagner, A.: Structure and evolution of protein interaction networks: A statistical model for link dynamics and gene duplications. BMC Evolutionary Biology 4, 51 (2001)
Dent, P., Yacoub, A., Fisher, P.B., Hagan, M.P., Grant, S.: MAPK pathways in radiation responses. Oncogene 22(37), 5885–5896 (2003)
Sohler, F., Zimmer, R.: Identifying active transcription factors and kinases from expression data using pathway queries. Bioinformatics 21(Suppl. 2), ii115–ii122 (2005)
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Co., San Francisco (1979)
Guldener, U., Munsterkotter, M., Oesterheld, M., Pagel, P., Ruepp, A., Mewes, H.-W., Stumpflen, V.: MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res. 34(Database issue), 436–441 (2006)
Hirsh, E., Sharan, R.: Identification of conserved protein complexes based on a model of protein network evolution. In: Fifth European Conference on Computational Biology (ECCB’06) (to appear, 2006)
Ito, T., Chiba, T., Yoshida, M.: Exploring the yeast protein interactome using comprehensive two-hybrid projects. Trends Biotechnology 19, 23–27 (2001)
Kanehisa, M., Goto, S., Kawashima, S., Okuno, Y., Hattori, M.: The KEGG resource for deciphering the genome. Nucleic Acids Res. 32(Database issue), 277–280 (2004)
Kelley, B.P., Sharan, R., Karp, R.M., Sittler, T., Root, D.E., Stockwell, B.R., Ideker, T.: Conserved pathways within bacteria and yeast as revealed by global protein network alignment. Proc. Natl. Acad. Sci. USA 100(20), 11394–11399 (2003)
Kloks, T.: Treewidth: computations and approximations. Springer, Heidelberg (1994)
Mann, M., Hendrickson, R., Pandey, A.: Analysis of proteins and proteomes by mass spectrometry. Annu. Rev. Biochem. 70, 437–473 (2001)
Mewes, H.W., Frishman, D., Mayer, K.F., Munsterkotter, M., Noubibou, O., Pagel, P., Rattei, T., Oesterheld, M., Ruepp, A., Stumpflen, V.: MIPS: analysis and annotation of proteins from whole genomes in 2005. Nucleic Acids Res. 34(Database issue), 169–172 (2006)
Pinter, R.Y., Rokhlenko, O., Yeger-Lotem, E., Ziv-Ukelson, M.: Alignment of metabolic pathways. Bioinformatics 21(16), 3401–3408 (2005)
Shlomi, T., Segal, D., Ruppin, E., Sharan, R.: QPath: A Method for Querying Pathways in a Protein-Protein Interaction Network. BMC Bioinformatics 7, 199 (2006)
Stanyon, C.A., Liu, G., Mangiola, B.A., Patel, N., Giot, L., Kuang, B., Zhang, H., Zhong, J., Finley, J.: A Drosophila protein-interaction map centered on cell-cycle regulators. Genome Biol. 5(12), R96 (2004)
Xenarios, I., Rice, D.W., Salwinski, L., Baron, M.K., Marcotte, E.M., Eisenberg, D.: DIP: the database of interacting proteins. Nucleic Acids Res. 28(1), 289–291 (2000)
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Dost, B., Shlomi, T., Gupta, N., Ruppin, E., Bafna, V., Sharan, R. (2007). QNet: A Tool for Querying Protein Interaction Networks. In: Speed, T., Huang, H. (eds) Research in Computational Molecular Biology. RECOMB 2007. Lecture Notes in Computer Science, vol 4453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71681-5_1
DOI: https://doi.org/10.1007/978-3-540-71681-5_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71680-8
Online ISBN: 978-3-540-71681-5
eBook Packages: Computer Science, Computer Science (R0)