Skip to main content

A Biclique Approach to Reference Anchored Gene Blocks and Its Applications to Pathogenicity Islands

  • Conference paper
  • First Online:
Algorithms in Bioinformatics (WABI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 9838))

Included in the following conference series:

Abstract

We formalize a new problem variant in gene-block discovery, denoted Reference-Anchored Gene Blocks (RAGB). Given a query sequence Q of length n, representing the gene-array of a DNA element, a window size bound d on the length of a substring of interest in Q, and a set of target gene sequences \(\mathcal {T}=\{T_1 \ldots T_c\}\). Our objective is to identify gene-blocks in \(\mathcal {T}\) that are centered in a subset q of co-localized genes from Q, and contain genomes from \(\mathcal {T}\) in which the corresponding orthologs of the genes from q are also co-localized. We cast RAGB as a variant of a (colored) biclique problem in bipartite graphs, and analyze its parameterized complexity, as well as the parameterized complexity of other related problems. We give an \(O(nm+2^dnm/\lg m)\) time algorithm for the uncolored variant of our biclique problem, where m is the number of areas of interest that are parsed from the target sequences, and n and d are as defined above. Our algorithm can be adapted to compute all maximal bicliques in the graph within the same time complexity, and to handle edge-weights with a slight \(O(\lg d)\) increase to its time complexity. For the colored version of the problem, our algorithm has a time complexity of \(O(2^dnm)\). We implement the algorithm and exemplify its application to LEE, a well-known pathogenicity island from the e. coli genome harboring virulence genes. Our code and supplementary materials, including omitted proofs and figures, are available at https://www.cs.bgu.ac.il/~negevcb/RAGB/.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alon, N., Yuster, R., Zwick, U.: Color-coding. J. ACM 42(4), 844–856 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  2. Bergeron, A., Chauve, C., Gingras, Y.: Formal models of gene clusters. Bioinform. Algorithms Tech. Appl. 8, 177–202 (2008)

    Google Scholar 

  3. Chateau, A., Riou, P., Rivals, E.: Approximate common intervals in multiple genome comparison. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 131–134. IEEE (2011)

    Google Scholar 

  4. Chen, L., et al.: VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 33(suppl 1), D325–D328 (2005)

    Google Scholar 

  5. Deng, W., et al.: Dissecting virulence: systematic and functional analyses of a pathogenicity island. Proc. Natl. Acad. Sci. U.S.A. 101(10), 3597–3602 (2004)

    Article  Google Scholar 

  6. Elliott, S.J., et al.: The complete sequence of the locus of enterocyte effacement (LEE) from enteropathogenic escherichia coli e2348/69. Mol. Microbiol. 28(1), 1–4 (1998)

    Article  Google Scholar 

  7. Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H Freeman, New York (1979)

    MATH  Google Scholar 

  8. Hazen, T.H., et al.: Refining the pathovar paradigm via phylogenomics of the attaching and effacing escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 110(31), 12810–12815 (2013)

    Article  Google Scholar 

  9. Dhillon, B.K.: Islandviewer 3: more flexible, interactive genomic island discovery, visualization and analysis. Nucleic Acids Res. 43, W104–W108 (2015). gkv401

    Article  Google Scholar 

  10. Kim, S., et al.: A hybrid gene team model and its application to genome analysis. J. Bioinform. Comput. Biol. 4(02), 171–196 (2006)

    Article  Google Scholar 

  11. Ling, X., He, X., Xin, D.: Detecting gene clusters under evolutionary constraint in a large number of genomes. Bioinform. 25(5), 571–577 (2009)

    Article  Google Scholar 

  12. Ream, D.C., et al.: An event-driven approach for studying gene block evolution in bacteria. Bioinform. 31(13), 2075–2083 (2015)

    Article  Google Scholar 

  13. Downey, R.G., Fellows, M.R.: Parameterized Complexity. Springer, New York (1999)

    Book  MATH  Google Scholar 

  14. Samuel, W., et al.: Organization and coordinated assembly of the type III secretion export apparatus. Proc. Nat. Acad. Sci. 107(41), 17745–17750 (2010)

    Article  Google Scholar 

  15. Schmidt, H., Hensel, M.: Pathogenicity islands in bacterial pathogenesis. Clin. Microbiol. Rev. 17(1), 14–56 (2004)

    Article  Google Scholar 

  16. Schmidt, T., Stoye, J.: Quadratic time algorithms for finding common intervals in two and more sequences. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, pp. 347–358. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  17. Waack, S., et al.: Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models. BMC Bioinform. 7(1), 142 (2006)

    Article  Google Scholar 

Download references

Acknowledgements

The research of D.H. and A.B. was partially supported by the People Programme (Marie Curie Actions) of the European Union’s Seventh Framework Programme (FP7/2007-2013) under REA grant agreement number 631163.11, and by the Israel Science Foundation (grant No. 551145/). The research of M.Z-U. and A.B. was partially supported by the Israel Science Foundation (grant No. 179/14.) and by the Frankel Center for Computer Science at Ben Gurion University.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Danny Hermelin or Michal Ziv-Ukelson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Benshahar, A., Chalifa-Caspi, V., Hermelin, D., Ziv-Ukelson, M. (2016). A Biclique Approach to Reference Anchored Gene Blocks and Its Applications to Pathogenicity Islands. In: Frith, M., Storm Pedersen, C. (eds) Algorithms in Bioinformatics. WABI 2016. Lecture Notes in Computer Science(), vol 9838. Springer, Cham. https://doi.org/10.1007/978-3-319-43681-4_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-43681-4_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-43680-7

  • Online ISBN: 978-3-319-43681-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics