ABSTRACT
Building ancestral recombination graphs (ARGs) with the minimum number of recombination events for large datasets is a challenging problem. We previously proposed the heuristic algorithms ARG4WG and REARG for constructing ARGs from thousands of whole-genome sequences. However, these algorithms do not produce ARGs with a minimal number of recombination events. In this work, we propose the GAMARG algorithm, an improvement of ARG4WG, which optimizes the number of recombination events during ARG construction. Experiments on different datasets showed that GAMARG outperforms other heuristic algorithms in building ARGs for large datasets. For small datasets, it is also much better than other heuristic algorithms and comparable to exhaustive search methods.
Index Terms
- A Hybrid Approach to Optimize the Number of Recombinations in Ancestral Recombination Graphs