Abstract
Distance-based phylogenetic reconstruction methods use the evolutionary distances between species to reconstruct the tree spanning them. This paper continues a line of research that seeks to fit, to each given set of input sequences, a distance function that maximizes the expected accuracy of the reconstructed tree. We demonstrate both analytically and experimentally that deliberately assuming an oversimplified evolutionary model can increase the accuracy of reconstruction.
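To make the notion of a model-dependent distance function concrete, the sketch below computes two textbook pairwise distances from an aligned pair of nucleotide sequences: the Jukes-Cantor distance, which assumes a single substitution rate, and the Kimura two-parameter (K2P) distance, which separates transitions from transversions. This is only an illustration of a simpler versus a richer model, not the adaptive distance functions studied in the paper; the function names and the toy sequences are assumptions made for the example.

```python
import math

# Purine<->purine and pyrimidine<->pyrimidine changes are transitions;
# every other mismatch is counted as a transversion.
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def mismatch_fractions(seq_a, seq_b):
    """Return (P, Q): the fractions of transition and transversion sites.

    Hypothetical helper: assumes two aligned, gap-free sequences over ACGT.
    """
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be aligned and non-empty")
    transitions = 0
    transversions = 0
    for x, y in zip(seq_a, seq_b):
        if x == y:
            continue
        if (x, y) in TRANSITIONS:
            transitions += 1
        else:
            transversions += 1
    n = len(seq_a)
    return transitions / n, transversions / n


def jc_distance(seq_a, seq_b):
    """Jukes-Cantor distance: d = -(3/4) ln(1 - (4/3) p), with p = P + Q.

    Raises ValueError (from math.log) when the pair is saturated.
    """
    p, q = mismatch_fractions(seq_a, seq_b)
    return -0.75 * math.log(1.0 - 4.0 * (p + q) / 3.0)


def k2p_distance(seq_a, seq_b):
    """K2P distance: d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q).

    Raises ValueError (from math.log) when the pair is saturated.
    """
    p, q = mismatch_fractions(seq_a, seq_b)
    return -0.5 * math.log(1.0 - 2.0 * p - q) - 0.25 * math.log(1.0 - 2.0 * q)


if __name__ == "__main__":
    a = "ACGTACGTAC"
    b = "ACGTACGTAT"  # one transition (C -> T) at the last site
    print("JC :", jc_distance(a, b))
    print("K2P:", k2p_distance(a, b))
```

Both functions read the same mismatch counts but convert them to distances under different model assumptions, and either could be fed to a distance-based method such as neighbor-joining. Which choice gives the more accurate tree on a given input is the question the paper addresses.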
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Doerr, D., Gronau, I., Moran, S., Yavneh, I. (2011). Stochastic Errors vs. Modeling Errors in Distance Based Phylogenetic Reconstructions. In: Przytycka, T.M., Sagot, M.-F. (eds) Algorithms in Bioinformatics. WABI 2011. Lecture Notes in Computer Science, vol 6833. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23038-7_5
DOI: https://doi.org/10.1007/978-3-642-23038-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23037-0
Online ISBN: 978-3-642-23038-7
eBook Packages: Computer Science, Computer Science (R0)