Effect of taxon sampling on recovering the phylogeny of squamate reptiles based on complete mitochondrial genome and nuclear gene sequence data
Introduction
The molecular phylogeny of land vertebrates is presently among the best documented (Meyer and Zardoya, 2003) owing to newly-compiled large sequence data sets based on mitochondrial (mt) and/ or nuclear genes, as well as on rather thorough lineage samplings. This is particularly true for recently reported amphibian (San Mauro et al., 2005, Frost et al., 2006, Roelants et al., 2007), and mammal (Murphy et al., 2001a, Murphy et al., 2001b, Springer et al., 2001) molecular phylogenies, which are relatively robust from a statistical point of view, and will be essential as a framework to any future comparative study pertaining these taxa. In contrast, our understanding of phylogenetic relationships within the third main lineage of tetrapods, i.e. sauropsids (reptiles + birds) is still emerging because thus far accumulated molecular data for this group are limited as compared to mammals and amphibians. The classic hypothesis on sauropsid phylogenetic relationships is based on the absence or presence of two skull temporal fenestrae, and considers a basal split into Anapsida (turtles) and Diapsida (other reptiles + birds), respectively (Meyer and Zardoya, 2003). The latter are further divided into Lepidosauria (squamates + the New Zealand living fossil, the tuatara) and Archosauria (crocodiles + birds). The traditional view of turtles as anapsids (Lee, 2001) has been challenged by several morphological studies suggesting diapsid affinities of turtles (Rieppel and deBraga, 1996, Hill, 2005). Molecular phylogenies (Zardoya and Meyer, 1998, Hedges and Poling, 1999, Kumazawa and Nishida, 1999, Hugall et al., 2007) place the turtles as derived diapsids related with Archosauria.
With nearly 8000 living species and a worldwide distribution (Zug et al., 2001, Pianka and Vitt, 2003, Pough et al., 2004), squamate reptiles conform a highly diversified clade that includes lizards, snakes and amphisbaenians (Townsend et al., 2004, Estes et al., 1988). The main lineages of squamates exhibit a great variety of specialized morphological, behavioral and ecological forms (Zug et al., 2001, Pianka and Vitt, 2003, Pough et al., 2004), which have seriously hindered establishing higher-level phylogenetic relationships within the group based on morphology (e.g., Estes et al., 1988, Lee, 1998, Lee, 2000, Kearney, 2003). Traditionally, squamates have been divided into two major groupings (Iguania and Scleroglossa) based mostly on osteological and soft anatomy characters (Estes et al., 1988, Lee, 1998, Reynoso, 1998, Lee and Caldwell, 2000). This main cladogenetic split has been linked to major differences in tongue structure and associated feeding behavior (Vitt et al., 2003, Vitt and Pianka, 2005). Iguania, which include iguanids, agamids, and chamaeleonids (the latter two grouped together into Acrodonta), use the tongue for prey prehension (as Tuataras) whereas Scleroglossa, which include the remaining squamates, use teeth and jaw for prey prehension, freeing the tongue for chemosensory reception, and seemingly allowing present-day predomination of scleroglossans over iguanians worldwide (Schwenk, 1993, Vitt et al., 2003, Pough et al., 2004, Townsend et al., 2004, Vidal and Hedges, 2005). Scleroglossa is further divided into three infraorders: Gekkota, Scincomorpha, and Anguimorpha, with the latter two grouped into a higher rank, the Autarchoglossa. The limbless groups, i.e. Amphisbaenia, Serpentes and Dibamidae are normally left as “incertae sedis” within the Scleroglossa (Estes et al., 1988) (Fig. 1A).
Several recent papers (Townsend et al., 2004, Vidal and Hedges, 2005, Böhme et al., 2007, Douglas et al., 2006, Kumazawa, 2007) have focused on the molecular phylogeny of squamates deriving at very different conclusions (Fig. 1B, C, and D). Thus far, no molecular phylogeny recovers the basal split of squamates into Iguania and Scleroglossa, against morphological evidence (Townsend et al., 2004, Vidal and Hedges, 2005, Kumazawa, 2007) (Fig. 1). Moreover, molecular phylogenies based on either mt (Böhme et al., 2007, Kumazawa, 2007), nuclear (Vidal and Hedges, 2005) or combined (Townsend et al., 2004, Hugall et al., 2007) sequence data fairly agree in supporting Dibamida and Gekkota as the most basal squamate lineages (but see Harris, 2003, Zhou et al., 2006) (Fig. 1). Scincomorpha are recovered generally as paraphyletic with Scincoidea (Scincidae, Xantusiidae, and Cordylidae) placed as a sister group of Lacertoidea (Lacertidae and Teiidae) + Amphisbaenia, and within a larger clade that also includes Anguimorpha and Iguania (Fig. 1). However, phylogenetic relationships within this larger clade remain largely unresolved (Townsend et al., 2004, Vidal and Hedges, 2005, Böhme et al., 2007, Hugall et al., 2007, Kumazawa, 2007), and the monophyly of Scincomorpha cannot be statistically rejected (Kumazawa, 2007). In addition, the relative phylogenetic position of Serpentes varies among studies, and it is particularly volatile in those based on mt data, likely due to the relatively long branches of the taxon (Townsend et al., 2004, Böhme et al., 2007, Zhou et al., 2006). Moreover, recent studies (Vidal and Hedges, 2005, Fry et al., 2006) have also reported the existence of a ‘venom clade’ (= Toxicofera), including all major squamate lineages with species possessing toxin-secreting oral glands (namely Serpentes, Iguania, and Anguimorpha) (Fry et al., 2006). However, venom glands were reported in Pogona (Acrodonta; Agamidae) and not in Iguanidae (Fry et al., 2006) whereas Iguanidae (and not Acrodonta) were used to represent Iguania in the phylogenetic analyses (Vidal and Hedges, 2005, Fry et al., 2006). Overall, it seems that main differences among recovered molecular phylogenies of squamates may be related with the use of different outgroup taxa, taxon coverages, and gene data sets, as well as with the observations that terminal branches are rather long with respect to internal nodes in the squamate phylogeny, and that some lineages (e.g. Serpentes; Kumazawa et al., 1998) have consistently higher evolutionary rates than others. The relative contribution of these factors to the instability of the squamate molecular phylogeny awaits further investigation.
In order to further resolve the molecular phylogeny of squamates, we increased the number of analyzed species by sequencing three new complete squamate mt genomes. Our phylogenetic analyses including the new mt genome sequence data recovered a topology that differed substantially from those previously reported (Townsend et al., 2004, Vidal and Hedges, 2005, Kumazawa, 2007). We explored whether topological differences could be correlated with the choice of molecular marker, i.e. mt or nuclear DNA, with using different outgroup taxa, i.e. amphibians (Xenopus laevis) versus amniotes, with increasing species sampling of the different squamate lineages, and/ or with incorporating lineages with different evolutionary rates into the phylogenetic analyses.
Section snippets
Taxon sampling
The nucleotide sequence of the complete mt genome was determined anew from a single individual of three species of squamate lizards: the Iberian worm lizard Blanus cinereus (Amphisbaenidae; collected by MGP and Iñigo Martínez-Solano in Santa Maria de Alameda-Madrid, Spain; May 2001; voucher MNCN/ADN 21711) the slow worm Anguis fragilis (Anguidae; collected by RZ in Vilarmiel-Lugo, Spain; April 2003; voucher MNCN/ADN 7215) and the Moorish gecko Tarentola mauritanica (Geckkonidae; collected by
Mitochondrial genome organization and structural features
The complete nucleotide sequences of the L-strand of the mt genomes of three squamates were determined anew. The total lengths of the new squamate mt genomes were 17.035 bp for B. cinereus, 16.593 bp for T. mauritanica and 17.479 bp for A. fragilis. These lengths are within the range reported for other squamate mt genomes (Kumazawa, 2007). All three mt genomes encoded for two rRNA, 22 tRNA and 13 protein-coding genes. The gene organization of the three newly determined mt genomes conformed to
Acknowledgments
We thank Carlos Fernández for providing us with access to the Centro de Supercomputación de Galicia (CESGA) and Aurelio Rodriguez for installing the corresponding phylogenetic software in CESGA. EMA was funded by a CSIC-I3P postgrado fellowship. DSM was sponsored by a postdoctoral fellowship (MEC/Fulbright 2007-0448) of the Ministry of Education and Science (MEC) of Spain. The study was partially funded by MEC under the projects CGL2004-04680-C10-10/BOS to MGP and CGL2004-00401 to RZ and by
References (85)
- et al.
Mitochondrial genome of Pogona vitticepes (Reptilia; Agamidae): control region duplication and the origin of Australasian agamids
Gene
(2005) - et al.
The complete mitochondrial genome of the green lizard Lacerta viridis viridis (Reptilia: Lacertidae) and its phylogenetic position within squamate reptiles
Gene
(2007) Codon bias variation in C-mos between squamate families might distort phylogenetic inferences
Mol. Phylogenet. Evol.
(2003)Mitochondrial genomes from major lizard families suggest their phylogenetic relationships and ancient radiations
Gene
(2007)Convergent evolution and character correlation in burrowing reptiles: towards a resolution of squamate relationships
Biol. J. Linn. Soc.
(1998)- et al.
Phylogenetic relationships among amphisbaenian reptiles based on complete mitochondrial genomic sequences
Mol. Phylogenet. Evol.
(2004) - et al.
Molecular timescales and the fossil record: a paleontological perspective
Trends Genet.
(2004) - et al.
Molecular systematics of primary reptilian lineages and the tuatara mitochondrial genome
Mol. Phylogenet. Evol.
(2003) - et al.
Phylogenetic relationships of discoglossid frogs (Amphibia: Anura: Discoglossidae) based on complete mitochondrial genomes and nuclear genes
Gene
(2004) - et al.
Phylogeny of caecilian amphibians (Gymnophiona) based on complete mitochondrial genomes and nuclear RAG1
Mol. Phylogenet. Evol.
(2004)
The phylogeny of squamate reptiles (lizards, snakes and amphisbaenians) inferred from nine nuclear protein-coding genes
C R Biol.
Molecular dating when rates vary
Trends Ecol. Evol.
The complete mitochondrial genome of Gekko gecko (Reptilia: Gekkonidae) and support for the monophyly of Sauria including Amphisbaenia
Mol. Phylogenet. Evol.
ProtTest: selection of best-fit models of protein evolution
Bioinformatics
Model of amino acid substitution in proteins encoded by mitochondrial DNA
J. Mol. Evol.
Information theory as an extension of the maximum likelihood principle
Dating the tree of life
Science
Paleontological evidence to date the Tree of Life
Mol. Biol. Evol.
Animal mitochondrial genomes
Nucleic Acids Res.
Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis
Mol. Biol. Evol.
A mitogenomic study on the phylogenetic position of snakes
Zool. Scr.
Reliability of Bayesian posterior probabilities and bootstrap frequencies in phylogenetics
Syst. Biol.
Phylogenetic relationships within Squamata
At the feet of the dinosaurs: the early history and radiation of lizards
Biol. Rev. Cambridge Philos. Soc.
Cases in which parsimony or compatibility methods will be positively misleading
Syst. Zool.
Evolutionary trees from DNA sequences: a maximum likelihood approach
J. Mol. Evol.
PHYLIP—Phylogeny inference package (Version 3.2.)
Cladistics
The amphibian tree of life
Bull. Am. Mus. Nat. His.
Early evolution of the venom system in lizards and snakes
Science
A phylogeny of turtles
A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood
Syst. Biol.
A molecular phylogeny of reptiles
Science
Integration of morphological data sets for phylogenetic analysis of Amniota: the importance of integumentary characters and increased taxonomic sampling
Syst. Biol.
MRBAYES: Bayesian inference of phylogenetic trees
Bioinformatics
Bayesian inference of phylogeny and its impact on evolutionary biology
Science
Calibration choice, rate smoothing, and the pattern of tetrapod diversification according to the long nuclear gene RAG-1
Syst. Biol.
OGRe: a relational database for comparative analyses of mitochondrial genomes
Nucleic Acids Res.
Systematics of the amphisbaenia (Lepidosauria: Squamata) based on morphological evidence from recent and fossil forms
Herpetol. Monogr.
Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in Hominoidea
J. Mol. Evol.
Performance of a divergence time estimation method under a probabilistic model of rate evolution
Mol. Biol. Evol.
First definitive record of Mesozoic lizards from Madagascar
J. Vertebr. Paleontol.
Complete mitochondrial DNA sequences of the green turtle and blue-tailed mole skink: statistical evidence for archosaurian affinity of turtles
Mol. Biol. Evol.
Cited by (46)
Improved sampling at the subspecies level solves a taxonomic dilemma – A case study of two enigmatic Chinese tit species (Aves, Passeriformes, Paridae, Poecile)
2017, Molecular Phylogenetics and EvolutionCitation Excerpt :Density of taxon sampling has been repeatedly evaluated as one of the crucial factors affecting the accuracy of phylogenetic analyses and the resulting topologies (Omland et al., 1999; Johnson, 2001; Braun and Kimball, 2002; Zwickl and Hillis, 2002; Funk and Omland, 2003; Heath et al., 2008; Albert et al., 2009; McKay and Zink, 2010; Päckert et al., 2010; Nabhan and Sarkar, 2011).
An ancient lineage of slow worms, genus Anguis (Squamata: Anguidae), survived in the Italian Peninsula
2013, Molecular Phylogenetics and EvolutionMetabolic rate and body size are linked with perception of temporal information
2013, Animal BehaviourCitation Excerpt :As the light levels of species that inhabit turbid waters are typically orders of magnitude lower than typical daylight levels (40–1000 lx; Ali & Klyne 1985; Palmer & Grant 2010; Kreysing et al. 2012) and the harp seal, Pagophilus groenlandicus, regularly forages at depths greater than 200 m (Folkow et al. 2004) where light levels are comparable to nocturnal light levels (Palmer & Grant 2010), we categorized these species as inhabiting low light level environments. To correct for the phylogenetic nonindependence of species we constructed a composite tree of the study species using published molecular phylogenies and divergence times from various sources (Schoch 1985; Janossy 1986; Mercer & Roth 2003; Hedges et al. 2006; Wiens et al. 2006; Benton & Donoghue 2007; Murphy et al. 2007; Brown et al. 2008; Li et al. 2008; Naro-Maciel et al. 2008; Albert et al. 2009; Lim et al. 2010; Little et al. 2010; Perelman et al. 2011; see the Appendix and Fig. A1). In instances in which a divergence time was not available for two species we used the conservatively estimated date of first appearance as the divergence time taken from the Paleobiology Database (Alroy et al. 2008).
Evolutionary patterns of the mitochondrial genome in the Moorish gecko, Tarentola mauritanica
2013, GeneCitation Excerpt :The protein-coding gene sequences were identified and annotated in Geneious Pro v5.4.3 by the start and stop codons, and sequence similarity in the alignment. Inconsistencies between the obtained sequences in this study, and the mitogenome from Albert et al. (2009) were detected, regarding the orientation of two tRNAs (tRNA-Gln and tRNA-Glu). Using the program ARWEN for tRNA detection in metazoan mitochondrial sequences (Laslett and Canbäck, 2008), we identified an incorrect orientation of both tRNA fragments on the GenBank sequence.
Phylogeny and cryptic diversification in Southeast Asian flying geckos
2012, Molecular Phylogenetics and Evolution
- 1
Present address: Estación Biológica de Doñana, CSIC. Avda. María Luisa, s/n. Pabellón del Perú. E-41013 Sevilla, Spain.