A rodent anchored hybrid enrichment probe set for a range of phylogenetic utility – from order to species
Cite this dataset
Bangs, Max; Steppan, Scott (2021). A rodent anchored hybrid enrichment probe set for a range of phylogenetic utility – from order to species [Dataset]. Dryad. https://doi.org/10.5061/dryad.x69p8czjx
Abstract
Rodents are the largest order of mammals and contain several model organisms important to scientific research in a variety of fields, yet no large set of genomic markers has been designed for this group to date, hindering evolutionary studies of relationships across the group as a whole. Here we present a genomic probe set designed and optimized for rodents, with a protocol that is easy to replicate with little laboratory investment. This design uses an anchored hybrid enrichment approach specifically targeting rodents to generate longer loci with a higher mutation rate than existing vertebrate probes, providing utility at various taxonomic levels. Using a test set of rodents from all five suborders, we successfully obtained alignments for 416 of the 418 target loci, with an average of 1,379 base pairs per locus and a total alignment of more than half a million base pairs. This genomic dataset performed well in all phylogenetic analyses, especially for recent phylogenetic splits, with ample parsimony-informative sites within genera and even within species, showing more than four times as many single nucleotide polymorphisms per locus as a recent vertebrate ultra-conserved elements study. Additional support is provided in resolving basal clades in Rodentia. By providing this probe design, we hope that more labs can easily generate data for answering questions in rodents, from species delimitation to understanding relationships among families in rapid radiations.
Methods
Probes were designed using MyBaits (Chafin et al. 2018) with the mouse 60-way alignment from UCSC, along with a subset of probes from Lemmon et al. (2012). Each target site ranged from 240 to 400 bp and was split into 21 tiled probes for five species (one per rodent suborder). For more details on the methods, see the linked manuscript in Molecular Ecology Resources.
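As a rough illustration of what tiling a target site means, the sketch below spaces 21 probe start positions evenly across a target of a given length. The probe length of 120 bp and the even-spacing rule are assumptions made for this example only; neither is stated in this record, and the function name `tile_starts` is hypothetical. See the linked manuscript for the actual design.

```python
def tile_starts(target_len, n_probes=21, probe_len=120):
    """Return evenly spaced 0-based start positions for tiled probes.

    probe_len=120 is an ASSUMPTION for illustration; the dataset record
    does not state the probe length or the spacing rule that was used.
    """
    if target_len < probe_len:
        raise ValueError("target is shorter than one probe")
    if n_probes == 1:
        return [0]
    # Distance available for sliding the probe from the first to the last tile.
    span = target_len - probe_len
    # Integer arithmetic keeps the first tile at 0 and the last tile at span.
    return [(i * span) // (n_probes - 1) for i in range(n_probes)]


# Shortest and longest target sites mentioned in the Methods (240 and 400 bp).
short_tiles = tile_starts(240)
long_tiles = tile_starts(400)
```

Under these assumptions, a shorter target simply yields more densely overlapping tiles, since the same number of probes covers less sequence.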
Usage notes
Dataset includes:
1) Three .tre files: Astral LPP, Astral polytomy test, and IQ-TREE ultrafast bootstrap.
2) A list of probes in .csv format: a total of 46,893 probes designed for 446 loci, totaling 5.6 Mbp. This file can be used to order the rodent418probe set directly from Agilent or a similar company. The first 25,116 probes are new rodent-specific probes and are labeled as follows:
species, name of locus, probe number
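For readers who want to work with the probe list programmatically, here is a minimal sketch of separating the first 25,116 rodent-specific probes from the rest and splitting a label into its three parts. The separator character, the example label, and the helper names `parse_probe_label` and `split_probe_rows` are all assumptions made for illustration; check the actual .csv file for its real layout before relying on this.

```python
from typing import NamedTuple

N_NEW_PROBES = 25_116   # rodent-specific probes listed first (from the usage notes)
N_TOTAL_PROBES = 46_893  # total probes in the .csv (from the usage notes)


class ProbeLabel(NamedTuple):
    species: str
    locus: str
    probe_number: int


def parse_probe_label(label, sep="_"):
    """Split a label into species, locus name, and probe number.

    sep="_" is an ASSUMPTION; the record gives the field order but not
    the separator. The first field is taken as the species and the last
    as the probe number, so the locus name may itself contain `sep`.
    """
    species, rest = label.split(sep, 1)
    locus, number = rest.rsplit(sep, 1)
    return ProbeLabel(species.strip(), locus.strip(), int(number))


def split_probe_rows(rows, n_new=N_NEW_PROBES):
    """Return (new rodent-specific rows, remaining rows), by file order."""
    return rows[:n_new], rows[n_new:]


# Hypothetical label, used only to show the parsing behaviour.
example = parse_probe_label("Mus_L12_3")

# Stand-in for rows read from the .csv, e.g. with csv.reader.
rows = list(range(N_TOTAL_PROBES))
new_rows, other_rows = split_probe_rows(rows)
```

With the counts given above, the second group holds the remaining 21,777 probes.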
Funding
National Science Foundation, Award: DEB-1754748