Developing informative microsatellite markers for non-model species using reference mapping against a model species’ genome

Hung, Chih-Ming; Yu, Ai-Yun; Lai, Yu-Ting; Shaner, Pei-Jen L.

doi:10.1038/srep23087

Download PDF

Article
Open access
Published: 15 March 2016

Developing informative microsatellite markers for non-model species using reference mapping against a model species’ genome

Chih-Ming Hung¹,
Ai-Yun Yu²,
Yu-Ting Lai² &
…
Pei-Jen L. Shaner²

Scientific Reports volume 6, Article number: 23087 (2016) Cite this article

2578 Accesses
9 Citations
Metrics details

Subjects

A Corrigendum to this article was published on 19 June 2017

This article has been updated

Abstract

Microsatellites have a wide range of applications from behavioral biology, evolution, to agriculture-based breeding programs. The recent progress in the next-generation sequencing technologies and the rapidly increasing number of published genomes may greatly enhance the current applications of microsatellites by turning them from anonymous to informative markers. Here we developed an approach to anchor microsatellite markers of any target species in a genome of a related model species, through which the genomic locations of the markers, along with any functional genes potentially linked to them, can be revealed. We mapped the shotgun sequence reads of a non-model rodent species Apodemus semotus against the genome of a model species, Mus musculus, and presented 24 polymorphic microsatellite markers with detailed background information for A. semotus in this study. The developed markers can be used in other rodent species, especially those that are closely related to A. semotus or M. musculus. Compared to the traditional approaches based on DNA cloning, our approach is likely to yield more loci for the same cost. This study is a timely demonstration of how a research team can efficiently generate informative (neutral or function-associated) microsatellite markers for their study species and unique biological questions.

Assessing the recovery of Y chromosome microsatellites with population genomic data using Papio and Theropithecus genomes

Article Open access 24 August 2023

Giacomo Mutti, Gonzalo Oteo-Garcia, … Cristian Capelli

The landscape of microsatellites in the enset (Ensete ventricosum) genome and web-based marker resource development

Article Open access 17 September 2020

Manosh Kumar Biswas, Jaypal N. Darbar, … J. S. Heslop-Harrison

Whole genome survey of big cats (Genus: Panthera) identifies novel microsatellites of utility in conservation genetic study

Article Open access 08 July 2021

Jee Yun Hyun, Puneet Pandey, … Hang Lee

Introduction

Microsatellites have been applied to a wide range of biological studies given their extensive genome distribution, high level of polymorphism, and high amplification success¹. However, microsatellite markers mostly are anonymous DNA fragments, obstructing their usefulness in deeper applications². Long considered as neutral markers, microsatellites have been used for parentage analysis, population genetics, and natural resource management^3,4,5,6. More recently, studies have found that a significant portion of microsatellites are associated with functional changes, and mutations that may cause diseases^7,8; furthermore, some are so closely linked to genes under selection that they deviate from neutral patterns^9,10. Consequently, microsatellites can be used to tag corresponding functional genes or map quantitative trait loci (QTL)^11,12. These characters make microsatellites useful markers beyond their early application in population genetics. However, the conventional protocols based on DNA cloning generate microsatellite markers without background information on genomic locations or associated gene functions, both of which are important in the study of nature selection or selective sweeps and the evolution of agriculturally or medically important traits¹.

There have been several new approaches that use the next-generation sequencing (NGS) technologies to develope microsatellite markers making the procedures more efficient in terms of time and money and potentially broadening its application in biology^{13,14,15,16,17}. However, genomic background information of most markers is still unavailable. Even though microsatellite markers associated with functional genes can be developed from expressed sequence tags (ESTs) or transcriptomes, such approaches can only generate markers mostly located in coding regions, representing a minor part of a genome^12,15. By contrast, whole genomes published for a variety of species have provided unprecedented resources to develop genome-wide microsatellite markers with detailed genomic background, even for species whose genomes are not likely to be assembled in the foreseeable future.

Rodents have been the subjects of study in a wide array of biological disciplines, from evolutionary biology, to community ecology and epidemiology given their widespread distributions and close interactions with humans^{18,19,20,21,22}. The house mouse (Mus musculus) can provide valuable reference information for developing microsatellite markers for other rodents because its whole genome sequences and gene ontology are well studied²³. In this study, we developed microsatellite markers with annotation such as distances between microsatellite motifs and surrounding functional genes for a rodent species, Apodemus semotus, by mapping its raw sequence reads against the M. musculus genome. The two genera Apodemus and Mus have diverged c. 10 million years ago²⁴. Apodemus are the most common rodents in the temperate zone of the Palearctic region²⁵, and may show diverse adaptations throughout their wide distribution range via natural selection. The markers developed here can also be used in other species closely related to A. semotus or M. musculus.

Methods

Genomic DNA extraction and genome sequencing

One notch (~1.5 mm diameter) of ear tissue was cut using an ear punch from each of 24 A. semotus individuals collected at the Shei-Pa National Park in Taiwan. The tissue samples were preserved in 95% EtOH and stored in a −20 °C freezer until DNA extraction. Genomic DNA was extracted from the tissue samples using the LiCl method²⁶.

Ethics statement

We had IACUC approval for rodent trapping and tissue sampling procedures in this project through National Taiwan Normal University’s Institutional Animal Care and Use Committee (protocol no. 102004); all processes involving live animals were performed in accordance with the approved guidelines.

Genome sequencing and Reference mapping

Illumina shotgun sequencing was applied to the genomic DNA of a female A. semotus sample. An Illumina library with an insertion size of 500 bp was prepared from 2.74 μg of DNA. Paired-end 90 bp sequence reads were obtained from a lane of Illumina HiSeq 2000. The library preparation and sequencing were performed by Beijing Genomic Institute (BGI, Shenzhen, China).

The whole genome sequences of M. musculus (GRCm38.p3, downloaded from the NCBI GenBank) was used as the reference genome. The Illumina paired-end reads of A. semotus were mapped against the genome of M. musculus using CLC Genomics Workbench 6.02 (CLC Inc, Aarhus, Denmark). The “Reference Mapper” tool of CLC was run with the default parameter settings (insertion cost = 3, deletion cost = 3, mismatch cost = 2, length fraction = 0.5) except that “similarity fraction” was changed from 0.8 to 0.85 to increase the portion of conserved regions in the mapped genome.

Identification of microsatellite loci and primer design

Fragments of mapped sequences (hereafter “scaffolds”) were screened using MSATCOMMANDER 1.0.8²⁷ to identify tetra-microsatellite motifs with a minimum of 8 tandem repeats and to design primers with a maximum product size of 450 bp. The primers were designed using Primer3²⁸ implemented in MSATCOMMANDER. The primer size was set from 20 to 24 bp, the “T_M” was set at 57 °C (with a range of 54–65 °C), and the other parameters were set as default. A M13R (5′-GGAAACAGCTATGACCAT-3′) or CAG tag (5′-CAGTCGGGCGTCATCA-3′) was added on the 5′ end of one primer from each pair to enable the application of a third primer that was fluorescently labeled with FAM, HEX, TEMRA.

Microsatellite amplification and genotyping

Polymerase chain reactions (PCRs) were performed in 10 μl reaction mixture, containing 15–25 ng DNA, 0.05 μM of each primer, 0.5 μM of each primer with a M13R or CAG tail, 0.5 μM of a fluorescently labeled M13R-tag or CAG-tag primer, 0.2 mM of each dNTP, and 0.75U Taq polymerase (TOYOBO, Blend Taq -Plus-) with 1X PCR buffer. The PCR cycling profile was consisted of an initial denaturation step of 2 min at 95 °C followed by 20 cycles of 95 °C for 30 s, 60 °C (decreased 0.5 per cycle) for 30 s and 72 °C for 40 s and 20 cycles of 95 °C for 30 s, 50 °C for 30 s and 72 °C for 40 s, followed by a final extension step at 72 °C for 7 min. Amplified microsatellite products were genotyped using ABI 3730XL sequencer (Applied Biosystems), and allele sizes were scored using PeakScanner (Applied Biosystems).

Genotype data analysis

The program CERVUS 3.0²⁹ was used to estimate the number of alleles (N_A), expected (H_E) and observed heterozygosity (H_O). We used GENEPOP 4.2³⁰ to assess deviation from Hardy-Weinberg equilibrium (HWE). MICRO-CHECKER ver. 2.2.3³¹ was used to test for null alleles.

Identifying protein genes linked to microsatellites

We firstly compared A. semotus DNA sequence fragments (i.e., query fragments) that extend from 100 kbp upstream to 100 kbp downstream of the regions flanked by microsatellite primers against M. musculus protein sequences (GRCm38.p3, downloaded from the NCBI GenBank) using BLASTX v 2.2.30+^32,33 with a threshold E-value of 10⁻⁶. The region of 200 kbp was chosen because it was a reasonable range that a selective sweep might affect⁹. Secondly, we used NCBI Map Viewer³⁴ (http://www.ncbi.nlm.nih.gov/mapview/) to locate genes with annotated intron and exon structures in the 200 kbp regions (from 100 kbp upstream to 100 kbp downstream) centering the microsatellites in the M. musculus genome. We considered the genes potentially linked to the microsatellites in A. semotus scaffolds only when they were identified in both the BLASTX and Map Viewer results (Supplementary Fig. S1). In other words, the Map Viewer results were used to filter the BLASTX outputs for A. semotus to avoid false positive results.

In the Map Viewer results, predicted or uncharacterized genes were labeled as “Gm”, “LOC” or “Rik” and microRNA as “Mir”. However, the BLASTX approach could not find these genes. For the sake of simplicity, we did not take them in to account in this study.

Cross-species amplification test

We used 13 other rodent species (A. agrarius, M. musculus, M. caroli, Micromys minutus, Niviventer coxingi, N. culturatus, Bandicota indica, Rattus exulans, R. losea, R. norvegeicus, R. tanezumi, Microtus kikuchii and Eothenomys melanogaster) from eight genera to test cross-species amplification for the primers designed from A. semotus. The 13 species and A. semotus have diverged over 20 million years³⁵. Five samples from each of the 13 species were used to estimate the successful amplification rates of the microsatellite loci. A sample would need to amplify a product of the expected size with a lack of smearing to be considered successful.

Results

Mapped genome of A. semotus

We started with 32.6 Gb paired-end sequence reads of A. semotus. By mapping the reads against the M. musculus genome with a size of 2.72 Gb, we obtained 271 scaffolds with a total size of 1.7 Gb (excluding mapping gaps) and an average sequencing coverage (or depth) of 5.6-fold.

Microsatellite screening and quality evaluation

We identified 63,672 tetra-repeat microsatellite motifs in 90 scaffolds longer than one million bp. We designed primers for 1,456 microsatellite motifs. We randomly chose 2–5 microsatellite loci from the scaffolds corresponding to each chromosome (except for the 7th and sexual chromosomes) of M. musculus. This led to a total of 59 loci (see Supplementary Table S1 and Fig. S1 for detailed genomic locations) for us to test their amplification rates and polymorphism in A. semotus. We used 24 A. semotus samples for the test. We successfully amplified 44 loci, for which more than 80% of the 24 samples could be amplified (Supplementary Table S1). Among the 44 loci, 24 displayed polymorphism and could be clearly scored with no ambiguous peaks in size profiles (mean N_A = 7.4, mean H_E = 0.689, mean H_O = 0.599, 18 loci were in HWE; Table 1).

Table 1 Characteristics of 24 microsatellite loci genotyped in Apodemus semotus.

Full size table

Protein genes potentially linked to microsatellites

Of the 59 microsatellite loci, nearly 70% (41 loci) were less than 100 kbp from the closest exon or coding region along the A. semotus scaffolds (Fig. 1). In the 200-kbp region centering each of the 59 loci along the A. semotus scaffolds, 34% (20 loci) included one protein coding gene, 19% (11 loci) included two genes and 17% (10 loci) included three or more genes (Fig. 2). The patterns found in the A. semotus scaffolds (based on BLASTX results) were similar to those in the M. musculus genome (based on Map Viewer results; Figs 1 and 2; Supplementary Fig. S1). Given that a selective sweep might affect neutral genes up to 100 kbp away from a selected one⁹, genes in the 200 kbp region can be considered potentially linked to the microsatellites (Supplementary Fig. S1).

Figure 1: Distribution of the distance between a microsatellite locus and its nearest exon in the *Mus musculus* genome (identified using Map Viewer) or between a microsatellite locus and its nearest coding region in *Apodemus semotus* scaffolds (identified using BLASTX).

**Figure 2: Distribution of the number of genes located within 100 kbp (upstream and downstream) from a microstatellite locus.**

Cross-species amplification testing

At a success rate of 80% or better, we amplified 0 to 19 out of the 59 microsatellite loci across the 13 rodent species; at a success rate of 40% or better, we amplified 2 to 29 loci across the rodent species (Fig. 3 and Supplementary Table S1). A congener to A. semotus, A. agrarius had the highest amplification rate regardless the success rate threshold. Species that are closely related to A. semotus or M. musculus had higher success rates than others (Fig. 3).

**Figure 3: Numbers of amplified loci.**

Discussion

We successfully devised a reference genome mapping approach to develop microsatellite makers detailed with genomic locations and potentially linked genes, for a non-model rodent species A. semotus. For any species, to which the genome of a closely related species is available, this approach can efficiently generate informative microsatellite markers that have a wide range of applications from behavioral biology, adaptive evolution, to agriculture-based breeding programs.

An effective and economical approach to microsatellite development based on NGS

Our approach based on reference genome mapping is more efficient in terms of money and time than traditional approaches based on cloning of genetic libraries and Sanger sequencing. In general, traditional approaches require one to four weeks of bench work followed by Sanger sequencing³⁶, whereas our approach requires only one to two days of bench work for DNA extraction followed by shotgun sequencing. The total cost for bench work and sequencing (i.e., including the cost for processing through primer design but excluding PCR test) are similar between traditional (1,100 ~ 4,400 USD)³⁶ and our approaches (3,000 USD). However, the higher yield of loci of our approach (1,456 loci with at least eight tetra-nucleotide repeats) compared with the traditional ones (100 loci assuming a 50% of positive rate based on 200 screened clones)^36,37 makes the former at least 5 times more cost-effective on a per-locus basis (Supplementary Table S2). Moreover, our approach can improve the quality of microsatellite makers with detailed genomic background information, which is more difficult to achieve using the traditional approaches.

Regarding the applications of different NGS technologies in microsatellite development, Illumina approaches are more cost-effective than 454 approaches^14,38,39. Nevertheless, Illumina raw reads are usually too short to cover the entire microsatellites or to have sufficient flanking sequences for primer design³⁸. Even though the combination of two paired-end reads can partially solve the problem, it provides little information on the exact number of repeats in microsatellite loci³⁹. In addition, the total length of microsatellite loci developed from either Illumina or 454 raw reads is generally short^37,39.

Although de novo assembling raw reads into longer contigs can increase the number and length of isolated microsatellite makers^14,17, the required amount of raw reads and computational power are not trivial. By contrast, mapping raw reads against the reference genome of a closely related species is more cost-efficient^40,41. Given the rapid accumulation of whole genome data, soon most species will have reference genomes from species close enough for genome mapping. Here we devised an Illumina-based reference mapping approach to isolate microsatellite markers, which requires minimum laboratory and computing resources and thus is affordable for most research groups. Although the mapping-based genome of A. semotus (1.7 Gb scaffolds with an average sequencing coverage of 5.6X) is neither complete nor high-quality, and reference-guided genomes with similar levels of coverage have been found to miss a portion of microsatellites⁴¹, this genome still provides thousands of primers, which is sufficient for most of the biological questions. In fact, the development of informative microsatellite makers can become a routine by-product of a genome re-sequencing (mapping) project, adding values to the core genome product.

Furthermore, aligning the microsatellite markers developed for non-model species against the reference genome of a model species can help the former to adopt annotated gene information from the latter. In general, the distributions of protein coding genes surrounding the microsatellites in A. semotus scaffolds (based on the BLASTX results) reflect well those in the M. musculus genome (based on the Map Viewer results) despite some discrepancies (Supplementary Fig. S1). Several reasons could explain the observed discrepancies: (1) the gene sequences in the two species had differentiated too much to be detected by the BLASTX approach, (2) CLC mapping gaps or errors had caused the failure to detect these genes or exons, (3) the M. musculus protein database was incomplete and/or (4) the Map Viewer results overestimated the presence of coding regions. Some sequence regions in the Map Viewer results might be erroneously identified as coding regions (e.g., light green vertical lines in Supplementary Fig. S1) because the Map Viewer results were merged from multiple assembles and coding regions, some of which might not have been validated. Nevertheless, the discrepancies occurred infrequently and did not substantially impact the applications of these markers.

The markers developed in this study can also be used in other rodent species closely related to either A. semotus or M. musculus. Although these microsatellites do not have universal amplification success across all 14 test species, strong cross-amplification between the two genera Apodemus and Mus that have diverged c. 10 million years ago²⁴ is encouraging. By carefully choosing reference species (as distinct as possible but still close enough for mapping), we believe that this approach can generate makers with high cross-amplification success given the more conserved regions used for primer design.

Applications of informative microsatellite markers

The anonymity of microsatellites is sometimes overemphasized in population genetic analysis. Although anonymous microsatellites reflect randomness in sampling, informative microsatellite markers generated by our approach can have wider and deeper applications. For example, positive selection or selective sweeps have been found in some microsatellites^9,10,42, but the lack of information about the potential protein coding genes associated with these microsatellites prevents further understanding of the selection scenarios. Our approach is desirable in that it allows identification of functional genes linked to microsatellites. Even though informative microsatellites have been developed from expressed sequence tags (ESTs) or transcriptomes, such approaches can only generate markers that are mainly located in coding regions and the EST microsatellites are less polymorphic and less representative of genome-wide patterns^12,15. By contrast, our approach is more flexible in that genome-widely distributed microsatellite loci with different levels of linkage to protein coding genes can be developed, which provide rich materials for testing positive selection and selective sweeps⁹.

Given there is a wealth of quantitative trait loci (QTLs) with genome coordinates available in public databases for model species (e.g., Mouse Genome Informatics database; http://www.informatics.jax.org), our approach can also be applied to develop microsatellite markers near the QTLs of interest for non-model species (e.g., Apodemus and other Mus species). Such markers provide useful tools to investigate the genetic basis of quantitative phenotypic traits. Furthermore, the whole genome or QTL database of the study species per se is not required using our approach, making it widely applicable across species.

On the other hand, for genetic analyses that require neutral markers, such as effective population size estimation, our approach is useful in a priori filtering out makers that are closely linked to functional genes (e.g., microsatellites located in exonic or intronic regions of protein coding genes) and thus can potentially deviate from neutral patterns, which may improve the quality of the genetic inferences. Our approach also allows ones to choose independent markers that are widely distributed across the whole genome to avoid marker-biases in population genetics analysis.

The application of microsatellites in population genetics has been criticized due to the concern of size homoplasy⁴³, null alleles⁴⁴, or artificial population structure^45,46 although some argue that they could perform better than single-nucleotide polymorphism (SNP) data in recovering recent population structure⁴⁷. Aside from the controversial role of microsatellites in the studies of population genetics, the applications of microsatellites remain very useful in behavioral biology. For example, researchers can identify sex-linked loci using our approach to maximize the confident level of paternal assignment in mammals given that Y-chromosome loci have higher resolution than autosomal ones in identifying fathers of male mammalian individuals. An increase in precision of parentage assignment through informative microsatellite markers can enhance our ability to understand the selection processes underlying animal behaviors, such as extra-pair mating or conspecific brood parasitism.

Conclusion

In the NCBI Genbank database (February 2016), there are already 116 mammal genomes, 66 bird genomes, 66 fish genomes, 194 insect genomes, 159 land plant genomes, just to name a few. For mammals, 17 rodent genomes are available; for birds, almost every order has at least one genome available. There are more ambitious, ongoing genome projects, such as Genome 10K⁴⁸, i5K⁴⁹ and B10K⁵⁰ projects, which will generate thousands of genomes for several taxonomic groups in the near future. Our study is a timely demonstration of how to utilize the growing genome database to reenergize existing genetic tools. Even though most published genomes have not been studied as explicitly as the house mouse genome, they are still enough to develop annotated microsatellite makers. The mapping approach can also be applied to extinct species⁵¹ to isolate microsatellite markers for them. As the DNA extracted from fossils or old specimens is highly fragmented, microsatellites characterized by small lengths are ideal markers to study extinct species. In conclusion, this study highlights our untapped power to generate “custom” genetic markers in the genomics era.

Additional Information

Accession codes: Illumina short reads deposited in NCBI Genbank under accession number SRP071097. http://www.nature.com/srep

How to cite this article: Hung, C.-M. et al. Developing informative microsatellite markers for non-model species using reference mapping against a model species' genome. Sci. Rep. 6, 23087; doi: 10.1038/srep23087 (2016).

Change history

19 June 2017
A correction has been published and is appended to both the HTML and PDF versions of this paper. The error has been fixed in the paper.

References

Chistiakov, D. A., Hellemans, B. & Volckaert, F. A. Microsatellites and their genomic distribution, evolution, function and applications: a review with special reference to fish genetics. Aquaculture 255, 1–29 (2006).
Article CAS Google Scholar
Cullis, C. A. (2002). The use of DNA polymorphisms in genetic mapping in Genetic engineering (ed. Setlow, J. K. ) 179–189 (Springer, 2002).
Garcia de Leon, F. J., Canonne, M., Quillet, E., Bonhomme, F. & Chatain, B. The application of microsatellite markers to breeding programmes in the sea bass, Dicentrarchus labrax. Aquaculture 159, 303–316 (1998).
Article CAS Google Scholar
Hung, C. M., Li, S. H. & Lee, L. L. Faecal DNA typing to determine the abundance and spatial organisation of otters (Lutra lutra) along two stream systems in Kinmen. Anim. Conserv. 7, 301–311 (2004).
Article Google Scholar
Kirst, M., Cordeiro, C. M., Rezende, G. D. S. P. & Grattapaglia, D. Power of microsatellite markers for fingerprinting and parentage analysis in Eucalyptus grandis breeding populations. J. Hered. 96, 161–166 (2005).
Article CAS PubMed Google Scholar
Randi, E. et al. Multilocus detection of wolf x dog hybridization in Italy, and guidelines for marker selection. PloS One 9, e86409 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, Y. C., Korol, A. B., Fahima, T. & Nevo, E. Microsatellites within genes: structure, function, and evolution. Mol. Biol. Evol. 21, 991–1007 (2004).
Article CAS PubMed Google Scholar
Batra, R., Charizanis, K. & Swanson, M. S. Partners in crime: bidirectional transcription in unstable microsatellite disease. Hum. Mol. Genet. ddq132 (2010).
Kauer, M. O., Dieringer, D. & Schlötterer, C. A microsatellite variability screen for positive selection associated with the “out of Africa” habitat expansion of Drosophila melanogaster. Genetics 165, 1137–1148 (2003).
CAS PubMed PubMed Central Google Scholar
Nielsen, E. E., Hansen, M. M. & Meldrup, D. Evidence of microsatellite hitch‐hiking selection in Atlantic cod (Gadus morhua L.): implications for inferring population structure in nonmodel organisms. Mol. Ecol. 15, 3219–3229 (2006).
Article CAS PubMed Google Scholar
Guo, W. et al. A microsatellite-based, gene-rich linkage map reveals genome structure, function and evolution in Gossypium. Genetics 176, 527–541 (2007).
Article CAS PubMed PubMed Central Google Scholar
Hibrand-Saint Oyant, L., Crespel, L., Rajapakse, S., Zhang, L. & Foucher, F. Genetic linkage maps of rose constructed with new microsatellite markers and locating QTL controlling flowering traits. Tree Genet. Genomes 4, 11–23 (2008).
Article Google Scholar
Gardner, M. G., Fitch, A. J., Bertozzi, T. & Lowe, A. J. Rise of the machines–recommendations for ecologists when using next generation sequencing for microsatellite development. Mol. Ecol. Resour. 11, 1093–1101 (2011).
Article PubMed Google Scholar
Rico, C., Normandeau, E., Dion-Côté, A. M., Rico, M. I., Côté, G. & Bernatchez, L. Combining next-generation sequencing and online databases for microsatellite development in non-model organisms. Sci. Rep. 3, 3376 (2013).
Article PubMed PubMed Central Google Scholar
Dufresnes, C., Brelsford, A., Béziers, P. & Perrin, N. Stronger transferability but lower variability in transcriptomic‐than in anonymous microsatellites: evidence from Hylid frogs. Mol. Ecol. Resour. 14, 716–725 (2014).
Article CAS PubMed Google Scholar
Grattapaglia, D., Mamani, E., Silva‐Junior, O. B. & Faria, D. A. A novel genome‐wide microsatellite resource for species of Eucalyptus with linkage‐to‐physical correspondence on the reference genome sequence. Mol. Ecol. Resour. 15, 437–448 (2015).
Article CAS PubMed Google Scholar
Jia, X., Deng, Y., Sun, X., Liang, L. & Ye, X. Characterization of the global transcriptome using Illumina sequencing and novel microsatellite marker information in seashore paspalum. Genes Genom. 37, 77–86 (2015).
Article CAS Google Scholar
Krebs, C. J. Population cycles revisited. J. Mammal. 77, 8–24 (1996).
Article Google Scholar
Brown, J. H., Fox, B. J. & Kelt, D. A. Assembly rules: desert rodent communities are structured at scales from local to continental. Am. Nat. 156, 314–321 (2000).
Article PubMed Google Scholar
Zhang, J., Dyer, K. D. & Rosenberg, H. F. Evolution of the rodent eosinophil-associated RNase gene family by rapid gene sorting and positive selection. Proc. Natl. Acad. Sci. USA 97, 4701–4706 (2000).
Article CAS ADS PubMed PubMed Central Google Scholar
Singleton, G. R., Hinds, L. A., Krebs, C. J. & Spratt, D. M. In Rats, mice and people: rodent biology and management (Australian Centre for International Agricultural Research, 2003).
Broadbent, N. J., Gaskin, S., Squire, L. R. & Clark, R. E. Object recognition memory and the rodent hippocampus. Learn. Memory 17, 5–11 (2010).
Article Google Scholar
Peña-Castillo, L. et al. A critical assessment of Mus musculus gene function prediction using integrated genomic evidence. Genome Biol. 9, S2 (2008).
Article CAS PubMed PubMed Central Google Scholar
Liu, Q. et al. Phylogeographic study of Apodemus ilex (Rodentia: Muridae) in southwest China. PLoS One 7, e31453 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Corbet, G. B. In The mammals of the Palaearctic region: a taxonomic review (Cornell University Press, 1978).
Gemmell, N. J. & Akiyama, S. An efficient method for the extraction of DNA from vertebrate tissues. Trends Genet. 12, 338–339 (1996).
Article CAS PubMed Google Scholar
Faircloth, B. C. Msatcommander: detection of microsatellite repeat arrays and automated, locus‐specific primer design. Mol. Ecol. Resour. 8, 92–94 (2008).
Article CAS PubMed Google Scholar
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers in Bioinformatics methods and protocols (eds. Misener, S. & Krawtez, S. A. ) 365–386 (Humana Press, 1999).
Marshall, T. C., Slate, J. B. K. E., Kruuk, L. E. B. & Pemberton, J. M. Statistical confidence for likelihood‐based paternity inference in natural populations. Mol. Ecol. 7, 639–655 (1998).
Article CAS PubMed Google Scholar
Rousset, F. Genepop’007: a complete re‐implementation of the genepop software for Windows and Linux. Mol. Ecol. Resour. 8, 103–106 (2008).
Article PubMed Google Scholar
Van Oosterhout, C., Weetman, D. & Hutchinson, W. F. Estimation and adjustment of microsatellite null alleles in nonequilibrium populations. Mol. Ecol. Notes 6, 255–256 (2006).
Article Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
Article CAS PubMed PubMed Central Google Scholar
Sayers, E. W. et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 39, D38–D51 (2011).
Article CAS PubMed Google Scholar
Fabre, P. H., Hautier, L., Dimitrov, D. & Douzery, E. J. A glimpse on the pattern of rodent diversification: a phylogenetic approach. BMC Evol. Biol. 12, 88 (2012).
Article PubMed PubMed Central Google Scholar
Zane, L., Bargelloni, L. & Patarnello, T. Strategies for microsatellite isolation: a review. Mol. Ecol. 11, 1–16 (2002).
Article CAS PubMed Google Scholar
Malausa, T. et al. High‐throughput microsatellite isolation through 454 GS‐FLX Titanium pyrosequencing of enriched DNA libraries. Mol. Ecol. Resour. 11, 638–644 (2011).
Article CAS PubMed Google Scholar
Jennings, T. N., Knaus, B. J., Mullins, T. D., Haig, S. M. & Cronn, R. C. Multiplexed microsatellite recovery using massively parallel sequencing. Mol. Ecol. Resour. 11, 1060–1067 (2011).
Article CAS PubMed Google Scholar
Castoe, T. A. et al. Rapid microsatellite identification from Illumina paired-end genomic sequencing in two birds and a snake. PLoS One 7, e30953 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Pop, M. Genome assembly reborn: recent computational challenges. Briefings in Bioinformatics 10, 354–66 (2009).
Article CAS PubMed PubMed Central Google Scholar
Card, D. C. et al. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies. PLoS One 9, e106649 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Nevo, E. et al. Genomic microsatellite adaptive divergence of wild barley by microclimatic stress in ‘Evolution Canyon’, Israel. Biol. J. Linn. Soc. 84, 205–224 (2005).
Article Google Scholar
Estoup, A., Jarne, P. & Cornuet, J. M. Homoplasy and mutation model at microsatellite loci and their consequences for population genetics analysis. Mol. Ecol. 11, 1591–1604 (2002).
Article CAS PubMed Google Scholar
Chapuis, M. P. & Estoup, A. Microsatellite null alleles and estimation of population differentiation. Mol. Biol. Evol. 24, 621–631 (2007).
Article CAS PubMed Google Scholar
Hedrick, P. W. Perspective: highly variable loci and their interpretation in evolution and conservation. Evolution 53, 313–318 (1999).
Article PubMed Google Scholar
McKay, B. D. et al. Recent range-wide demographic expansion in a Taiwan endemic montane bird, Steere’s Liocichla (Liocichla steerii). BMC Evol. Biol. 10, 71 (2010).
Article CAS PubMed PubMed Central Google Scholar
Granevitze, Z. et al. Phylogenetic resolution power of microsatellites and various single‐nucleotide polymorphism types assessed in 10 divergent chicken populations. Anim. Genet. 45, 87–95 (2014).
Article CAS PubMed Google Scholar
Haussler, D. et al. Genome 10K: a proposal to obtain whole-genome sequence for 10 000 vertebrate species. J. Hered. 100, 659–674 (2009).
Article CAS Google Scholar
Robinson, G. E. et al. Creating a buzz about insect genomes. Science 331, 1386–1386 (2011).
Article ADS PubMed Google Scholar
Zhang, G. Genomics: Bird sequencing project takes off. Nature 522, 34–34 (2015).
Article CAS ADS PubMed Google Scholar
Hung, C. M. et al. Drastic population fluctuations explain the rapid extinction of the passenger pigeon. Proc. Natl. Acad. Sci. USA 111, 10636–10641 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Lingua Ke for help in Apodemus semotus sample collection. We are grateful to National Museum of Natural Science in Taiwan for providing other rodent samples for cross-amplification test. We thank Shou-Hsien Li for discussing research ideas and improving the manuscript. Support of this study came from Taiwan Ministry of Science and Technology (MOST 100-2621-B-003-006).

Author information

Authors and Affiliations

Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
Chih-Ming Hung
Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
Ai-Yun Yu, Yu-Ting Lai & Pei-Jen L. Shaner

Authors

Chih-Ming Hung
View author publications
You can also search for this author in PubMed Google Scholar
Ai-Yun Yu
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Ting Lai
View author publications
You can also search for this author in PubMed Google Scholar
Pei-Jen L. Shaner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.-M.H., A.-Y.Y. and P.L.S. designed the research; A.-Y.Y. performed the experiment; C.-M.H., A.-Y.Y. and Y.-T.L. analyzed the data; C.-M.H., A.-Y.Y. and P.L.S. wrote the paper.

Corresponding author

Correspondence to Pei-Jen L. Shaner.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (XLS 42 kb)

Supplementary Information (PDF 1354 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Hung, CM., Yu, AY., Lai, YT. et al. Developing informative microsatellite markers for non-model species using reference mapping against a model species’ genome. Sci Rep 6, 23087 (2016). https://doi.org/10.1038/srep23087

Download citation

Received: 27 September 2015
Accepted: 01 March 2016
Published: 15 March 2016
DOI: https://doi.org/10.1038/srep23087

This article is cited by

Genetic and molecular bases of esophageal Cancer among Iranians: an update
- Mohammad Reza Abbaszadegan
- Vahideh Keyvani
- Meysam Moghbeli
Diagnostic Pathology (2019)
A rapid and cost-effective approach for the development of polymorphic microsatellites in non-model species using paired-end RAD sequencing
- Dong-Xiu Xue
- Yu-Long Li
- Jin-Xian Liu
Molecular Genetics and Genomics (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.