Abstract
Ancestry informative single-nucleotide polymorphism (AISNP) panels for differentiating between East and Southeast Asian populations are scarce. This study aimed to identify AISNPs for ancestry assignment in five East and Southeast Asian populations and in Caucasians. Using arrays, we genotyped 145 autosomal SNPs in 627 DNA samples from individuals of six populations (234 Taiwanese Han, 91 Filipinos, 79 Indonesians, 60 Thais, 71 Vietnamese, and 92 Caucasians). A multiple logistic regression model and a multi-tier approach were used for ancestry classification. Of the 145 SNPs, 130 AISNPs were effective for classifying ethnic origin with fair accuracy. Among these 130 AISNPs, 122 were useful for stratification among the five Asian populations and 64 were effective for differentiating between Caucasians and the Asian populations. For differentiation between Caucasians and Asians, an accuracy rate of 100% was achieved in the 627 subjects with 50 optimal AISNPs selected from the 64 effective SNPs. For classification among the five Asian populations, each pairwise comparison between two Asian populations used 20 to 57 SNPs, and the accuracy rates of ancestry inference ranged from 74.1% to 100%. A further 14 degraded DNA samples with incomplete profiles were analyzed, and the ancestry of 12 of those subjects (85.7%) was accurately assigned. We thus developed a 130-AISNP panel for differentiating ethnic origin among the five East and Southeast Asian populations and Caucasians. This AISNP set may be helpful for individual ancestry assignment in these populations in forensic casework.
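The two-step idea described in the abstract (first separate Caucasians from Asians, then separate Asian populations pairwise, each step with its own SNP subset and a logistic regression model) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the genotypes are simulated, the allele frequencies, the population labels "AsianA" and "AsianB", the SNP index sets, and the gradient-descent fitting routine are hypothetical, and none of it reproduces the study's actual model, markers, or data.

```python
import math
import random

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def fit_logistic(X, y, lr=0.1, epochs=200):
    """Binary logistic regression fitted by batch gradient descent
    on mean-centred features. Returns (feature means, weights, bias)."""
    n, d = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(d)]
    Xc = [[row[j] - means[j] for j in range(d)] for row in X]
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(Xc, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            grad_b += err
            for j in range(d):
                grad_w[j] += err * xi[j]
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return means, w, b

def predict_proba(model, x):
    """Probability of the positive class for one genotype vector."""
    means, w, b = model
    return sigmoid(sum(wj * (xj - mj) for wj, xj, mj in zip(w, x, means)) + b)

# Hypothetical allele frequencies for 20 simulated SNPs (not from the study).
# SNPs 0-9 separate Caucasians from Asians; SNPs 10-19 separate the two Asian groups.
FREQS = {
    "Caucasian": [0.9] * 10 + [0.5] * 10,
    "AsianA": [0.1] * 10 + [0.8] * 10,
    "AsianB": [0.1] * 10 + [0.2] * 10,
}
TIER1_SNPS = list(range(0, 10))
TIER2_SNPS = list(range(10, 20))

def simulate(rng, n_per_pop):
    """Draw genotypes coded 0/1/2 (alternate-allele count) for each population."""
    data = []
    for pop, freqs in FREQS.items():
        for _ in range(n_per_pop):
            genotype = [(rng.random() < p) + (rng.random() < p) for p in freqs]
            data.append((genotype, pop))
    return data

def subset(genotype, idx):
    return [genotype[j] for j in idx]

def train(data):
    """Tier 1: Caucasian vs Asian. Tier 2: AsianA vs AsianB, Asian samples only."""
    tier1 = fit_logistic([subset(g, TIER1_SNPS) for g, _ in data],
                         [1 if pop == "Caucasian" else 0 for _, pop in data])
    asian = [(g, pop) for g, pop in data if pop != "Caucasian"]
    tier2 = fit_logistic([subset(g, TIER2_SNPS) for g, _ in asian],
                         [1 if pop == "AsianA" else 0 for _, pop in asian])
    return tier1, tier2

def classify(models, genotype):
    """Route a sample through tier 1, then tier 2 only if it is called Asian."""
    tier1, tier2 = models
    if predict_proba(tier1, subset(genotype, TIER1_SNPS)) >= 0.5:
        return "Caucasian"
    if predict_proba(tier2, subset(genotype, TIER2_SNPS)) >= 0.5:
        return "AsianA"
    return "AsianB"

rng = random.Random(0)
models = train(simulate(rng, 40))      # training samples
test_set = simulate(rng, 40)           # independent hold-out samples
accuracy = sum(classify(models, g) == pop for g, pop in test_set) / len(test_set)
print(f"hold-out accuracy: {accuracy:.3f}")
```

With more than two Asian populations, the second tier would need one such model per pair of populations, which is consistent with the abstract reporting a separate SNP count (20 to 57) for each pairwise comparison; how the pairwise calls are combined into a final assignment is not stated in the abstract and is left out of this sketch.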
Acknowledgements
The authors thank the National Center for Genome Medicine at Academia Sinica, Taiwan, for genotyping technical support. This Center was supported by grants from the National Core Facility Program for Biotechnology of the National Science Council, Taiwan, R.O.C. We also thank Ms. Pi-Mei Hsu and Ms. Shwu-Fang Li for technical support with DNA extraction. Special thanks are due to the many hundreds of individuals who volunteered to give biological samples for gene frequency studies.
This work was supported by the Ministry of Science and Technology, Taiwan, R.O.C. [grant number NSC 100-2320-B-002-013-MY3]; and the Institute of Forensic Medicine, Ministry of Justice, Taiwan, R.O.C. [grant number 104-1301-05-05-06].
Ethics declarations
Ethical approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Electronic supplementary material
ESM 1 (PDF 1030 kb)
Cite this article
Hwa, HL., Lin, CP., Huang, TY. et al. A panel of 130 autosomal single-nucleotide polymorphisms for ancestry assignment in five Asian populations and in Caucasians. Forensic Sci Med Pathol 13, 177–187 (2017). https://doi.org/10.1007/s12024-017-9863-8
DOI: https://doi.org/10.1007/s12024-017-9863-8