Population genetics and forensic utility of 23 autosomal PowerPlex Fusion 6C STR loci in the Kuwaiti population

Haidar, Mahdi; Abbas, Fatimah A.; Alsaleh, Hussain; Haddrill, Penelope R.

doi:10.1038/s41598-021-81425-y

Published: 21 January 2021

Mahdi Haidar^1,2, Fatimah A. Abbas^1,2, Hussain Alsaleh^1,2 & Penelope R. Haddrill^1

Scientific Reports volume 11, Article number: 1865 (2021)

Abstract

This study evaluates the forensic utility of 23 autosomal short tandem repeat markers in 400 samples from the Kuwaiti population, of which four markers (D10S1248, D22S1045, D2S441 and SE33) are reported for the first time for Kuwait. All the markers were shown to exhibit no deviation from Hardy–Weinberg equilibrium, nor any linkage disequilibrium between and within loci, indicating that these loci are inherited independently, and their allele frequencies can be used to estimate match probabilities in the Kuwaiti population. The low combined match probability of 7.37 × 10^–30 and the high paternity indices generated by these loci demonstrate the usefulness of the PowerPlex Fusion 6C kit for human identification in this population, as well as to strengthen the power of paternity testing. Off-ladder alleles were seen at several loci, and these were identified by examining their underlying nucleotide sequences. Principal component analysis (PCA) and STRUCTURE showed no genetic structure within the Kuwaiti population. However, PCA revealed a correlation between geographic and genetic distance. Finally, phylogenetic trees demonstrated a close relationship between Kuwaitis and Middle Easterners at a global level, and a recent common ancestry for Kuwait with its northern neighbours of Iraq and Iran, at a regional level.

Introduction

The State of Kuwait is located on the Arabian Gulf in the southwest of the Asian continent and in the heart of the Middle East. Kuwait is bordered by the Kingdom of Saudi Arabia in the south, the Republic of Iraq in the north and west, and Iran in the east, across the Gulf. Kuwait’s population is about 4.8 million, which includes 1.4 million Kuwaiti nationals and 3.4 million foreign nationals, according to the 2019 census (https://www.paci.gov.kw/stat/Default.aspx). Currently, forensic DNA analysis in Kuwait is carried out by the Kuwaiti Identification DNA Laboratory (KIDL) using only short tandem repeat (STR) markers, including autosomal STRs, Y-chromosome STRs (Y-STRs) and X-chromosome STRs (X-STRs). Autosomal STRs are routinely used both for identification of individuals and paternity testing, whereas Y-STRs and X-STRs are used less frequently, and only for specific scenarios.

To date, few papers have been published investigating the forensic utility and genetic diversity of autosomal STR markers in the Kuwaiti population. In 2008, Alenizi and colleagues reported the allele frequencies of 15 STR loci included in the AmpFℓSTR Identifiler kit (Thermo Fisher Scientific, MA, USA)¹. Based on these 15 STRs, the F_ST distances between Kuwaiti nationals and foreign nationals from seven other populations residing in Kuwait were found to be consistent with their geographical distances². Another recent study investigated the forensic utility of 25 autosomal STRs included in two separate kits: the PowerPlex CS7 system and the PowerPlex 21 system (Promega Corporation, WI, USA)³. Although these existing STRs are efficient for analysing cases of simple relationships, more STRs are increasingly required, particularly for complex paternity cases or to increase the discrimination power in cases of partial DNA profiles and DNA mixtures.

Recently, Promega launched the PowerPlex Fusion 6C kit, a six-dye kit that can amplify 27 loci, including the 20 autosomal loci in the expanded CODIS set (CSF1PO, FGA, TH01, TPOX, vWA, D1S1656, D2S1338, D2S441, D3S1358, D5S818, D7S820, D8S1179, D10S1248, D12S391, D13S317, D16S539, D18S51, D19S433, D21S11, and D22S1045)⁴, three additional autosomal STRs (PentaE, PentaD, and SE33) to increase the power of discrimination, two sex chromosome markers (Amelogenin and DYS391), and two rapidly mutating Y-STRs (DYS570 and DYS576)⁵. The PowerPlex Fusion 6C kit was validated by multi-laboratory evaluation following SWGDAM guidelines⁵.

Before utilising this kit for criminal and relationship cases in Kuwait, population and forensic statistical data for the loci in the kit must be evaluated. In this study, we aim to increase the amount of genetic data available for the Kuwaiti population, using the 23 autosomal STRs in the PowerPlex Fusion 6C kit, of which four loci (D10S1248, D22S1045, D2S441 and SE33) have not been reported before for Kuwait. In addition, we aim to evaluate the forensic utility of these autosomal STRs in this underrepresented region, and to investigate the utility of these markers in population genetic differentiation by examining the genetic distance between the Kuwaiti population and other global populations for which data are available.

Materials and methods

Samples and genotyping

Blood samples were collected on Whatman FTA cards (GE Healthcare Life Sciences, IL, USA) from 400 unrelated Kuwaiti individuals (253 males and 147 females). DNA was amplified directly, without quantification, from a 1.2 mm FTA card punch, according to the directions in the PowerPlex Fusion 6C manual, using a SureCycler 8800 thermal cycler (Agilent Technologies, CA, USA). Separation and detection of the DNA fragments were carried out using an Applied Biosystems 3500 Genetic Analyzer (Thermo Fisher Scientific) with the internal lane standard WEN ILS 500 and the allelic ladder provided with the PowerPlex Fusion 6C kit. Genotype determination and allele calling, for the 23 autosomal loci only, were carried out using GeneMapper ID-X software version 1.4 (Thermo Fisher Scientific).

Statistical analysis

Data analysis was carried out for the 23 autosomal loci only (the sex chromosomes are not included in this paper). Arlequin statistical software version 3.5 was used to calculate allele frequencies, to test for linkage disequilibrium, and to test for deviation from the Hardy–Weinberg Equilibrium⁶. Forensic parameters, including the random match probability (RMP), discrimination power (DP), power of exclusion (PE), typical paternity index (TPI) and polymorphic information content (PIC), were calculated using STRAF (http://cmpg.unibe.ch/shiny/STRAF/), an online tool for STR data analysis⁷.
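As an illustration of how the single-locus forensic parameters listed above are typically derived, the following is a minimal Python sketch based on commonly used textbook definitions (RMP as the sum of squared observed genotype frequencies, DP = 1 − RMP, PE = h²(1 − 2hH²) and TPI = 1/(2H), where h and H are the observed heterozygote and homozygote proportions). The function name `forensic_parameters` and the toy genotypes are hypothetical, and the exact formulas implemented in STRAF may differ in detail.

```python
from collections import Counter


def forensic_parameters(genotypes):
    """Single-locus forensic parameters from a list of (allele, allele) tuples."""
    n = len(genotypes)
    # Allele frequencies by direct gene counting (2n allele copies).
    allele_counts = Counter(a for g in genotypes for a in g)
    freqs = [c / (2 * n) for c in allele_counts.values()]
    # Observed genotype frequencies, ignoring allele order.
    geno_counts = Counter(tuple(sorted(g)) for g in genotypes)
    rmp = sum((c / n) ** 2 for c in geno_counts.values())
    hom = sum(1 for a, b in genotypes if a == b) / n  # homozygote proportion H
    het = 1.0 - hom                                   # heterozygote proportion h
    sum_p2 = sum(p ** 2 for p in freqs)
    sum_p4 = sum(p ** 4 for p in freqs)
    return {
        "RMP": rmp,
        "DP": 1.0 - rmp,
        # PIC = 1 - sum(p_i^2) - sum_{i<j} 2 p_i^2 p_j^2
        "PIC": 1.0 - sum_p2 - (sum_p2 ** 2 - sum_p4),
        "PE": het ** 2 * (1.0 - 2.0 * het * hom ** 2),
        "TPI": 1.0 / (2.0 * hom) if hom > 0 else float("inf"),
    }


# Hypothetical toy data: four individuals typed at one locus.
toy = [("9", "9"), ("9", "10"), ("10", "10"), ("9", "10")]
params = forensic_parameters(toy)
```

Under these definitions, a TPI of 8.333 corresponds to an observed homozygote proportion of 0.06, since 1/(2 × 0.06) ≈ 8.333.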

Intra-population genetic structure among Kuwaitis

Countries in the Arabian Peninsula, including Kuwait, have a high rate of consanguineous marriage, which causes differential distribution of alleles among families and tribes, resulting in population genetic stratification^{8, 9}. Newly introduced markers must therefore be assessed for the presence of any population structure, to avoid calculating forensic parameters from allele frequencies taken from the total population rather than the relevant subpopulation. Stratification also reduces discrimination power, because the chance of random individuals possessing similar genotypes is higher within a subpopulation than within the total population¹⁰. Two methods were therefore used to detect genetic structure in the population: principal component analysis (PCA) and a Bayesian-based method implemented in STRUCTURE version 2.3.4^11,12,13.

In order to demonstrate whether these two methods were able to cluster the samples into their real subpopulations, each sample was categorised into one of three ancestral subgroups (K = 3) based on the donor’s surname. It has previously been found that the Kuwaiti population is mainly composed of settlers coming from three different regions: the Arabian Peninsula (from Saudi Arabia), the desert (representing nomadic tribes), and Persian countries (mainly from Iran)^{9, 14,15,16,17}. On this basis, the samples were categorised into three groups: KW-1 (n = 162) representing individuals originating from the Arabian Peninsula, KW-2 (n = 163), which consists of those coming from Persian countries and Iraq (north), and KW-3 (n = 75) composed of Bedouin individuals coming from nomadic tribes. PCA was carried out on allele frequencies at the 23 autosomal STR loci for the different population groups KW-1–KW-3 using R software³³ and visualised using the factoextra package³⁴.

In contrast to PCA, which is an unsupervised clustering algorithm, STRUCTURE (a Bayesian-based approach) takes a range of numbers of populations (K) in order to calculate the proportion of the genome of each individual in the sample originating from each inferred population¹¹. STRUCTURE calculates the likelihood of the data (X) for a range of K values, and the true value of K is taken to be the one that maximises Ln P(X|K). However, Evanno et al.¹⁸ found that this maximal value does not always provide the correct number of clusters in the data. Instead, the maximal value of the rate of change (Delta K) in Ln P(X|K) between successive K values more accurately infers the true number of genetic clusters in the data¹⁸. As such, both Ln P(X|K) and Delta K at each K were calculated and reported. STRUCTURE was run without population information, as recommended in the STRUCTURE documentation, in order to check whether the results approximately agreed with the separation of samples into their subgroups. Thus, the predefined groups (KW-1 to KW-3) were only included as a population label rather than as prior information for the analysis. The parameters for the analysis were set as follows: ‘admixture’ and ‘correlated allele frequencies’ models using 100,000 Markov Chain Monte Carlo (MCMC) steps for each run, with the first 100,000 discarded as a burn-in, and K was set from 1 to 10. At each K, the analysis was repeated five times in order to test the results for consistency. The results were visualised using CLUMPAK (Clustering Markov Packager Across K, available at http://clumpak.tau.ac.il/index.html)¹⁹, and the best K was calculated using STRUCTURE HARVESTER (available at http://taylor0.biology.ucla.edu/structureHarvester/)²⁰.
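The Delta K statistic described above can be sketched as follows. This is a minimal illustration that assumes Delta K is computed from the mean Ln P(X|K) across replicate runs as |L(K+1) − 2L(K) + L(K−1)| divided by the standard deviation of L(K); it is not claimed to reproduce the STRUCTURE HARVESTER output exactly. The function name `evanno_delta_k` and the log-likelihood values are hypothetical.

```python
from statistics import mean, stdev


def evanno_delta_k(lnp_runs):
    """Delta K per K from a dict mapping K -> list of Ln P(X|K) over replicate runs."""
    ks = sorted(lnp_runs)
    avg = {k: mean(lnp_runs[k]) for k in ks}
    delta = {}
    for k in ks:
        # Delta K is only defined where both neighbouring K values were run.
        if k - 1 not in avg or k + 1 not in avg:
            continue
        second_diff = abs(avg[k + 1] - 2.0 * avg[k] + avg[k - 1])
        sd = stdev(lnp_runs[k])
        delta[k] = second_diff / sd if sd > 0 else float("inf")
    return delta


# Hypothetical log likelihoods from two replicate runs at each K.
runs = {
    1: [-100.0, -102.0],
    2: [-80.0, -82.0],
    3: [-78.0, -80.0],
    4: [-77.0, -79.0],
}
delta_k = evanno_delta_k(runs)
best_k = max(delta_k, key=delta_k.get)
```

Because the statistic needs both K − 1 and K + 1, it cannot be evaluated at the smallest or largest K in the tested range, which is why Delta K alone cannot support K = 1.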

Inter-population genetic structure and population relationships

To assess the genetic distance between the Kuwaiti sample and other global populations, PCA was conducted based on allele frequencies of the 23 autosomal loci for 57 global populations grouped into seven continental regions: Africa (AFR), America (AMR), Central and South Asia (C_S_ASIA), the Middle East (ME), Europe (EUR) and East Asia (E_ASIA). Data for the global populations were obtained from the HGDP-CEPH Human Genome Diversity Panel using the online forensic STR frequency browser, popSTR (http://spsmart.cesga.es/popstr.php)^21,22,23. Data from Lebanon (LEB) and an Indian (IND) population from Madhya Pradesh typed for the 23 autosomal loci²⁴ were also included in the analysis. Genetic distance was also assessed at a regional level using allele frequencies for the 13 of the 23 autosomal loci (CSF1PO, D13S317, D16S539, D18S51, D21S11, D3S1358, D5S818, D7S820, D8S1179, FGA, TH01, TPOX, vWA) that are shared between the data reported in this study and other studies of Kuwait and neighbouring countries: Kuwait (KW1³ and KW2²), Iran (IRN²⁵ and IRN1²), Saudi Arabia (SA²⁶ and SA1²⁷), Qatar (QAT²⁸), Oman (OMN²⁹), Yemen (YEM²⁹), United Arab Emirates (UAE³⁰), Bahrain (BAH³¹), and Iraq (IRQ³² and IRQ1²). PCA was conducted using R software³³ and visualised using the factoextra package³⁴.

In addition to the PCA, we studied the genetic relationship between the Kuwaiti samples and the other populations at both the continental and regional levels, using phylogenetic trees. These trees were based on pairwise genetic distances (D_A) as defined by Nei et al.³⁵, which were calculated from the allele frequencies of the populations using POPTREE2 software³⁶. Neighbour-joining (NJ) trees were then constructed using MEGA X software version 10.0.5³⁷.
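For reference, the D_A distance of Nei et al. between two populations is D_A = 1 − (1/r) Σ_j Σ_i √(x_ij · y_ij), where x_ij and y_ij are the frequencies of allele i at locus j in the two populations and r is the number of loci. The sketch below illustrates this formula only; it is not the POPTREE2 implementation, and the function name `nei_da`, the data layout and the frequencies are assumptions made for the example.

```python
from math import sqrt


def nei_da(pop_x, pop_y):
    """Nei's D_A between two populations.

    Each population is a dict: locus -> {allele: frequency}.
    Only loci present in both populations are used.
    """
    loci = sorted(set(pop_x) & set(pop_y))
    if not loci:
        raise ValueError("the two populations share no loci")
    total = 0.0
    for locus in loci:
        fx, fy = pop_x[locus], pop_y[locus]
        # Alleles missing from either population contribute zero to the sum.
        total += sum(sqrt(fx[a] * fy[a]) for a in set(fx) & set(fy))
    return 1.0 - total / len(loci)


# Hypothetical allele frequencies at a single locus.
pop_a = {"TH01": {"6": 0.5, "7": 0.5}}
pop_b = {"TH01": {"6": 0.5, "8": 0.5}}
d_same = nei_da(pop_a, pop_a)
d_diff = nei_da(pop_a, pop_b)
```

Identical frequency distributions give a distance of 0, and populations sharing no alleles at any locus give the maximum distance of 1.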

Ethics statement

The study was performed in accordance with the University of Strathclyde code of practice on investigations involving human beings, and ethical approval (reference number DEC18/PAC06) was granted by the Department of Pure and Applied Chemistry Ethics Committee. Written, informed consent was obtained from all participants prior to sampling.

Results and discussion

Allele frequencies and forensic performance

Full PowerPlex Fusion 6C STR profiles were recovered from blood samples taken from 400 Kuwaiti individuals. Table 1 shows the allele frequencies and forensic parameters calculated for these samples. Similar to studies of other global populations^{38, 39}, SE33 was the most discriminative locus in the Kuwaiti population, having 45 different alleles (PIC = 0.945). In contrast, TPOX was the least discriminative locus, with only eight different alleles (PIC = 0.616). The calculated combined match probability (CMP) was 7.37 × 10^–30, meaning that the probability of observing two identical profiles for the 23 autosomal loci in the Kuwaiti population was 1 in 1.36 × 10²⁹. The TPI ranged between 1.439 (TPOX) and 8.333 (SE33), and the combined PE was > 99.9999%. These high values indicate the usefulness of the PowerPlex Fusion 6C kit for both human identification and paternity testing in the Kuwaiti population.
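The relationship between the combined values quoted above and the per-locus values can be illustrated with a short Python sketch. It assumes independence between loci, so that the CMP is the product of the per-locus match probabilities and the combined PE is one minus the product of the per-locus non-exclusion probabilities. The function names and the per-locus numbers are hypothetical; only the CMP of 7.37 × 10^–30 is taken from the text.

```python
from math import prod


def combined_match_probability(per_locus_rmp):
    """Product of per-locus match probabilities (assumes independent loci)."""
    return prod(per_locus_rmp)


def combined_power_of_exclusion(per_locus_pe):
    """1 minus the product of per-locus non-exclusion probabilities."""
    return 1.0 - prod(1.0 - pe for pe in per_locus_pe)


# Hypothetical per-locus values, for illustration only.
toy_cmp = combined_match_probability([0.1, 0.2])
toy_cpe = combined_power_of_exclusion([0.5, 0.5])

# Reciprocal of the CMP reported in the text.
reported_cmp = 7.37e-30
one_in = 1.0 / reported_cmp  # approximately 1.36e29
```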

Table 1 Allele frequencies among 400 Kuwaiti individuals typed at 23 autosomal STR loci in the PowerPlex Fusion 6C kit.

Statistical analysis of populations

No significant deviation from the expectations of the Hardy–Weinberg Equilibrium was detected at any locus in the Kuwaiti genotypic data. The alleles at each PowerPlex Fusion 6C autosomal STR locus can therefore be treated as independent, and genotype frequencies can be estimated from allele frequencies. Association between alleles at all possible pairwise combinations of loci was evaluated using the linkage disequilibrium test. Significant linkage disequilibrium was detected between 22 (of a total of 253) pairs of loci (p < 0.05). However, after Bonferroni correction of the significance level using the number of tests (0.05/253 = 0.000198), none of the pairs of loci showed significant linkage disequilibrium, indicating that all loci are statistically independent. Therefore, their allele frequencies can be multiplied together to estimate match probabilities in the Kuwaiti population.
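The number of pairwise tests and the corrected threshold quoted above follow directly from the number of loci, as this short Python sketch shows. The helper `significant_after_bonferroni` and the p-values passed to it are hypothetical and serve only to illustrate how a pair that is nominally significant at 0.05 can fail the corrected threshold.

```python
from math import comb

n_loci = 23
n_pairs = comb(n_loci, 2)          # all unordered pairs of loci
alpha = 0.05
alpha_corrected = alpha / n_pairs  # Bonferroni-corrected significance level


def significant_after_bonferroni(p_values, alpha=0.05):
    """Return the p-values that remain significant after Bonferroni correction."""
    threshold = alpha / len(p_values)
    return [p for p in p_values if p < threshold]


# Hypothetical p-values: all below 0.05, but only one survives the correction.
toy_hits = significant_after_bonferroni([0.04, 0.03, 0.001, 0.02])
```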

Off-ladder and novel alleles

Alleles that could not be identified using the GeneMapper allelic ladder for the PowerPlex Fusion 6C kit were assigned as off-ladder (OL) alleles, and were observed in 13 samples. These samples were re-amplified, and all OL alleles were confirmed. OL alleles were observed at the PentaE (5 alleles), PentaD (1 allele), D22S1045 (1 allele), SE33 (5 alleles), and D18S51 (1 allele) loci. The samples were previously sequenced using the Verogen ForenSeq DNA Signature Prep kit (manuscript in preparation), and these data were examined to determine whether the undesignated alleles at the PentaE, PentaD, D22S1045 and SE33 loci could be identified; the repeat structure sequences from this dataset are shown in Table 2 and permitted all alleles to be identified. The D18S51 locus is not included in the ForenSeq kit; therefore, its OL allele was identified using the allelic ladder bins created in GeneMapper software.

Table 2 Off-ladder alleles in the Kuwaiti population identified using sequencing data.

All of the identified alleles have been reported previously in the STRBase database (an online STR database created by the United States National Institute of Standards and Technology (NIST)⁴⁰), except for the PentaD 11.2 allele, which is a novel allele not reported before in the literature.

Intra-population genetic structure

Markers that are used for human identification may have weaker discrimination power in populations with genetic structure than in unstructured populations, because the presence of subpopulation groups affects the random match probability. Individuals from the same subpopulation tend to possess similar alleles, so the likelihood of random individuals possessing similar genotypes increases in the presence of genetic structure¹⁰. Although no significant deviation from the expectations of the Hardy–Weinberg Equilibrium was detected at any marker in this study, which argues against genetic stratification, it is still useful to assess whether the markers reveal any genetic clusters within the data. To achieve this, PCA was carried out on the DNA profiles obtained from the Kuwaiti samples for the 23 autosomal PowerPlex Fusion 6C markers. PCA is an unsupervised method that does not require any prior information about the ancestral origin of the samples; it groups the samples by their similarity to each other, so that homogeneous clusters of individuals can be seen on a PCA plot. As expected, the PCA plot (Fig. 1) did not show any pattern of segregation that could be related to the ancestral population of origin of the individuals in the data, indicating that there is no genetic structure within the sample.

Another widely used method to infer population structure in genetic data is the Bayesian-based model implemented in the STRUCTURE software, which calculates how likely each individual in the data is to belong to each of a number of K (predetermined by the user) populations, and then uses this information to assign individuals into population subgroups¹⁸. The analysis was run without population information, and the mean log likelihood across five repeated runs of the analysis for each value of K (from 2 to 10) was estimated. The results showed inconsistency in estimating the log likelihood at K = 5 and over, which is indicated by the high standard deviation (SD), as presented in Supplementary Figure S1A. Based on the method described in Evanno et al.¹⁸, the most likely inferred value of K was 7, as this is the number of populations at which the highest Delta K value was recorded (Supplementary Figure S1B).

However, whilst the results indicated that the data are most probable at K = 7, there was no clear genetic differentiation between individuals in the sample. This can be seen in Supplementary Figure S2, which shows no clear signal of structuring between the three subpopulation groups, in terms of the proportion of each individual’s genetic ancestry assigned to each population, regardless of the number of populations assumed. This is further supported by the relatively small increases in mean log likelihood and Delta K values from K = 2 to K = 3, suggesting that there is limited evidence for any genetic structuring within the Kuwaiti population sample, in agreement with the PCA above.

Genetic distance

To investigate the genetic distance between the Kuwaiti population and other global populations, allele frequencies for the 23 autosomal STRs in the PowerPlex Fusion 6C kit were pooled from the HGDP-CEPH global panel, which contains 57 populations grouped into seven global regions, and consists of eight African (N = 507), six American (N = 551), nine Central and South Asian (N = 202), four Middle Eastern (N = 160), 11 European (N = 2135), and 17 East Asian (N = 227) populations. An Indian population (N = 374) and a Lebanese population (N = 505) had also been typed for the 23 loci, and were therefore added to the analysis. Both PCA and phylogenetic analyses were carried out, and the resulting plots characterise the genetic differentiation between populations. The distribution of the populations on the PCA plot (Fig. 2) and the genetic distances between them on the NJ tree (Fig. 4A) show that the Kuwaiti population is genetically closest to the Lebanese and Middle Eastern groups, which include Mozabite, Druze, Palestinian and Bedouin populations. This is explained by gene flow between these geographically close locations, which leads to more similar allele frequency distributions among them.

At the regional level, genetic distance was assessed based on the 13 loci shared between this study and studies examining other populations from the Arabian Peninsula. The resulting PCA plot (Fig. 3), and NJ tree (Fig. 4B) show that our Kuwaiti dataset broadly clustered with the previously published Kuwaiti data, and was genetically closer to the countries in the north of the region such as Iraq and Iran. In contrast, there was a higher level of genetic differentiation between Kuwait and Saudi Arabia, Yemen and Qatar, which were clustered in the upper-right part of the PCA plot, and Bahrain, Oman and UAE, which were clustered in the lower part of the plot.

In this study, 30% of individuals declared their origins as being from the north (Iraq and Iran), 39% from the south region (Saudi Arabia, Bedouin and Bahrain), and 24% had parents of different origin (admixed). Therefore, the closer genetic relationship of our samples to the northern region might be due to the presence of these individuals. There is no information available about the population of origin for the samples collected in the two previous Kuwaiti studies (KW1³ and KW2²). It is therefore not possible to determine whether sampling from different sub-populations could explain why, in contrast to our sample, these two Kuwaiti samples cluster more closely with the Saudi Arabian sample than the samples from Iran and Iraq. Overall, it can be seen that the allele frequencies of the 23 autosomal markers in the PowerPlex Fusion 6C kit can be successfully used to separate both geographically distant global populations and closely related populations on the basis of their genetic distance, making them a good choice for detecting genetic differentiation between populations.

Conclusion

This study evaluated the forensic utility of the 23 autosomal STR loci included in the Promega PowerPlex Fusion 6C kit for the Kuwaiti population. Among these loci, D10S1248, D22S1045, D2S441 and SE33 are reported for the first time for Kuwait. The genetic data indicate that these 23 autosomal STRs are highly polymorphic in the Kuwaiti population and are of high value for human identification and paternity testing. STRUCTURE and PCA show no signature of genetic structuring of the Kuwaiti population into subpopulations. Comparison of the Kuwaiti population with other global populations indicates that Kuwait clusters with other Middle Eastern populations, and shows a close relationship with Iran and Iraq, suggesting that they may share common ancestry.

References

Alenizi, M., Goodwin, W., Ismael, S. & Hadi, S. STR data for the AmpFℓSTR® Identifiler® loci in Kuwaiti population. Leg. Med. 10, 321–325. https://doi.org/10.1016/j.legalmed.2008.05.003 (2008).
Al-Enizi, M. et al. Population genetic analyses of 15 STR loci from seven forensically-relevant populations residing in the state of Kuwait. Forensic Sci. Int. Genet. 7, e106–e107. https://doi.org/10.1016/j.fsigen.2013.04.007 (2013).
Al-enizi, M. et al. Population data on 25 autosomal STRs for 500 unrelated Kuwaitis. Forensic Sci. Int. Genet. 12, 126–127. https://doi.org/10.1016/j.fsigen.2014.05.008 (2014).
Hares, D. R. Selection and implementation of expanded CODIS core loci in the United States. Forensic Sci. Int. Genet. 17, 33–34. https://doi.org/10.1016/j.fsigen.2015.03.006 (2015).
Ensenberger, M. G. et al. Developmental validation of the PowerPlex® Fusion 6C System. Forensic Sci. Int. Genet. 21, 134–144. https://doi.org/10.1016/j.fsigen.2015.12.011 (2016).
Excoffier, L. & Lischer, H. E. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Resour. 10, 564–567. https://doi.org/10.1111/j.1755-0998.2010.02847.x (2010).
Gouy, A. & Zieger, M. STRAF-A convenient online tool for STR data evaluation in forensic genetics. Forensic Sci. Int. Genet. 30, 148–151. https://doi.org/10.1016/j.fsigen.2017.07.007 (2017).
Teebi, A. S. Autosomal recessive disorders among Arabs: An overview from Kuwait. J. Med. Genet. 31, 224–233 (1994).
Alsmadi, O. et al. Genetic substructure of Kuwaiti population reveals migration history. PLoS One 8, e74913. https://doi.org/10.1371/journal.pone.0074913 (2013).
Balding, D. J. Weight-of-Evidence for Forensic DNA Profiles (Wiley, New York, 2005).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Falush, D., Stephens, M. & Pritchard, J. K. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003).
Hubisz, M. J., Falush, D., Stephens, M. & Pritchard, J. K. Inferring weak population structure with the assistance of sample group information. Mol. Ecol. Resour. 9, 1322–1332. https://doi.org/10.1111/j.1755-0998.2009.02591.x (2009).
Triki-Fendri, S. et al. Genetic structure of Kuwaiti population revealed by Y-STR diversity. Ann. Hum. Biol. 37, 827–835. https://doi.org/10.3109/03014461003720296 (2010).
Theyab, J. B., Al-Bustan, S. & Crawford, M. H. The genetic structure of the Kuwaiti population: mtDNA Inter- and intra-population variation. Hum. Biol. 84, 379–403. https://doi.org/10.3378/027.084.0403 (2012).
John, S. E. et al. Kuwaiti population subgroup of nomadic Bedouin ancestry-Whole genome sequence and analysis. Genom. Data 3, 116–127. https://doi.org/10.1016/j.gdata.2014.11.016 (2015).
Triki-Fendri, S. et al. Genetic structure of the Kuwaiti population revealed by paternal lineages. Am. J. Human Biol. 28, 203–212. https://doi.org/10.1002/ajhb.22773 (2016).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 14, 2611–2620. https://doi.org/10.1111/j.1365-294X.2005.02553.x (2005).
Kopelman, N. M., Mayzel, J., Jakobsson, M., Rosenberg, N. A. & Mayrose, I. Clumpak: A program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. 15, 1179–1191. https://doi.org/10.1111/1755-0998.12387 (2015).
Earl, D. A. & von Holdt, B. M. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361. https://doi.org/10.1007/s12686-011-9548-7 (2012).
Amigo, J. et al. pop.STR—an online population frequency browser for established and new forensic STRs. Forensic Sci. Int. Genet. Suppl. Ser. 2, 361–362. https://doi.org/10.1016/j.fsigss.2009.08.178 (2009).
Phillips, C. et al. Global population variability in Qiagen Investigator HDplex STRs. Forensic Sci. Int. Genet. 8, 36–43. https://doi.org/10.1016/j.fsigen.2013.07.006 (2014).
Phillips, C. et al. Analysis of global variability in 15 established and 5 new European Standard Set (ESS) STRs using the CEPH human genome diversity panel. Forensic Sci. Int. Genet. 5, 155–169. https://doi.org/10.1016/j.fsigen.2010.02.003 (2011).
Dixit, S. et al. Forensic genetic analysis of population of Madhya Pradesh with PowerPlex Fusion 6C™ Multiplex System. Int. J. Legal Med. 133, 803–805. https://doi.org/10.1007/s00414-019-02017-0 (2019).
Hedjazi, A., Nikbakht, A., Hosseini, M., Hoseinzadeh, A. & Hosseini, S. M. Allele frequencies for 15 autosomal STR loci in Fars province population, southwest of Iran. Legal Med. (Tokyo, Jpn.) 15, 226–228. https://doi.org/10.1016/j.legalmed.2013.01.005 (2013).
Khubrani, Y. M., Wetton, J. H. & Jobling, M. A. Analysis of 21 autosomal STRs in Saudi Arabia reveals population structure and the influence of consanguinity. Forensic Sci. Int. Genet. 39, 97–102. https://doi.org/10.1016/j.fsigen.2018.12.006 (2019).
Alsafiah, H. M., Goodwin, W. H., Hadi, S., Alshaikhi, M. A. & Wepeba, P. P. Population genetic data for 21 autosomal STR loci for the Saudi Arabian population using the GlobalFiler® PCR amplification kit. Forensic Sci. Int. Genet. 31, e59–e61. https://doi.org/10.1016/j.fsigen.2017.09.014 (2017).
Perez-Miranda, A. M., Alfonso-Sanchez, M. A., Pena, J. A. & Herrera, R. J. Qatari DNA variation at a crossroad of human migrations. Hum. Hered. 61, 67–79. https://doi.org/10.1159/000092648 (2006).
Alshamali, F., Alkhayat, A. Q., Budowle, B. & Watson, N. D. STR population diversity in nine ethnic populations living in Dubai. Forensic Sci. Int. 152, 267–279. https://doi.org/10.1016/j.forsciint.2004.09.133 (2005).
Jones, R. J., Tayyare, W. A., Tay, G. K., Alsafar, H. & Goodwin, W. H. Population data for 21 autosomal short tandem repeat markers in the Arabic population of the United Arab Emirates. Forensic Sci. Int. Genet. 28, e41–e42. https://doi.org/10.1016/j.fsigen.2017.02.015 (2017).
Al-Snan, N. R., Messaoudi, S., Babu, S. R. & Bakhiet, M. Population genetic data of the 21 autosomal STRs included in GlobalFiler kit of a population sample from the Kingdom of Bahrain. PLoS One 14, e0220620. https://doi.org/10.1371/journal.pone.0220620 (2019).
Farhan, M. M., Hadi, S., Iyengar, A. & Goodwin, W. Population genetic data for 20 autosomal STR loci in an Iraqi Arab population: Application to the identification of human remains. Forensic Sci. Int. Genet. 25, e10–e11. https://doi.org/10.1016/j.fsigen.2016.07.017 (2016).
R Core Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org/ (2019).
Kassambara, A. & Mundt, F. factoextra: Extract and Visualize the Results of Multivariate Data Analyses. https://CRAN.R-project.org/package=factoextra (2017).
Nei, M., Tajima, F. & Tateno, Y. Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data. J. Mol. Evol. 19, 153–170. https://doi.org/10.1007/bf02300753 (1983).
Takezaki, N., Nei, M. & Tamura, K. POPTREEW: Web version of POPTREE for constructing population trees from allele frequency data and computing some other quantities. Mol. Biol. Evol. 31, 1622–1624. https://doi.org/10.1093/molbev/msu093 (2014).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. https://doi.org/10.1093/molbev/msy096 (2018).
Butler, J. M. et al. The single most polymorphic STR Locus: SE33 performance in US populations. Forensic Sci. Int. Genet. Suppl. Ser. 2, 23–24. https://doi.org/10.1016/j.fsigss.2009.08.173 (2009).
Alghafri, R. Population data for SE33 locus in United Arab Emirates Arab population. Forensic Sci. Int. Genet. Suppl. Ser. 5, e238–e239. https://doi.org/10.1016/j.fsigss.2015.09.095 (2015).
Butler, J. M. New resources for the forensic genetics community available on the NIST STRBase website. Forensic Sci. Int. Genet. Suppl. Ser. 1, 97–99. https://doi.org/10.1016/j.fsigss.2007.10.035 (2008).
Borsuk, L. A., Gettings, K. B., Steffen, C. R., Kiesler, K. M. & Vallone, P. M. Sequence-based US population data for the SE33 locus. Electrophoresis 39, 2694–2701. https://doi.org/10.1002/elps.201800091 (2018).

Acknowledgements

We gratefully thank Promega Corporation for supplying the kits, with special thanks to Dr Andy Hopwood for his support. We thank Dr Mohammed Alenizi, the director of the Kuwait Identification DNA Laboratory (General Department of Criminal Evidence) for his support. We also thank all the participants who donated their samples. This work was supported by doctoral funding awarded by the Ministry of Interior of Kuwait.

Author information

Authors and Affiliations

Centre for Forensic Science, Department of Pure and Applied Chemistry, University of Strathclyde, 204 George Street, Glasgow, G1 1XW, Scotland, UK
Mahdi Haidar, Fatimah A. Abbas, Hussain Alsaleh & Penelope R. Haddrill
Kuwait Identification DNA Laboratory (KIDL), General Department of Criminal Evidence, Ministry of Interior, Kuwait City, Kuwait
Mahdi Haidar, Fatimah A. Abbas & Hussain Alsaleh

Authors

Mahdi Haidar
Fatimah A. Abbas
Hussain Alsaleh
Penelope R. Haddrill

Contributions

M.H. collected the reference samples and analysed the data. F.A.A. carried out the laboratory work. H.A. analysed the data and co-wrote the manuscript. P.R.H. supervised the study and co-wrote the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Mahdi Haidar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Haidar, M., Abbas, F.A., Alsaleh, H. et al. Population genetics and forensic utility of 23 autosomal PowerPlex Fusion 6C STR loci in the Kuwaiti population. Sci Rep 11, 1865 (2021). https://doi.org/10.1038/s41598-021-81425-y

Received: 30 September 2020
Accepted: 16 December 2020
Published: 21 January 2021
DOI: https://doi.org/10.1038/s41598-021-81425-y