Power of genome-wide linkage disequilibrium testing by using microsatellite markers

Ohashi, Jun; Tokunaga, Katsushi

doi:10.1007/s10038-003-0058-7

Download PDF

Short Communication
Published: 22 August 2003

Power of genome-wide linkage disequilibrium testing by using microsatellite markers

Jun Ohashi^1,2 &
Katsushi Tokunaga¹

Journal of Human Genetics volume 48, pages 487–491 (2003)Cite this article

588 Accesses
30 Citations
1 Altmetric
Metrics details

Abstract

Linkage disequilibrium (LD) testing is often used in the search for disease genes. In this study, we developed a method for calculating the expected power of genome-wide LD testing by using microsatellite markers under the following assumptions: (1) microsatellite markers have unequally frequent alleles, (2) markers are equally spaced through the human genome, (3) the degree of LD between the disease variant and the marker decays gradually because of recombination and mutation, (4) the population frequency of the disease variant is low (e.g., 0.05), (5) a single-marker test is performed in a case-control study, and (6) the significance level is adjusted by the number of tests to avoid inflation of the type I error. Our calculations revealed a markedly higher power for microsatellite markers than for single nucleotide polymorphism (SNP) markers, even if more SNPs are analyzed, suggesting that the use of microsatellite markers is preferable to the use of SNPs for genome-wide screening under the above assumptions. This method will be helpful to researchers who design genome-wide LD testing with microsatellite markers.

Population-specific long-range linkage disequilibrium in the human genome and its influence on identifying common disease variants

Article Open access 06 August 2019

Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts

Article 18 May 2020

Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors

Article Open access 08 December 2021

Introduction

The advent of inexpensive genotyping technologies has made it possible to use polymorphic markers in genome-wide linkage disequilibrium (LD) testing to detect genes involved in genetically complex diseases. Two types of markers are suitable for such a purpose: (1) single nucleotide polymorphism (SNP) markers, which are the most abundant genetic marker in the human genome, and (2) microsatellite markers, which may have many alleles and show high degrees of polymorphism.

Based on a simple deterministic model assuming steady decay of LD, Ohashi and Tokunaga (2002) have concluded that LD testing with SNP markers does not allow the detection of low-frequency disease variants with a modest contribution to the onset of a disease. This is mainly because there would be a difference in allele frequency between the low-frequency disease variant and the associated allele at the adjacent SNP marker, regardless of the degree of LD between them. The difference in allele frequency is known to reduce the power of LD studies markedly (Ohashi and Tokunaga 2001). If, however, the genetic markers have many alleles, the difference in allele frequency is expected to be small.

When a marker locus with equally frequent alleles is analyzed in LD testing, the statistical power increases with an increasing number of alleles (Chapman and Wijsman 1998; Xiong and Jin 1999), because of the increased probability that one of the marker alleles in strong LD with the disease variant has a similar allele frequency as the disease variant. However, the assumption of equally frequent alleles leads to an overestimation of the power of LD testing by using microsatellite markers (Chapman and Wijsman 1998). To consider unequally frequent alleles in the calculation of power, we have developed a model with unequal mutation rates among alleles. This paper aims to provide a reliable method for the calculation of the power of genome-wide LD testing by using microsatellite markers. Furthermore, we demonstrate the way in which microsatellite markers are more powerful for genome-wide LD testing compared with SNP markers.

Materials and methods

Genetic model

It is assumed that a disease locus has two alleles, a disease allele, D, and a normal allele, d. Allele frequencies of D and d are p and 1-p, respectively. At a microsatellite marker locus, there are m different alleles, M _i, i=1, ..., m, with allele frequencies q _i ($ {{\sum\limits_{i = 1}^m {q_{i} } } = 1} $). Throughout the paper, these frequencies are assumed to be constant, with the population being in Hardy-Weinberg equilibrium. The recombination fraction between the two loci is denoted by θ. The frequency of the haplotype carrying D and M _i at time t (measured in generations) is represented by H _Di(t). Similarly, the frequency of the haplotype carrying d and M _i at time t is represented by H _di(t). Note that $ q_{i} = H_{{Di}} + H_{{di}} $. Because microsatellite markers usually show a high mutation rate (e.g., 10⁻³~10⁻⁵ per generation), mutations are assumed only for the marker. Here, a one-step stepwise mutation model (SMM) is assumed. In this model, a marker allele M _i mutates to the next larger allele M _i+1, and to the next smaller allele M _i−1, each with a probability of u _i. The smallest allele, M ₁, can mutate only to M ₂ with a probability of u ₁, and the largest allele, M _m, can mutate only to M _m−1 with a probability of u _m. Under these assumptions, the following deterministic equations hold for the microsatellite markers: $ {H_{{D1}} (t + 1) = (1 - \theta - u_{1} )H_{{D1}} (t) + u_{2} H_{{D2}} (t) + \theta pq_{1} } $, $ {H_{{Di}} (t + 1) = (1 - \theta - 2u_{i} )H_{{Di}} (t) + u_{{i - 1}} H_{{Di - {\rm{1}}}} (t) + u_{{i + 1}} H_{{Di + 1}} (t) + \theta pq_{i} } $ (i=2, ..., m−1), $ {H_{{Dm}} (t + 1) = (1 - \theta - u_{m} )H_{{Dm}} (t) + u_{{m - 1}} H_{{Dm - 1}} (t) + \theta pq_{m} } $, $ {H_{{d1}} (t + 1) = (1 - \theta - u_{1} )H_{{d1}} (t) + u_{2} H_{{d2}} (t) + \theta (1 - p)q_{1} } $, $ {H_{{di}} (t + 1) = (1 - \theta - 2u_{i} )H_{{di}} (t) + u_{{i - 1}} H_{{di - {\rm{1}}}} (t) + u_{{i + 1}} H_{{di + 1}} (t) + \theta (1 - p)q_{i} } $ (i=2, ..., m−1), and $ {H_{{dm}} (t + 1) = (1 - \theta - u_{m} )H_{{dm}} (t) + u_{{m - 1}} H_{{dm - 1}} (t) + \theta (1 - p)q_{m} } $. When initial parameter values are given, we obtain each haplotype frequency for any t, by using these recurrence formulae.

Equilibrium distribution of allele-frequency at marker locus

Farrall and Weeks (1998) investigated the equilibrium distributions of microsatellite allele frequency of (CA)_n microsatellite makers in the Généthon database. Although they examined only microsatellite markers with a CA repeat, we regard their results as representative of microsatellite markers. From their web page (http://www.well.ox.ac.uk/~mfarrall/microsatellite.html), we obtained the observed equilibrium distributions of (CA)_n microsatellite makers with various numbers of alleles. In the SMM, which assumes a fixed number of marker alleles and an equilibrium distribution of allele frequency, proper mutation rates, u _i, should be given to achieve the equilibrium distribution of $ {\bar{q}_{i} } $. When the mean mutation rate at a marker is u, the following equation should be satisfied:

$$ {\bar{q}_{1} u_{1} + \bar{q}_{m} u_{m} + {\sum\limits_{i = 2}^{m - 1} {2\bar{q}_{i} u_{i} } } = u} $$

It should be noted here that $ {u_{i} \bar{q}_{i} } $ requires to be constant regardless of i, because allele frequencies of microsatellite marker are not changed though the calculation. Thus, we get $ {u_{i} = {u \over {2(m - 1)\bar{q}_{i} }}} $ as the mutation rate for M _i.

Disease model

We will consider a case-control study of complex disease genes. For a complex disease, the case group will contain some case individuals who do not possess the susceptibility allele, and the control group will contain some control individuals who possess the susceptibility allele. Here, penetrances for genotypes DD, Dd, and dd are denoted by f ₂, f ₁, and f ₀, respectively (f ₂≥f ₁≥f ₀), and a multiplicative mode of inheritance with a genotype relative risk of r (i.e., f ₂=r ²f ₀ and f ₁=rf ₀) is examined in this study. Note that the present method can be applied to any other modes of inheritance if penetrances are specified. The conditional probabilities of the DD, Dd, and dd genotypes, given that the individual is affected (case), are given by $ P(DD|case) = p^{2} f_{2} /e $, $ P(Dd|case) = 2p(1 - p)f_{1} /e $, and $ P(dd|case) = (1 - p)^{2} f_{0} /e $, respectively, where e represents the disease prevalence, $ p^{2} f_{2} + 2p(1 - p)f_{1} + (1 - p)^{2} f_{0} $, in the studied population. Similarly, the conditional probabilities of each genotype, given that the individual is not affected (control), are given by $ P(DD|control) = p^{2} (1 - f_{2} )/(1 - e) $, $ P(Dd|control) = 2p(1 - p)(1 - f_{1} )/(1 - e) $, and $ P(dd|control) = (1 - p)^{2} (1 - f_{0} )/(1 - e) $. By Bayes' theorem, the probability of an affected individual being of the M _iM _i genotype at time t is given as $ {P(M_{i} M_{i} |case,\;t) = {1 \over e}{\left\{ {f_{{\rm{2}}} H_{{Di}} (t)^{2} + 2f_{{\rm{1}}} H_{{Di}} (t)H_{{di}} (t) + f_{0} H_{{di}} (t)^{2} } \right\}}} $, and the probability of an affected individual being of the M _iM _j (j≠i) is given as $ {P(M_{i} M_{j} |case,\;t) = {1 \over e}{\left\{ {2f_{{\rm{2}}} H_{{Di}} (t)H_{{Dj}} (t) + 2f_{{\rm{1}}} {\left( {H_{{Di}} (t)H_{{dj}} (t) + H_{{di}} (t)H_{{Dj}} (t)} \right)} + 2f_{0} H_{{di}} (t)H_{{dj}} (t)} \right\}}} $. For control, P(M _iM _i|control, t) and P(M _iM _j|control, t) are represented by $ {P(M_{i} M_{i} |control,\;t) = {1 \over {1 - e}}\left\{ {(1 - f_{{\rm{2}}} )H_{{Di}} (t)^{2} + 2(1 - f_{{\rm{1}}} )} \right.\left. {H_{{Di}} (t)H_{{di}} (t) + (1 - f_{{\rm{0}}} )H_{{di}} (t)^{2} } \right\}} $ and $ {\eqalign{ & P(M_{i} M_{j} |control,\;t) = {1 \over {1 - e}}\left\{ {2(1 - f_{{\rm{2}}} )H_{{Di}} (t)H_{{Dj}} (t) + 2(1 - f_{{\rm{1}}} )} \right.{\left( {H_{{Di}} (t)H_{{dj}} (t) + H_{{di}} (t)H_{{Dj}} (t)} \right)} \cr & \left. { + 2(1 - f_{{\rm{0}}} )H_{{di}} (t)H_{{dj}} (t)} \right\} \cr} } $, respectively.

Power

In genome-wide LD testing with a case-control design, marker allele frequencies are compared between case individuals and control individuals. For a comparison of allele frequencies, the data for a microsatellite marker with m alleles are summarized in an m×2 contingency table (Table 1). When the number of cases, N, is equal to that of controls, a χ² statistic for an m×2 contingency table, $ {X^{2} = 2N{\sum\limits_{i = 1}^m {{{(\tilde{x}_{i} - \tilde{y}_{i} )^{2} } \over {\tilde{x}_{i} + \tilde{y}_{i} }}} }} $, can be used to test the null hypothesis of no difference in frequencies of m alleles between cases and controls. Here, $ {\tilde{x}_{i} } $ and $ {\tilde{y}_{i} } $ indicate the observed M _i frequencies in cases and controls, respectively (Chapman and Wijsman 1998). That is, $ {\tilde{x}_{i} } $ and $ {\tilde{y}_{i} } $ represent $ {W_{{i1}} /(2N)} $ and $ {W_{{i2}} /(2N)} $, respectively. Under the null hypothesis, X ² is asymptotically distributed as a χ² distribution with m-1 degrees of freedom. Under the alternative hypothesis of LD or different allele frequencies between cases and controls, X ² is asymptotically distributed as a χ² distribution with m-1 degrees of freedom and with a noncentrality parameter of $ {\gamma = 2NG^{2} = 2N{\sum\limits_{i = 1}^m {{{(x_{i} - y_{i} )^{2} } \over {x_{i} + y_{i} }}} }} $. After t generations, a noncentrality parameter is represented by $ {\gamma (t) = 2NG^{2} (t) = 2N{\sum\limits_{i = 1}^m {{{(x_{i} (t) - y_{i} (t))^{2} } \over {x_{i} (t) + y_{i} (t)}}} }} $, where x _i(t) and y _i(t) are $ {{\left( {P{\left( {M_{i} M_{i} |case,\;t} \right)} + {\sum\limits_{j = 1}^m {P{\left( {M_{i} M_{j} |case,\;t} \right)}} }} \right)}} \mathord{\left/ {\vphantom {{{\left( {P{\left( {M_{i} M_{i} |case,\;t} \right)} + {\sum\limits_{j = 1}^m {P{\left( {M_{i} M_{j} |case,\;t} \right)}} }} \right)}} 2}} \right. \kern-\nulldelimiterspace} 2 $ and $ {{\left( {P{\left( {M_{i} M_{i} |control,\;t} \right)} + {\sum\limits_{j = 1}^m {P{\left( {M_{i} M_{j} |control,\;t} \right)}} }} \right)}} \mathord{\left/ {\vphantom {{{\left( {P{\left( {M_{i} M_{i} |control,\;t} \right)} + {\sum\limits_{j = 1}^m {P{\left( {M_{i} M_{j} |control,\;t} \right)}} }} \right)}} 2}} \right. \kern-\nulldelimiterspace} 2 $, respectively.

Table 1. m×2 contingency table (W _ij represents the number of observed M _i alleles in a group of jth column)

Full size table

We assume that marker allele frequencies are stable at any generation and that only one of the marker alleles is in complete LD with the D allele at the LD-generating event or at t=0, as in previous studies (Chapman and Wijsman 1998; Xiong and Jin 1999). That is, H _Dk(0)=p, H _Di(0)=0 (i≠k), H _dk(0)=q _k−p, and H _di(0)=q _i (i≠k), where q _k is larger than p. When the frequency of D is small, this assumption seems to be valid. If the probability of M _k being in complete LD with D is proportional to q _k, the expected value of G ²(t) for microsatellite markers is given by a weighted average of the conditional expected values: $ {G^{2} (t) = {\sum\limits_{k = 1}^m {q_{k} G^{2} _{k} (t)} }} $ (Chapman and Wijsman 1998), where G ² _k(t) represents G ²(t) under the condition that M _k allele is in complete LD with D allele at t=0.

In genome-wide LD testing, a large number of markers are examined, which causes an inflation of the type I error. To avoid this inflation, the significance level should be adjusted by the number of markers to be tested. When l markers are tested, viz., l independent association tests are performed, the significance level of α should be set to 0.05/l. We consider the case that the LD testing is performed against a region spanning L cM. The disease locus is assumed to be exactly located at the middle of two adjacent markers, and the statistical power is calculated only for the closest marker to the disease locus. Thus, the genetic distance between the disease locus and the most adjacent marker locus is given as L/(2l). By using Haldane's map function, the recombination rate between the nearest marker and the disease locus is given by $ {\theta = {\left[ {1 - \exp ( - L/l)} \right]}/2} $. The total length of the human genome, L, is assumed to be 30 M. To avoid an inflation of the type I error rate attributable to multiple testing, the significance level of α is set to 0.05/l. The asymptotic power, 1−β, for the significance level of α is given as Prob[χ² _m−1(γ)≧χ² _m−1, _1−α(γ)]. The calculations were performed by using SAS software (SAS Institute, Cary, N.C.).

Results and discussion

The number of alleles differs among microsatellite markers. Because microsatellite markers with six alleles are the most frequently observed (Farrall and Weeks 1998), we examined mainly the case of m=6. Following the data of Farrall and Weeks (1998), the equilibrium distribution of allele frequencies for m=6 was set as: q ₁=0.160, q ₂=0.199, q ₃=0.214, q ₄=0.205, q ₅=0.158, and q ₆=0.064. The population frequency of a disease variant, p, was assumed to be 0.05 throughout this study, and the penetrance for the normal genotype f ₀ was set to be 0.01 (a power for f ₀ of less than 0.01 is as same as that for an f ₀ of 0.01). Table 2 shows the expected power of LD testing under the condition that the marker is in complete LD with a low-frequency disease variant. In other words, this is the case of t=0 and the most ideal LD situation. The power was found to depend largely on sample size, genotype relative risk, and significance level. The results suggest that, in a case-control study with 100 cases and 100 controls, it would be difficult to detect a low-frequency disease variant with a genotype relative risk of 2, even at a significance level of 0.05, and that the power for the variant does not reach 0.8 even when 1000 cases and 1000 controls are analyzed. Thus, we may say that LD testing with microsatellite markers can detect a disease variant only with a large genotype relative risk when the allele frequency of the disease variant is low.

Table 2. Expected power of LD testing by using microsatellite markers being in complete LD with a disease variant (at t=0). Power was calculated for microsatellite markers with six alleles (q ₁=0.160, q ₂=0.199, q ₃=0.214, q ₄=0.205, q ₅=0.158, and q ₆=0.064). In the calculation, p and f ₀ were set to 0.05 and 0.01, respectively

Full size table

Although a high statistical power is revealed for a disease variant with a genotype relative risk of 4 in Table 2, the power is reduced by recombination between marker and disease loci and mutation at the marker. To examine the effects of recombination and mutation on the power, we calculated the expected power of LD testing at 50 and 500 generations after the LD-generating event for a study involving the analysis of 1000 cases and 1000 controls (Fig. 1). Figure 1 shows that recombination and mutation markedly influence the power of genome-wide LD testing. In the case of m=6 and t=50 (Fig. 1a), the examination of 2,000–3,000 markers provides a power of more than 0.8, whereas approximately 30,000 markers are required to gain the same power for m=6 and t=500, even when u=10⁻⁵ (Fig. 1b). Above a certain number of markers, the addition of more markers decreases the power because of the correction of significance level based on the number of tests (Fig. 1a). However, a large number of markers needs to be analyzed to attain a high statistical power. Because the reduction in power attributable to the correction of significance level is small, we recommend the analysis of as many microsatellite markers as possible in LD testing.

In Fig. 1, microsatellite markers with three alleles are also considered; the equilibrium distribution of allele frequencies for m=3 was set as: q ₁=0.357, q ₂=0.412, and q ₃=0.232. The power for m=3 is clearly lower than that for m=6. Thus, we should use microsatellite markers with more alleles, even though this increases the degrees of freedom in the statistical test. Note, however, that this is true only when all alleles at a marker have similar allele frequencies. If a marker locus has several alleles with very low allele frequencies, the power is markedly lower than that for markers whose allele frequencies are equal, assuming the same number of alleles. For example, when we assumed that allele frequencies at microsatellite markers with m=6 were q ₁=q ₂=q ₃=q ₄=q ₅=0.05, and q ₆=0.75, the power was as same as that for SNPs in the same condition as in Fig. 1 (data not shown). Thus, it is necessary to use microsatellite markers with non-skewed frequent alleles to obtain a high power. To avoid a reduction in power for microsatellite markers with many low-frequency alleles, all the very low-frequency alleles (e.g., q<0.01) should be regarded as one allele, and the degrees of freedom in the test should be reduced.

Although we do not consider the difference in the number of alleles among microsatellite markers in the present study, the following method allows us to deal with this problem. The proportion of microsatellite marker with m alleles in markers to be used for LD testing is denoted by s _m, and the minimum and maximum numbers of allele are denoted by a and b, i.e., $ {{\sum\limits_{m = a}^b {s_{m} } } = 1} $. In this case, the non-centrality parameter can be given by $ {\gamma *(t) = 2N{\sum\limits_{m = a}^b {s_{m} H^{2} _{m} (t)} }} $, where H ² _m(t) represents G ²(t) for microsatellite marker with m alleles. When the proportion s _m is known, this method would estimate a more reliable power.

Microsatellite markers seem to be more effective than SNPs if the genetic distance between the disease and marker loci is the same. However, the human genome contains more SNP markers than microsatellite markers. In addition, the cost of SNP typing is much lower than that of microsatellite typing, implying that more SNP markers can be tested with the same cost and resources. The use of more markers reduces the expected genetic distance between marker and disease loci. Thus, it is not immediately apparent which marker should be used in genome-wide LD testing. Figure 1 compares the expected power of LD testing by using SNP markers with that of microsatellite markers. Here, the SNPs are assumed to have a minor allele frequency of more than 0.2 (see Ohashi and Tokunaga 2002, for details). No mutation is considered for SNP markers. Microsatellite markers generally reveal a higher power than SNPs even if large numbers of SNPs are analyzed. In particular, we note that LD testing with SNP markers cannot attain a high power under the conditions of Fig. 1. Although our perspective is true only for a low-frequency disease variant and microsatellite markers with non-skewed frequent alleles, we conclude that microsatellite markers are preferable to SNPs for initial genome-wide screening, and SNPs should be used for fine-scale mapping after the screen.

However, if the analysis uses SNPs only in intragenic regions, especially those in coding and regulatory regions, a high power is expected to be attained, even in LD testing with SNPs. A disease variant may be an SNP allele. It should be noted that if a disease variant is included in the SNPs to be tested, the use of SNPs shows a higher power than that of microsatellite markers. LD defined by SNPs is likely to be structured into discrete blocks of tens to hundreds of kilobases in the human genome (Daly et al. 2001). If the pattern of LD in the human genome is clarified by being based on SNPs, the number of SNPs to be analyzed can be reduced, because only a few SNPs within the same block of LD are regarded as representative. This would increase the power of the study, because fewer SNPs can cover the entire genome. Furthermore, LD testing by using SNP haplotypes may reveal a higher power than a single-marker test. When a microsatellite marker is in an LD block, the significant association of each allele at the marker with one of two alleles at the SNP site in the same LD block is expected to be found (Omi et al. 2003), whereas it is unclear whether a microsatellite marker outside of the LD block would still show LD with SNPs or a disease variant inside of the LD block. If there is no LD between microsatellite markers outside the LD block and SNPs inside the LD block, at least one microsatellite marker is required to be in each LD block. However, it is unlikely that useful microsatellite markers are always in the LD block. Thus, the question of whether microsatellite markers are more suitable for genome-wide LD testing than SNPs remains open.

References

Chapman NH, Wijsman EM (1998) Genome screens using linkage disequilibrium tests: optimal marker characteristics and feasibility. Am J Hum Genet 63:1872–1885
Article CAS Google Scholar
Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES (2001) High-resolution haplotype structure in the human genome. Nat Genet 29: 229-232
Article CAS Google Scholar
Farrall M, Weeks DE (1998) Mutational mechanisms for generating microsatellite allele-frequency distributions: an analysis of 4,558 markers. Am J Hum Genet 62:1260–1262
Article CAS Google Scholar
Ohashi J, Tokunaga K (2001) Power of genome wide association studies of complex disease genes: statistical limitation of indirect approaches using SNP markers. J Hum Genet 46:478–482
Article CAS Google Scholar
Ohashi J, Tokunaga K (2002) The expected power of genome-wide linkage disequilibrium testing using single nucleotide polymorphism markers for detecting a low-frequency disease variant. Ann Hum Genet 66:297–306
Article CAS Google Scholar
Omi K, Ohashi J, Patarapotikul J, Hananantachai H, Naka I, Looareesuwan S, Tokunaga K (2003) CD36 polymorphism is associated with protection from cerebral malaria. Am J Hum Genet 72:364-374
Article CAS Google Scholar
Xiong M, Jin L (1999) Comparison of the power and accuracy of biallelic and microsatellite markers in population-based gene-mapping methods. Am J Hum Genet 64:629–640
Article CAS Google Scholar

Download references

Acknowledgements

I greatly appreciate two anonymous reviewers concerning their suggestions for improvements to this paper. This study was supported by a Grant-in-Aid for Scientific Research on Priority Areas (C) "Medical Science" from the Ministry of Education, Culture, Sports, Science and Technology, Japan (J.O.), the Genetic Diversity Project supported by the New Energy and Industrial Technology Development Organization (J.O.), and Health Sciences Research Grants for Research on Human Genome from Ministry of Health Labour and Welfare of Japan (J.O.).

Author information

Authors and Affiliations

Department of Human Genetics, Graduate School of Medicine, University of Tokyo, Tokyo 113-0033, Japan
Jun Ohashi & Katsushi Tokunaga
Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
Jun Ohashi

Authors

Jun Ohashi
View author publications
You can also search for this author in PubMed Google Scholar
Katsushi Tokunaga
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ohashi, J., Tokunaga, K. Power of genome-wide linkage disequilibrium testing by using microsatellite markers. J Hum Genet 48, 487–491 (2003). https://doi.org/10.1007/s10038-003-0058-7

Download citation

Received: 06 May 2003
Accepted: 02 July 2003
Published: 22 August 2003
Issue Date: September 2003
DOI: https://doi.org/10.1007/s10038-003-0058-7

Keywords

This article is cited by

Performance comparison of gel and capillary electrophoresis-based microsatellite genotyping strategies in a population research and kinship testing framework
- Julissa J. Sánchez-Velásquez
- Lorenzo E. Reyes-Flores
- Eliana Zelada-Mázmela
BMC Research Notes (2021)
Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers
- Philippe Suarez
- Karine Baumer
- Diana Hall
Scientific Reports (2021)
Linkage disequilibrium and population-structure analysis among Capsicum annuum L. cultivars for use in association mapping
- Padma Nimmakayala
- Venkata L. Abburi
- Umesh K. Reddy
Molecular Genetics and Genomics (2014)
Map-based molecular diversity, linkage disequilibrium and association mapping of fruit traits in melon
- Yan Tomason
- Padma Nimmakayala
- Umesh K. Reddy
Molecular Breeding (2013)
Megakaryoblastic leukemia factor-1 gene in the susceptibility to coronary artery disease
- Kunihiko Hinohara
- Toshiaki Nakajima
- Akinori Kimura
Human Genetics (2009)

Power of genome-wide linkage disequilibrium testing by using microsatellite markers

Abstract

Similar content being viewed by others

Population-specific long-range linkage disequilibrium in the human genome and its influence on identifying common disease variants

Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts

Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors

Introduction

Materials and methods

Genetic model

Equilibrium distribution of allele-frequency at marker locus

Disease model

Power

Results and discussion

References

Acknowledgements

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

This article is cited by

Performance comparison of gel and capillary electrophoresis-based microsatellite genotyping strategies in a population research and kinship testing framework

Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers

Linkage disequilibrium and population-structure analysis among Capsicum annuum L. cultivars for use in association mapping

Map-based molecular diversity, linkage disequilibrium and association mapping of fruit traits in melon

Megakaryoblastic leukemia factor-1 gene in the susceptibility to coronary artery disease

Search

Quick links

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Genetic model

Equilibrium distribution of allele-frequency at marker locus

Disease model

Power

Results and discussion

References

Acknowledgements

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Search

Quick links