High resolution mapping of quantitative trait loci by linkage disequilibrium analysis

Fan, Ruzong; Xiong, Momiao

doi:10.1038/sj.ejhg.5200843

Download PDF

Article
Published: 02 October 2002

High resolution mapping of quantitative trait loci by linkage disequilibrium analysis

Ruzong Fan^1,2 &
Momiao Xiong³

European Journal of Human Genetics volume 10, pages 607–615 (2002)Cite this article

1025 Accesses
15 Citations
Metrics details

Abstract

Two methods, linkage analysis and linkage disequilibrium (LD) mapping or association study, are usually utilised for mapping quantitative trait loci (QTL). Linkage mapping is appropriate for low resolution mapping to localise trait loci to broad chromosome regions within a few cM (<10 cM), and is based on family data. Linkage disequilibrium mapping, on the other hand, is useful in high resolution or fine mapping, and is based on both population and family data. Using only one marker, one may carry out single-point linkage analysis and linkage disequilibrium mapping. Using two or more markers, it is possible to flank the QTL by multipoint analysis. The development and thus availability of dense marker maps, such as single nucleotide polymorphisms (SNP) in human genome, presents a tremendous opportunity for multipoint fine mapping. In this article, we propose a regression approach of mapping QTL by linkage disequilibrium mapping based on population data. Assuming that two marker loci flank one quantitative trait locus, a two-point linear regression is proposed to analyse population data. We derive analytical formulas of parameter estimations, and non-centrality parameters of appropriate tests of genetic effects and linkage disequilibrium coefficients. The merit of the method is shown by the power calculation and comparison. The two-point regression model can capture much more linkage and linkage disequilibrium information than that derived when only one marker is used. For a complex disease with heritability h²⩾0.15, a study with sample size of 250 can provide high power for QTL detection under moderate linkage disequilibria.

Genome-wide association studies

Article 26 August 2021

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Introduction

Much research has been done on linkage mapping of qualitative or quantitative trait loci (QTL). Schork investigated multipoint identity-by-descent analysis of human quantitative traits.¹ Fulker and Cardon worked out a sib-pair approach of a two-point interval mapping for QTL.² Researchers have been extending the available methods toward several directions: (1) Fulker et al.³ extended the method of Fulker and Cardon² for usage in multipoint interval mapping; (2) Almasy and Blangero worked on multipoint mapping for general pedigrees;⁴ (3) Liang et al. proposed a unified sampling method for both qualitative and quantitative traits;⁵ (4) Pratt et al. used an exact multipoint algorithm to analyse family data by variance component models.⁶ The focus of the above studies was on linkage mapping, which is based on family data. Linkage analysis is appropriate for low resolution genetic mapping to localise trait loci to broad chromosome regions within a few cM (<10 cM).

Linkage disequilibrium (LD) mapping or association study, on the other hand, is based on both family and population data, and is useful in high resolution of genetic mapping, ie, fine disease gene mapping. The reason for the high resolution of linkage disequilibrium mapping is that the allelic association due to linkage disequilibrium usually operates over short genetic distances. Linkage analysis and linkage disequilibrium mapping are complementary in disease gene mapping. To localise genetic traits, one may carry out linkage analysis as the first step on a sparse map to get suggestive linkage between genetic traits and markers. Then linkage disequilibrium mapping can be used as a follow-up in high resolution mapping of the genetic traits on a more dense map.

Abecasis et al.,⁷ Fulker et al.⁸ and Sham et al.⁹ have explored linkage and association studies of quantitative traits by variance–component procedures, allowing a simultaneous test of allelic association for family data. Zhao et al.¹⁰ applied a regression approach of linkage disequilibrium mapping to localise QTL in humans. In these studies the investigators used only one marker in their analysis. However, very dense maps such as single nucleotide polymorphisms (SNPs) in human genome (The International SNP Map Work Group) are available now.¹¹ These exciting developments allow us to explore models and methodologies of simultaneously using two or more markers in high resolution linkage disequilibrium mapping of QTL.

In this article, we propose a linear regression method of high resolution mapping for QTL by using linkage disequilibrium analysis which is based on population data. Assuming that two marker loci flank one genetic trait locus, a linear regression is introduced based on an intuitive rationale. Then we derive analytical formulas of parameter estimations, and non-centrality parameters of appropriate tests of genetic effects and linkage disequilibrium coefficients. The merit of the regression method is shown by the power calculation and comparison.

Models

Consider a quantitative trait which is influenced by a quantitative trait locus Q, which is flanked by two markers A and B in an order of AQB. Suppose that there are two alleles Q₁ and Q₂ at the trait locus with frequencies q₁ and q₂. At the marker locus A, assume there are two alleles A and a with frequencies P_A and P_a, respectively. For the marker B, assume that there are two alleles B and b with frequencies P_B and P_b, respectively. Suppose that markers A and B are in Hardy–Weinberg equilibrium, ie,

.

However, they may be in linkage disequilibrium. Let us denote the measure of linkage disequilibrium between trait locus Q and marker A by D_AQ=P(AQ₁)−q₁P_A, the measure of linkage disequilibrium between trait locus Q and marker B by D_QB=P(BQ₁)−q₁P_B, and the measure of linkage disequilibrium between marker A and marker B by D_AB=P(AB)−P_AP_B.^12,13,14 In addition to the major QTL Q, assume that there is an error effect that influences the trait. Then the total variance can be decomposed as is variance explained by the putative QTL Q, and is error variance. The genetic variances is decomposed into additive and dominant components, respectively. Assume that there are n independent individuals from a population with trait values y_i, genotype A_i at marker A and genotype B_i at marker B. Consider the following regression equation

where β is overall mean, w_i is a row vector of covariates such as sex and age, γ is a column vector of regression coefficients for the covariates w_i, and e_i is error term. Assume that e_i is normal . Besides, x_Ai, x_Bi,z_Ai and z_Bi are dummy random variables that are independent of e_i, and are defined by

α_A, α_B, δ_A and δ_B are regression coefficients of the dummy variables x_Ai, x_Bi, z_Ai and z_Bi. Let us denote an experimental design matrix X by

a vector of regression coefficients by μ=(β,γ^τ_,α_A,α_B,δ_A,δ_B)^τ, the quantitative traits by a vector Y=(y₁,y₂,…,y_n)^τ, and errors terms by e=(e₁,e₂,…,e_n)^τ. Then we may write the model (1) as Y=Xμ+e. By standard regression theory, we may estimate the coefficients by .

To give an intuitive rationale of model (1), let μ_ij be the effect of genotype Q_iQ_j,i,j=1,2,μ₁₂=μ₂₁. Let the genic effect of allele Q_i be α_i,i=1,2. Then genotypic effects can be expressed as μ₁₁=μ₀+2α₁+d₁,μ₁₂=μ₀+α₁+α₂+d₂,μ₂₂=μ₀+2α₂+d₃, where μ₀ is the overall population mean, d_i is the deviation of the related genotypic value from that of an additive effect model. Minimising , the estimates of μ₀,α₁,α₂ are , and (Jacquard,¹⁵ Chapter 5). Plugging these estimates into μ_ij, we can obtain that . Here α_Q=q₁μ₁₁+(q₂−q₁)μ₁₂−q₂μ₂₂ is the average effect of gene substitution, and δ_Q=2μ₁₂−μ₁₁−μ₂₂ is the dominant deviation. Assume that marker A coincides with the trait locus Q, and marker allele A is trait allele Q₁ and marker allele a is trait allele Q₂. Then the trait value can be expressed as y_i=μ+x_Qiα_Q+z_Qiδ_Q+e_i. In practice, information of trait locus Q is unknown, but the information at marker loci is available. This prompts us to propose regression model (1) to map QTLs.

Assume that there are no covariates. Suppose that the markers A and B are in Hardy–Weinberg equilibrium. Then Ex_Ai=Ex_Bi=Ez_Ai=Ez_Bi=0. When the sample size n is large enough, we show in Appendix A that the coefficients are approximately given by

If the markers A and B are in linkage equilibrium, ie, D_AB=0, then the above equations simplify to the following (Appendix A)

Property of regression coefficients

As in the previous section, let μ_ij be the effect of genotype Q_iQ_j,i,j=1,2. If μ₁₁=a, μ₁₂=d, and μ₂₂=−a as in the traditional quantitative genetics (Falconer and Mackay¹⁶), α_Q=a+(q₂−q₁)d and δ_Q=2d. For general case, one may form the above relations by letting a=μ₁₁−(μ₁₁+μ₂₂)/2 and d=μ₁₂−(μ₁₁−μ₂₂)/2. It is well known that the additive variance and the dominant variance . A true random effect model describing the trait value is

where

Let us denote three ratios . In Appendix B, we will show that the coefficients of regression equation (1) are given by

Assume that the two markers A and B are not in linkage disequilibrium, ie, D_AB=0. Then , , and . Hence, marker A and marker B independently contribute to the analysis of the trait values. Furthermore, assume the trait locus Q is in linkage disequilibrium with marker A but not with marker B. Then D_QB=0 and so . Hence, only marker A contributes to the analysis and marker B has no effect on the result. This is equivalent to using one marker for the analysis.

If one marker coincides with the trait locus, for instance locus Q is marker A, we can show that the other marker B does not contribute to estimations of the substitution and dominant effects of the trait locus. Actually, assume that allele A=Q₁ and allele a=Q₂. Then D_AB=D_QB and D_AQ=q₁q₂. This leads to . Hence, marker A can fully estimate the substitution and dominant effects of the trait locus Q.

In general, assume that marker A and marker B are in linkage disequilibrium. Then model (1) simultaneously takes care of the linkage disequilibrium and the effects of the putative trait locus Q. The parameters of linkage disequilibrium (ie, D_AQ and D_QB) and gene effect (ie, α_Q and δ_Q) are contained in the mean coefficients. We may simultaneously test linkage disequilibrium of marker A and marker B with trait locus Q, the gene substitution and dominant effects by testing α_A=α_B=δ_A=δ_B=0. From equation (4), we may test the linkage disequilibrium of markers A and B with the trait locus Q and the gene substitution effect α_Q by testing α_A=α_B=0. From equation (5), we may test the linkage disequilibrium of markers A and B with the trait locus Q and the dominant effect by testing δ_A=δ_B=0.

Non-centrality parameters

Assume that there are no covariates. Then μ=(β,α_A,α_B,δ_A,δ_B)^τ. Let H be a q×5 matrix of rank q. By Graybill,¹⁷ Chapter 6, the test statistic of a hypothesis Hμ=0 is non-central F(q,n−5) defined by , where I_n is the n×n identity matrix. The non-centrality parameter of the test statistic F can be calculated by λ=[1/(2σ²)] (Hμ)^τ[H[X^τX]⁻¹H²]⁻¹Hμ. To test if there are additive and dominant effects, we may test the hypothesis H_AB,ad : α_A=α_B=δ_A=δ_B=0. Then the test matrix H is defined by

Let us denote the corresponding F-test statistic by F_AB,ad. In Appendix C, we show

If one assumes that (a) the two markers A and B are not in linkage disequilibrium, then D_AB=0; (b) the trait locus Q is in linkage disequilibrium with marker A but not with marker B, then D_QB=0 and . Then , which only involves marker A and can be written as λ_A,ad. Correspondingly, we denote the related F-test statistic by F_A,ad. Furthermore, assume (c) there is no dominant effect, ie, . Then is the non-centrality parameter of the related F-test statistic F_A,a.

To test other hypotheses, we may get the non-centrality parameters in a similar way by taking appropriate test matrices H. To test if there is dominant effect, we may test the hypothesis H_A,B,d : δ_A=δ_B=0. The non-centrality parameter is . The related F-test statistic is denoted by F_AB,d. To test if there is additive or substitution effect, we may test the hypothesis H_AB,a : α_A=α_B=0. The non-centrality parameter is . The related F-test statistic is denoted by F_AB,a. To test if there are additive and dominant effects at marker locus A given that there are effects at marker locus B, we may test the hypothesis H_A|B,ad : α_A=δ_A=0. The non-centrality parameter is

To test if there is dominant effect at marker locus A given that there are effects at marker locus B, we may test the hypothesis H_A|B,d : δ_A=0. The non-centrality parameter is .

Power calculation and comparison

To investigate the usefulness of the methods proposed in this article, we performed power and sample size calculations. As usual, we denote the heritability by h² which is defined by . In the power calculations, we first take the equal allele frequencies P_A=q₁=P_B=0.5 at the two markers A and B, and the trait locus Q. Moreover, suppose that μ₁₁=a,μ₁₂=μ₂₁=d and μ₂₂=−a. Assume that marker A and marker B are in linkage equilibrium, ie, D_AB=0, the heritability h²=0.25, and a sample size n=120. Figures 1 and 2 show the power curves of the test statistics F_AB,ad, F_A,ad, and F_A,a against the disequilibrium coefficient D_AQ when D_QB=0.15 for a mode of dominant inheritance with a=d=1.0 and a mode of recessive inheritance with a=1.0, d=−0.5, respectively. The statistic F_AB,ad has the highest power, and F_A,ad has higher power than that of F_A,a. Hence, the regression approach that uses two markers A and B is advantageous over the one marker mapping that uses only one marker A or B.

Assume that the markers A and B are in moderate linkage disequilibrium, ie, D_AB=0.1, and that the linkage disequilibrium coefficients D_AQ=D_QB=0.15. Figures 3 and 4 show the power curves of the test statistics F_AB,ad, F_A,ad and F_A,a against the heritability h² for a mode of dominant inheritance with a=d=1.0 and a mode of recessive inheritance with a=1.0, d=−0.5, respectively. For a population with sample size n=250, the regression approach can achieve a high power for a trait with heritability h²⩾0.15. Hence, the high resolution linkage disequilibrium mapping is a promising tool in mapping complex traits.

In a population, the linkage disequilibrium exists if mutations at the trait locus occur. Once the mutations occur, the recombination between a marker locus and the trait locus can dissipate the disequilibrium from generation to generation. Let us denote the frequency of haplo type AQ at the generation when the mutations occur by P(AQ)(0). Then the linkage disequilibrium coefficient is D_AQ(0)=P(AQ)(0)-q₁P_A at the generation when the mutations occur. For the following generations, the disequilibrium coefficient is reduced by a factor 1−θ_AQ in each generation,¹² where θ_AQ is the recombination fraction between trait locus Q and marker A. Suppose that the mutation is already T generations old. Then the disequilibrium coefficient is D_AQ(T)=D_AQ(0)(1−θ_AQ)^T. Similarly, we may calculate the disequilibrium coefficients by D_AB(T)= D_AB(0)(1−θ_AB)^T and D_QB(T)=D_QB(0)(1−θ_QB)^T, where θ_QB is the recombination fraction between trait locus Q and marker B, and θ_AB is the recombination fraction between marker A and marker B.

Suppose that we know the map distance λ_AB between marker A and marker B. Under the assumption of no interference, we may calculate the recombination fraction θ_AB=[1−exp(−2λ_AB]/2 by Haldane's map function. Similarly, we may calculate the recombination fractions θ_AQ and θ_QB by the map distances λ_AQ and λ_QB. Assume that the map distance between marker A and marker B is λ_AB=5cM, and the other parameters are given by D_AB(0)=0.20, D_AQ(0)=D_QB(0)=0.25, h²=0.25, n=120, T=20. Figures 5 and 6 show the power curves of the test statistics F_AB,ad, F_A,ad, and F_A,a against the recombination fraction θ_AQ, for a mode of dominant inheritance with a=d=1.0 and a mode of recessive inheritance with a=1.0, d=−0.5, respectively. We can see that the power of F_AB,ad is very high, although the power of F_A,ad and F_A,a decreases very rapidly as the recombination fraction θ_AQ increases. Hence, the regressions using two markers are advantageous for fine gene mapping, and appropriate for the dense marker map such as SNPs in human genome.

To investigate the less favourable case other than the equal allele frequencies of trait locus and marker loci, Figure 7 shows the power curves of F_AB,ad, F_A,ad, and F_A,a against the linkage disequilibrium coefficient D_AQ when q₁=0.20, P_A=P_B=0.80, D_AB=0.0, D_QB=0.04, h²=0.25, n=120 for a dominant trait a=1.0 and d=0.8. The three power curves are very close. Moreover, the power decreases rapidly when the linkage disequilibrium between trait locus Q and marker A decreases. For a recessive trait a=1.0 and d=−0.5, Figure 8 shows the power curves against the recombination fraction when q₁=0.20, P_A=P_B=0.80, D_AB(0)=0.10, D_AQ(0)=D_QB(0)=−0.15, h²=0.25, λ_AB=5cM, n=120, T=20.

Figures 9 and 10 show two plots of the sample size against the heritability h² at a significant level 0.05 for a given power 0.80. In a favourable case when q₁=P_A=P_B=0.50, D_AB=0.10, D_AQ=D_QB=0.15 for a dominant trait a=1.0 and d=0.80, the required sample size is lower than 400, if the heritability is not lower than 0.1 (Figure 9). However, for an extremely less favourable case when q₁=0.20, P_A=P_B=0.80, D_AB=0.0, D_AQ=0.03, D_QB=0.04 for a recessive trait a=1.0 and d=−0.5, the required sample size is huge (Figure 10). Unfortunately, the true QTL frequency is rarely, if ever, known. Hence, linkage disequilibrium mapping works only when the linkage disequilibria are reasonably high, at least one needs moderate linkage disequilibria.

Discussion

With the development of dense marker maps, such as SNPs in human genome (The International SNP Map Work Group¹¹), fine disease gene mapping is getting more and more important for the study of complex diseases. Association study is a simple and useful method in fine disease gene mapping (Cardon and Bell¹⁸; Risch and Merikangas¹⁹). In this article, we proposed a linear regression method to perform high resolution linkage disequilibrium mapping of QTLs. In the regression, we used information of two flanking markers to model the additive and dominant effects of a QTL, and also the linkage disequilibria between the markers and the trait locus. In addition to the additive and dominant effects, we may add the covariates to model their effects. Due to the simplicity, the method can be easily performed by routine statistical analysis softwares such as SAS and Splus.

After studying the merits of the method of using two markers as proposed in this article, we concluded that this method is well suited for mapping complex diseases. It provides higher power than that of using only one marker approach. The advantages of high resolution mapping have been explored by many authors by using linkage analysis of family data or plant/animal data^{20,21,22,23,24,25}). However, there is not sufficient statistical analysis regarding the high resolution mapping by linkage disequilibrium mapping method. Using population data, Zhao et al.¹⁰ applied an approach of linkage disequilibrium mapping based on regression to map QTL in humans. Abecasis et al.,⁷ Allison et al.,²⁶ Fulker et al.,³ Göring and Terwillinger,²⁷ and Sham et al.⁹ have explored linkage and association studies of quantitative traits by variance–component procedures allowing a simultaneous test of allelic association for family data. One interesting approach is to combine both family and population data, and perform combined linkage analysis and linkage disequilibrium high resolution mapping.

The power of linkage disequilibrium mapping depends on the existence of disequilibrium between a trait locus and a marker. In a population, linkage disequilibrium exists if mutations at the trait locus occur. In the absence of tight linkage, the degree of linkage disequilibrium decreases very rapidly after a few generations due to the recombination between the trait locus and the markers. Hence, linkage disequilibrium mapping is appropriate for the analysis of dense marker maps to do high resolution fine gene mapping. In practice, one can perform linkage disequilibrium mapping following prior evidence of linkage. Linkage analysis is less sensitive to population stratification, population history, or environmental effects. Moreover, linkage mapping is appropriate for low resolution mapping to localise trait loci to broad chromosome regions (<10 cM). The two methods, linkage mapping and linkage disequilibrium mapping, are complementary for disease gene mapping.

Potential problems of linkage disequilibrium mapping include population stratification, population history, or environmental effects. It is well understood that for the same number of individuals, family based linkage disequilibrium methods are less powerful than the population based methods. However, utilising family based linkage disequilibrium approaches may avoid false positives due to the sources of linkage disequilibrium such as population admixtures rather than linkage. One research area is to combine the population and pedigree data to do linkage disequilibrium mapping, and use the pedigree data alone to perform linkage mapping (Fulker et al.⁸).

As in Sham et al.,⁹ we notice that the non-centrality parameter is reduced by a factor equal to R²_AQ for additive variance, and a factor of R⁴_AQ for dominant variance, if we use only one marker A to perform analysis. Hence, the power decreases rapidly when the linkage disequilibrium between the trait locus and the marker is reduced. The degree of linkage disequilibrium depends heavily on the map distance between the trait locus and the marker locus, and most likely maintains high linkage disequilibrium when the two loci are very close. Hence, the high resolution mapping method proposed in this article has a good potential for being used in fine disease gene mapping. As mentioned in Sham et al.,⁹ the property of the measurements R²_AQ, R²_QB and R²_AB needs more investigation, and their roles in different scenarios should be studied more thoroughly.

References

Schork NJ . Extended multipoint identity-by-descent analysis of human quantitative traits: efficiency, power, and modeling considerations Am J Hum Genet 1993 53: 1306–1319
CAS PubMed PubMed Central Google Scholar
Fulker DW, Cardon LR . A sib-pair approach to interval mapping of quantitative trait loci Am J Hum Genet 1994 54: 1092–1103
CAS PubMed PubMed Central Google Scholar
Fulker DW, Cherny SS, Cardon LR . Multipoint interval mapping of quantitative trait loci, using sibpairs Am J Hum Genet 1995 56: 1224–1233
CAS PubMed PubMed Central Google Scholar
Almasy L, Blangero J . Multipoint quantitative trait linkage analysis in general pedigrees Am J Hum Genet 1998 62: 1198–1211
Article CAS Google Scholar
Liang KY, Huang CY, Beaty TH . A unified sampling approach for multipoint analysis of qualitative and quantitative traits in sibs Am J Hum Genet 2000 66: 1631–1641
Article CAS Google Scholar
Pratt SC, Daly M, Kruglyak L . Exact multipoint quantitative-trait linkage analysis in pedigrees by variance components Am J Hum Genet 2000 66: 1153–1157
Article CAS Google Scholar
Abecasis GR, Cardon LR, Cookson WOC . A general test of association for quantitative traits in nuclear families Am J Hum Genet 2000 66: 279–292
Article CAS Google Scholar
Fulker DW, Cherny SS, Sham PC, Hewitt JK . Combined linkage and association sib-pair analysis for quantitative traits Am J Hum Genet 1999 64: 259–267
Article CAS Google Scholar
Sham PC, Cherny SS, Purcell S, Hewitt JK . Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data Am J Hum Genet 2000 66: 1616–1630
Article CAS Google Scholar
Zhao J, Li W, Xiong M . Population based linkage disequilibrium mapping of QTL: an application to simulated data in an isolated population Genetic Epidemiology 2001 21 S1: S655–S659
Article Google Scholar
The International SNP Map Working Group. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms Nature 2001 409: 928–933
Hartl DL, Clark AG . Principles of Population Genetics 2nd edn Sunderland, MA: Sinauer Associates 1989
Google Scholar
Hedrick PW . Gametic disequilibrium measures: proceed with caution Genetics 1987 117: 331–341
CAS PubMed PubMed Central Google Scholar
Lewontin RC . The interaction of selection and linkage. I. General considerations; heterotic models Genetics 1964 49: 67
Google Scholar
Jacquard A . The Genetic Structure of Populations New York: Springer-Verlag 1974
Book Google Scholar
Falconer DS, Mackay TFC . Introduction to Quantitative Genetics 4th edn London: Longman 1996
Google Scholar
Graybill FA . Theory and Application of the Linear Model California: Pacific Grove 1976
Google Scholar
Cardon LR, Bell J . Association study designs for complex diseases Nature Reviews Genetics 2001 2: 91–99
Article CAS Google Scholar
Risch N, Merikangas K . The future of genetic studies of complex human diseases Science 1996 273: 1516–1517
Article CAS Google Scholar
Amos C, Andrade MD . Genetic linkage methods for quantitative traits Statistical Methods in Medical Research 2001 10: 3–25
Article CAS Google Scholar
Goldgar DE . Multipoint analysis of human quantitative genetic variation Amr J Hum Genet 1990 47: 957–967
CAS Google Scholar
Haley CS, Knott SA . A simple regression method for mapping quantitative trait loci in line crosses using flanking markers Heredity 1992 69: 315–324
Article CAS Google Scholar
Jansen RC . Interval mapping of multiple quantitative trait loci Genetics 1993 135: 205–211
CAS PubMed PubMed Central Google Scholar
Xu SZ, Atchley WR . A random model approach to interval mapping of quantitative trait loci Genetics 1995 141: 1189–1197
CAS PubMed PubMed Central Google Scholar
Zeng ZB . Precision mapping of quantitative trait loci Genetics 1994 136: 1457–1468
CAS PubMed PubMed Central Google Scholar
Allison DB, Heo M, Kaplan N, Martin ER . Sibling-based tests of linkage and association for quantitative traits Am J Hum Genet 1999 64: 1754–1764
Article CAS Google Scholar
Göring HHH, Terwillinger JD . Linkage analysis in the presence of error IV: joint pseudomarker analysis of linkage and/or linkage disequilibrium on a mixture of pedigrees and singletons when the mode of inheritance cannot be accurately specified Am J Hum Genet 2000 66: 1310–1327
Article Google Scholar

Download references

Acknowledgements

We thank two reviewers, Section Editors, and Dr Gert-Jan B von Ommen for their helpful comments to improve the article. Dr Joanna Floros read the manuscript and provided helpful suggestions in improving the grammar. R Fan was supported partially by a research fellowship from Alexander von Humboldt Foundation, Germany, and an International Research Travel Assistance Grant of the Texas A&M University. M Xiong was supported by NIH grant R01-GM56515, and MH59518.

Author information

Authors and Affiliations

Department of Statistics, The Texas A&M University, 447 Blocker Building, College Station, TX 77843-3143, Texas, USA
Ruzong Fan
Institute of Medical Biometry, Informatics and Epidemiology, University of Bonn, Sigmund Freud Strasse 25, Bonn, D-53105, Germany
Ruzong Fan
Human Genetics Center, University of Texas–Houston, P.O. Box 20334, Houston, TX 77225, Texas, USA
Momiao Xiong

Authors

Ruzong Fan
View author publications
You can also search for this author in PubMed Google Scholar
Momiao Xiong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ruzong Fan.

Appendices

Appendix A

Suppose that the markers A and B are in Hardy–Weinberg equilibrium. Then Ex_Ai=Ex_Bi=Ez_Ai=Ez_Bi=0. We first can show the following equations

In the following, we are going to show the first two of the above equations. The other equations can be shown by similar calculations. Actually, we have

When the sample size n is large enough, the large number law leads to

This implies that the coefficients are approximately given by , and

If the marker A and marker B are in linkage equilibrium, i.e., D_AB=0, then and . This will lead to equations in (2).

Appendix B

Notice that we have the following variance-covariance equations from model (1)

The elements of the variance-covariance matrix on the left-hand side of the above equation are given in equations (8). For the elements on the right-hand side, we can show that

In the following, we are going to show the first one of the above equations. The rest can be shown in the same way.

For the first equation, we have

Plugging equations (8) and (11) into equation (10), we have

Hence, one may get equations (4) and (5).

Appendix C

Using equations (4), (5), (6) and (9), the non-centrality parameter is

which is equal to that in (7) by using equations .

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fan, R., Xiong, M. High resolution mapping of quantitative trait loci by linkage disequilibrium analysis. Eur J Hum Genet 10, 607–615 (2002). https://doi.org/10.1038/sj.ejhg.5200843

Download citation

Received: 18 October 2001
Revised: 12 March 2002
Accepted: 16 May 2002
Published: 02 October 2002
Issue Date: 01 October 2002
DOI: https://doi.org/10.1038/sj.ejhg.5200843

Keywords

This article is cited by

Statistical distributions of test statistics used for quantitative trait association mapping in structured populations
- Simon Teyssèdre
- Jean-Michel Elsen
- Anne Ricard
Genetics Selection Evolution (2012)
Pedigree linkage disequilibrium mapping of quantitative trait loci
- Ruzong Fan
- Christie Spinka
- Jee Sun Jung
European Journal of Human Genetics (2005)
Combined high resolution linkage and association mapping of quantitative trait loci
- Ruzong Fan
- Momiao Xiong
European Journal of Human Genetics (2003)

High resolution mapping of quantitative trait loci by linkage disequilibrium analysis

Abstract