Autosomal recessive disease in children of consanguineous parents: inferences from the proportion of compound heterozygotes

ten Kate, Leo P.; Teeuw, Marieke; Henneman, Lidewij; Cornel, Martina C.

doi:10.1007/s12687-010-0002-4

Autosomal recessive disease in children of consanguineous parents: inferences from the proportion of compound heterozygotes

Short communication
Open access
Published: 25 February 2010

Volume 1, pages 37–40, (2010)
Cite this article

Download PDF

You have full access to this open access article

Journal of Community Genetics Aims and scope Submit manuscript

Autosomal recessive disease in children of consanguineous parents: inferences from the proportion of compound heterozygotes

Download PDF

Leo P. ten Kate¹,
Marieke Teeuw¹,
Lidewij Henneman¹ &
…
Martina C. Cornel¹

1241 Accesses
10 Citations
Explore all metrics

Abstract

This short communication deals with the questions of how to calculate the expected proportion of compound heterozygous patients among affected offspring of consanguineous parents, and how, from an observed proportion of compound heterozygotes, to calculate both the proportion of homozygotes not identical by descent and the frequency of pathogenic alleles in the population. This estimate of allele frequency may be useful when dealing with populations with a considerable number of consanguineous matings.

Introduction

In a recent search for offspring of consanguineous matings affected by autosomal recessive diseases, we came across four compound heterozygous patients among 38 affected children. This raised the question of whether this was an unexpectedly high proportion or not.

In the past, when we reported about a first compound heterozygous cystic fibrosis (CF) patient with consanguineous parents, we showed that the proportion of affected children with two alleles not identical by descent (non-IBD) can be considerable (Ten Kate et al. 1991). However, alleles non-IBD may still be identical by state (IBS). So the affected compound heterozygous children are just a subset of the affected children who do not have both alleles IBD notwithstanding parental consanguinity.

Therefore, we wondered what proportion of non-IBD patients with consanguineous parents represent compound heterozygotes, and what proportion is non-IBD but still IBS. Secondly, we wanted to know whether it is possible to calculate the overall pathogenetic allele frequency for an autosomal recessive disorder on the basis of knowledge of the proportion of compound heterozygotes among affected children of consanguineous parents. This might be a useful application as the current global prevalence of consanguineous marriage is estimated at 10.4%, (Bittles and Black 2009), with much higher percentages in many non-Western countries.

Methods

We start our exploration with the well-known formula to calculate the probability of the presence of a given autosomal recessive disease X in the children of a consanguineous couple (Li, 1955).

$$ P(X) = Fq + \left( {1 - F} \right){q^2} $$

(1)

In this formula, F is the inbreeding coefficient and q is the total frequency of all pathogenic alleles causing disorder X. The fraction Fq represents the proportion of affected children whose alleles are IBD, and the fraction (1−F)q ² corresponds with the proportion of affected children whose alleles are not IBD.

The fraction (1−F)q ² is composed of two parts—one part comprising the compound heterozygotes (CH), and the other part combining all homozygotes non-IBD (HN). The relative frequencies of the two sets within the fraction (1−F)q ² are (in reversed order)

$$ R\left( {\hbox{HN}} \right) = \sum\limits_{i = 1}^n {\mathop a\nolimits_i^2 }, {\hbox{and}} $$

(2)

$$ R\left( {\hbox{CH}} \right) = 1 - \sum\limits_{i = 1}^n {\mathop a\nolimits_i^2 } $$

(3)

In Eqs. 2 and 3, a _i represents the relative frequency of the i^th allele. So its square, a _i ², is the relative frequency of homozygotes of the i^th allele non-IBD.

From Eqs. 1 and 3, it follows that the proportion of compound heterozygotes, P(CH), among affected children of consanguineous parents is

$$ P\left( {\hbox{CH}} \right) = \frac{{\left( {1 - \sum\limits_{i = 1}^n {a_i^2} } \right) \times \left( {1 - F} \right){q^2}}}{{Fq + \left( {1 - F} \right){q^2}}} $$

(4)

We can now calculate the expected proportion of compound heterozygotes, P(CH), if we know F, q, and the relative frequencies of the pathogenic alleles. Conversely, knowing P(CH) by observation, as mentioned in the introduction, we can estimate R(CH), R(HN), and P(HN), if we know F and q, as follows:

$$ R\left( {\hbox{CH}} \right) = \left( {1 - \sum\limits_{i = 1}^n {\mathop a\nolimits_i^2 } } \right) = \frac{{P\left( {\text{CH}} \right) \times \left[ {Fq + \left( {1 - F} \right){q^2}} \right]}}{{\left( {1 - F} \right){q^2}}} = \frac{{P\left( {\text{CH}} \right) \times \left[ {F + \left( {1 - F} \right)q} \right]}}{{\left( {1 - F} \right)q}}, $$

(5)

$$ R\left( {\hbox{HN}} \right) = 1 - R\left( {\hbox{CH}} \right),\,{\hbox{and}} $$

(6)

$$ P\left( {\hbox{HN}} \right) = \frac{{R\left( {\text{HN}} \right) \times \left( {1 - F} \right){q^2}}}{{Fq + \left( {1 - F} \right){q^2}}} = \frac{{R\left( {\text{HN}} \right) \times \left( {1 - F} \right)q}}{{F + \left( {1 - F} \right)q}} $$

(7)

We can also calculate q from (4) or (5) if we know P(CH), F and R(CH) or the relative frequencies of the pathogenic alleles.

$$ q = \frac{{P\left( {\text{CH}} \right) \times \left( {F + q - Fq} \right)}}{{\left( {1 - F} \right) \times R\left( {\hbox{CH}} \right)}}{,}\,{\hbox{from}}\;{\hbox{which}}\;q\;{\hbox{can}}\;{\hbox{be}}\;{\hbox{solved}}{.} $$

(8)

Results

Table 1 shows the dependency of the proportion of compound heterozygotes among affected offspring of consanguineous parents, P(CH), upon the parameters F, q, and R(CH) (see Eqs. 3 and 4). The examples given illustrate that P(CH) is positively correlated with R(CH) and q, and negatively with F,—as expected.

Table 1 Expected proportions of compound heterozygotes among affected children of consanguineous parent, P(CH), given some values of F, q, and R(CH), the relative frequency of these compound heterozygotes among non-IBD affected children

Full size table

Table 2 shows the results of calculations of the frequencies of homozygotes IBD and non-IBD among affected children of first cousins, and the total frequency of pathogenic alleles in the population in case of 10% compound heterozygotes and with different numbers and relative frequencies of pathogenic alleles. As the proportion of compound heterozygotes is fixed at 10% in this table, the row sum of the proportions of homozygotes IBD and non-IBD (third and fourth columns) add up to 90%. The table shows that knowledge of the proportion of compound heterozygotes, the inbreeding coefficient, and the number and relative frequencies of pathogenic alleles (first and second columns) allows one to calculate the total frequency of pathogenic alleles of a gene in the population (fifth column). Not unexpectedly, the higher the frequency of the major allele, the higher is the frequency of homozygotes non-IBD and the higher the total frequency of pathogenic alleles in the population for a given frequency of compound heterozygotes among affected offspring of consanguineous matings. The same trend can be observed for children of second cousins (data not shown) and other levels of inbreeding.

Table 2 Frequencies of homozygotes IBD and non-IBD among children with an autosomal recessive disease whose parents are first cousins when 10% of these children are compound heterozygotes as well as total frequency of pathogenic alleles in the population for different numbers and relative frequencies of alleles

Full size table

Discussion

Since our observation of a compound heterozygous CF patient with consanguineous parents back in 1990, many more observations of compound heterozygotes in consanguineous families have been reported (summarized in Petukhova et al. 2009). Such patients present a problem to researchers using autozygosity mapping for identification of recessive disease genes. Still, finding compound heterozygosity among affected children of consanguineous couples has potential advantages. It may comfort parents, who thought or were told that their consanguinity was causally related to the disorder in their children, to learn now that their consanguinity cannot be blamed for it. The same applies to some extent for parents who can be told that there is a considerable chance that the homozygosity in their affected child is not caused by alleles IBD. Moreover, the mere existence of compound heterozygotes and the presence of non-IBD homozygotes among affected offspring of consanguineous parents may be of help in disputes with opponents of consanguineous matings, who are abundantly present in countries where consanguineous matings are not customary. As we have shown here, we can also learn more from the frequency of compound heterozygotes, as this frequency is related to the inbreeding coefficient, the number and relative frequencies of alleles, and their total frequency.

While preparing the manuscript of this communication, we came across the paper of Petukhova et al. (2009). These authors developed a formula to calculate the frequency of compound heterozygotes in the presence of inbreeding as we did, but unfortunately assumed equal frequencies of disease-causing mutations. As we have shown here, this is a serious omission and, moreover, far from realistic. A second difference with their paper is that we did not only calculate the frequency of compound heterozygotes, but turned the problem upside down by looking for inferences following from observed frequencies of compound heterozygotes.

One may question the usefulness of being able to make these calculations. If F is known in a certain (sub)population, then the most straightforward way to estimate q would be via the prevalence of the disease in that (sub)population. In practice, however, F and the prevalence of the disease in a population are seldom known with any certainty. Most of the times, they are unknown or the estimates are debatable because of large variances or possible biases. Arriving at accurate and dependable estimates of both parameters takes a lot of effort and resources. For this reason, any method to estimate q from other sources, such as the one we describe, is an improvement. While estimating F in a population requires knowledge of the prevalence of consanguineous matings and the distribution of different degrees of consanguinity among them, estimating F from a small number of consanguineous families known to a laboratory in general is less of a challenge.

Once the total frequency of pathogenic alleles is known, the frequency of an autosomal recessive disease in a population, P(D), can be inferred from the total frequency of disease-causing alleles, especially when the frequency of consanguineous matings, c, is known as well, using the equation

$$ P(D) = \left( {1 - c} \right){q^2} + c\left[ {Fq + \left( {1 - F} \right){q^2}} \right] $$

(9)

Others have taken a different approach to calculate the frequency of a disease in the population by looking at the proportion of consanguineous parents among affected children and inferring from there, taking into account the frequency of consanguineous matings, the total pathogenic allele frequency and the total frequency of recessives in the general population (Romeo et al. 1985; Koochmeshgi et al. 2002). This method might result in a biased estimate if the presence of consanguinity of the parents alerts the physician to think of the possibility of a recessive disorder and diagnose accordingly, while this might not have been the case in the absence of consanguinity. Starting from the proportion of compound heterozygotes gives an unbiased estimate and therefore at least represents an additional tool to determine disease frequency in the general population.

Of course our method has some limitations too. Firstly, inferences can only be made about the population to which the cases belong. If a population is non-homogeneous as to the frequency of consanguineous matings, population stratification has to be taken into account. Secondly, for any recessive disorder, the number of compound heterozygotes among affected children of consanguineous parents will be limited. This means that estimates of the proportion of compound heterozygotes will tend to have rather wide confidence intervals, which will persist in derived figures. Nevertheless, a provisional estimate of the frequency of pathogenic alleles using our method can be useful before embarking on larger studies, or as a check when other data are already available.

References

Bittles AH, Black ML (2009) Consanguinity, human evolution and complex diseases. Proc Natl Acad Sci USA (in press)
Koochmeshgi J, Bagheri A, Hosseini-Mazinani SM (2002) Incidence of phenylketonuria in Iran estimated from consanguineous marriages. J Inherit Metab Dis 25:80–81
Article PubMed CAS Google Scholar
Li CC (1955) Population genetics. University of Chicago Press, Chicago
Google Scholar
Petukhova L, Shimomura Y, Wajid M, Gorroochurn P, Hodge SE, Christiano A (2009) The effect of inbreeding on the distribution of compound heterozygotes: a lesson from Lipase H mutations in autosomal recessive woolly hair/hypotrichosis. Hum Hered 68:117–130
Article PubMed Google Scholar
Romeo G, Bianco M, Devoto M, Menozzi P, Mastella G, Giunta AM, Micalizzi C, Antonelli M, Battistini A, Santamaria F, Castello D, Marianelli A, Marchi AG, Manca A, Miano A (1985) Incidence in Italy, genetic heterogeneity, and segregation analysis of cystic fibrosis. Am J Hum Genet 37:338–349
PubMed CAS Google Scholar
Ten Kate LP, Scheffer H, Cornel MC, Van Lookeren Campagne JG (1991) Consanguinity sans reproche. Hum Genet 86:295–296
PubMed Google Scholar

Download references

Acknowledgment

We acknowledge the financial support from the Netherlands Organization for Health Research and Development (ZonMw, project no. 60040005)

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Department of Clinical Genetics, Community Genetics Section, and EMGO Institute for Health and Care Research, VU University Medical Center, Amsterdam, The Netherlands
Leo P. ten Kate, Marieke Teeuw, Lidewij Henneman & Martina C. Cornel

Authors

Leo P. ten Kate
View author publications
You can also search for this author in PubMed Google Scholar
Marieke Teeuw
View author publications
You can also search for this author in PubMed Google Scholar
Lidewij Henneman
View author publications
You can also search for this author in PubMed Google Scholar
Martina C. Cornel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leo P. ten Kate.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

ten Kate, L.P., Teeuw, M., Henneman, L. et al. Autosomal recessive disease in children of consanguineous parents: inferences from the proportion of compound heterozygotes. J Community Genet 1, 37–40 (2010). https://doi.org/10.1007/s12687-010-0002-4

Download citation

Received: 13 October 2009
Accepted: 06 January 2010
Published: 25 February 2010
Issue Date: March 2010
DOI: https://doi.org/10.1007/s12687-010-0002-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Autosomal recessive disease in children of consanguineous parents: inferences from the proportion of compound heterozygotes

Abstract

Introduction

Methods

Results

Discussion

References

Acknowledgment

Open Access

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation