Introduction

Microsomal epoxide hydrolase (EPHX) catalyses the phase I hydrolysis of epoxides, playing a role in detoxification processes and in the metabolism of endogenous and exogenous compounds. The gene is located in chromosomal region 1q42.1 and two single nucleotide polymorphisms (SNPs) have been described in the coding region of the EPHX1 gene that produce two protein variants.1 Hassett et al demonstrated in in vitro expression studies of cDNA that an exon 3 T to C mutation, that changes the tyrosine residue at codon 113 to histidine, decreased EPHX enzymatic activity by approximately 40%, whereas a change of histidine to arginine at codon 139 of exon 4 decreased EPHX activity by approximately 25%.2 These polymorphisms are thought to be linked to protein stability. Zusterzeel et al described an association between the high activity genotype in exon 3 and preeclampsia.3 The authors postulated that this genotype might be reflected in enhanced enzyme activation by either endogenous or exogenous substrates to result in more reactive mutagenic diol derivatives in these patients. Polymorphism in the EPHX gene may also have other roles in the reproductive system and modify the susceptibility to spontaneous abortion4 and ovarian cancer,5 the high activity genotype being associated with ovarian cancer and protection against spontaneous abortions. Again, mechanisms behind these observations are highly speculative but may be associated with alterations in steroid metabolism.

Taken together, these reports suggest that a role of EPHX in female reproduction and especially in the development of preeclampsia is plausible but not completely understood. The Finnish population is considered to be a genetic isolate, and thus ideal for genetic association studies.6 We designed a controlled study to test the possible association between two EPHX gene SNPs and preeclampsia to determine whether such an association exists across populations.

Materials and methods

Written approval for the study was obtained from the Ethics Committee of Kuopio University Hospital. Informed consent was obtained from all patients and the 115 controls, and documented.

Information was collected retrospectively in connection with 133 preeclamptic pregnancies of primiparous women and 115 control women with no history of preeclampsia who delivered at Kuopio University Hospital between January 1994 and December 1998. Using the Birth Registry at Kuopio, we sought preeclamptic patients, called them back, and at that time they were asked to sign an informed consent. During the same time, we collected blood samples from controls who had given birth in the same hospital after uncomplicated pregnancies and who had at least two normal pregnancies, including the current one. From controls, blood was drawn for DNA analysis at enrolment. To ensure homogeneity of the genetic background, the controls, originating from a regional population and with no clinical signs of the disorder, were enrolled by random selection in this case–control study.

Preeclampsia was defined as the development of hypertension and new–onset proteinuria (>300 mg of urinary protein in 24 h) in women with no proteinuria at baseline. Hypertension was defined according to current guidelines that accept 140 and/or 90 mmHg of systolic and diastolic pressure respectively, or higher, as hypertension, when measured on two consecutive occasions at least 24 h apart (Report of the National High Blood Pressure Education Program Working Group on High Blood Pressure in Pregnancy, 2000).7 Women with chronic hypertension were excluded from the study.

DNA was extracted from peripheral blood lymphocytes using a standard phenol–chloroform extraction method. Two sequence variants of the EPHX gene, a transition of T to C leading to the replacement of tyrosine by histidine (Tyr113His) in exon 3 and a transition of A to G leading to the replacement of histidine by arginine (His139Arg) in exon 4, were detected. The target DNA sequence in EPHX gene exon 3 was amplified using primers 5′-GGG GTC CTG AAT TTT GCT CC-3′ and 5′-CAA TCT TAG TCT TGA AGT GAC GGT-3′, and in EPHX gene exon 4 the primers used were 5′-TCT GGT GCC AGA GCC TGA CCG TGC-3′ and 5′-ATG GAA CCT CTA GCA GCC CCG TAC C-3′. The incubation mixture contained 50 ng of genomic DNA, 10 pmol of the amplification primers for exons 3 or 4, 5 nmol each of four deoxynucleotide triphosphates, 1 U AmpliTaq Gold DNA polymerase (Perkin Elmer), 10 mM Tris-HCl (pH 8.3 at 25°C), 50 mM KCl, 1.5 mM MgCl2 in a reaction volume of 25 μl. The cycling protocol included initial denaturation at 95°C for 7 min, followed by a cycle of 95°C for 1 min, 55°C for 1 min (for exon 3) or 64°C for 1 min (for exon 4), and 72°C for 1 min. A total of 35 cycles were carried out, with final extension at 72°C for 10 min. For detection of the Tyr113His polymorphism, the PCR mixtures containing an amplified 198-bp DNA region were incubated with 5 U of Tth111 I enzyme (Promega), 1× Buffer B (6 mM Tris-HCl, pH 7.5 at 37°C, 50 mM NaCl, 6 mM MgCl2, 1 mM dithiothreitol), and 1 μg of acetylated bovine serum albumin in a total volume of 30 μl for 2 h at 65°C. The His139Arg polymorphism in exon 4 was detected by incubating the PCR mixtures containing a 322-bp PCR product with 5 U RsaI enzyme (Promega), 1× Buffer C (10 mM Tris-HCl, pH 7.9 at 37°C, 50 mM NaCl, 10 mM MgCl2, 1 mM dithiothreitol), and 1 μg of acetylated bovine serum albumin in a total volume of 30 μl for 2 h at 37°C. The reaction mixtures were subjected to electrophoresis on 3% agarose gels and photographed under an UV transilluminator after staining with ethidium bromide.

Two-tailed pooled t-tests were used to analyse continuous variables. Statistical analyses for comparing single-point allele and genotype frequencies were carried out using Pearson's χ2-test (two-sided asymptotic P values), with SPSS 9.0 software. Odds ratios (ORs), as the estimates of relative risk of disease, were calculated using 95% confidence intervals (CIs). We used an expectation–maximisation (EM) algorithm to obtain maximum-likelihood estimates of EPHX exon 3 and 4 haplotype frequencies with standard deviations (SD) (each haplotype consisting of a pair of SNPs, and unknown gametic phase; Arlequin ver. 2.000 software). Haplotype frequency comparisons between the preeclamptic and control groups with absolute chromosome numbers were carried out using Fisher's exact test (two-sided P values with Monte Carlo estimate at the 99% confidence level). Hardy–Weinberg distribution of genotypes in the preeclamptic and control groups, as well as pairwise linkage disequilibrium (LD) analyses, were assessed using Arlequin ver. 2.000 software. RxC program employing the Metropolis algorithm was used for analysis of contingency tables to obtain unbiased estimates of exact P values with standard errors (SEs) in haplotype analyses. Power determinations were performed using nQuery Advisor Release 3.0 software. The level of statistical significance was defined as P <0.05.

Results

Clinical characteristics of preeclamptic women and controls are shown in Table 1. All preeclamptic women were nulliparas, whereas controls had two or more deliveries. However, in the control group the case records of their first pregnancies were also used for comparison to exclude the confounding effect of parity on pregnancy outcome. On average, deliveries occurred 5 weeks earlier in preeclamptic women than controls. The mean (± standard deviation (SD)) gestational age at the development of preeclampsia was 31.7 (±3.5) weeks. There were no cases with HELLP syndrome. When only the first-degree relatives of the index patients were taken into account, a positive family history was reported in 30 affected women, who had six affected sisters and 28 affected mothers.

Table 1 Clinical characteristics

To determine whether the EPHX SNP loci at exon 3 (T→C; Tyr113His) and exon 4 (A→G; His139Arg) were associated with preeclampsia, we initially compared the single-point genotype and allele distributions of these SNPs between preeclamptic and control groups (Table 2). Genotype and allele distributions of exons 3 and 4 SNPs did not reveal statistically significant single-point association with preeclampsia. However, the T allele of the exon 3 SNP (Tyr113) was over-represented among the preeclampsia group (0.74) when compared with the control group (0.66), displaying a borderline association with preeclampsia (P=0.05; OR 1.46 (95% CI: 0.99–2.15)). To study the above-mentioned borderline association result in terms of detection power, we estimated the possible number of subjects needed for a significant result with the exon 3 polymorphism, using simulation analysis. This analysis indicated that more than 150 patients and controls would have been needed at 80% power to show a significant difference (P<0.05) in allele frequencies between patient and control groups.

Table 2 Genotype and allele distributions of the EPHX gene among women with preeclampsia and healthy pregnant controls

The borderline allele association result regarding the exon 3 SNP, as well as the fact that there is evidence that analysis of two-loci haplotypes yields results that are occasionally more significant than those of single-locus tests,8 prompted us to use haplo type estimation analysis in evaluating the role of the EPHX gene in preeclampsia. Haplotype estimation analysis of exons 3 and 4 SNP genotype data by means of the maximum-likelihood method employing an expectation–maximization algorithm (EM) revealed that estimated pair of loci haplotype frequencies between the preeclampsia and control groups differed significantly (P=0.02) (Table 3). Moreover, according to single haplotype association analysis, the high activity haplotype T-A (Tyr113–His139) was significantly overrepresented in the preeclampsia group (P=0.01; OR 1.61 (95% CI: 1.12–2.32)). To obtain unbiased estimates for the above-mentioned exact test P values, the Metropolis algorithm was used to analyse overall (P=0.017; SE±0.003) and T-A (P=0.008; SE±0.001) haplotype contingency tables. In addition, power analysis was also applied to haplotype association analysis to evaluate whether the chromosome number used in the analysis was sufficient to detect a true positive association with preeclampsia. According to simulation analysis, approximately 370 chromosomes would be needed in each group for a statistically significant value (80% power; P<0.05), calculated on the basis of the haplotype T-A odds ratio (OR=1.61). On the other hand, taking account of the 95% confidence intervals for the haplotype T-A odds ratio (1.12 to 2.32), it was estimated that the number of chromosomes needed would range from 140 (upper OR=2.32) to 6000 (lower OR=1.12) when the power limit is set at 80%.

Table 3 Estimated haplotype frequency distributions of EPHX gene exons 3 and 4 polymorphisms in preeclamptic and control chromosomes

In order to study the extent of linkage disequilibrium (LD) between exons 3 and 4 loci, we performed pairwise LD analysis. Interestingly, pairwise LD analysis between exons 3 and 4 loci, which are located approximately 7 kb from each other, revealed stronger LD for all pairs of loci in the preeclampsia group (P=0.03) than in the control group (P=0.85). This result is consistent with the assumption that Finnish preeclampsia patients share a common genetic susceptibility background, and it shows that the LD between these two loci is reduced in both groups, which is probably a consequence of the extended physical distance (7 kb) between these loci.

Discussion

The familial nature of preeclampsia clearly indicates a significant genetic component,9,10 and this genetic component probably comprises multiple gene variants each contributing a small effect.11,12,13,14,15,16,1718 In this study, the candidate gene was selected on the basis of the previously reported association between preeclampsia and EPHX3, and tested for significance using single-point as well as haplotype association analyses with two intragenic SNPs. A borderline association was found between the T allele of the exon 3 SNP and preeclampsia and it was similar, although weaker than previously described.3 To further explore the association we undertook pair of loci haplotype estimation analysis, which involves the two SNP alleles, and we found that the haplotype T–A (Tyr113–His139) was significantly associated with preeclampsia. The validity of such an approach may be questioned, but predictions based on empirical data drawn from the literature currently support the feasibility of haplotype estimation analysis for detecting association more efficiently than single-point association analysis.8 The association between preeclampsia and haplotype T–A (Tyr113–His139) was apparent and this multipoint association may be of biological significance, since two SNPs in the coding region were used and both over-represented genetic variations result in an effect in the same direction, both increasing enzyme activity.

Any polymorphism is likely to make only a small individual contribution to the disease phenotype. A panel of two functional polymorphisms was required in the search for significant association in this study. The erosion of exons 3 and 4 inter-marker association as a result of recombination over generations was obvious according to the pairwise LD analysis. Therefore, it was preferable to use haplotype analysis to investigate the possible association between the disease and multiple SNPs before ruling out the candidate gene as contributing to preeclampsia. This also illustrates the difficulty that can be present in case–control association studies with negative results, where methodological flaws may compromise the validity of data, especially when P values nearing significance are rejected and lead attention away from promising candidate genes. We have here shown that statistical analysis based on haplotypes can be more informative than multiple single-locus tests.

Zusterzeel et al. also found a significant association between EPHX SNPs and preeclampsia, and in their study, tests for allelic associations between two SNPs, were conducted in their case and control groups separately.3 The tests were based on the multinomial probability of the two-locus genotype instead of a pair of single-locus genotypes. Based on a priori information linking the wild-type genotypes of the EPHX gene to high-activity phenotypes, they utilised predicted fast vs slow combined genotypes and concluded that the high activity genotype combination was associated with preeclampsia.2 Another interesting finding was that SNP allele frequencies of the EPHX gene vary considerably across human ethnic groups and populations. Frequencies in our populations were similar to those found in some other white populations,2,19,20 whereas in the Netherlands the population frequencies were similar to those of Chinese controls.3

In summary, the present data on Finnish women support the hypothesis that EPHX gene exon polymorphisms may be related to preeclampsia risk, as previously reported by Zusterzeel et al.3 The frequency of the haplotype T-A (Tyr113–His139) was significantly associated with preeclampsia, and this observation was in line with the results reported by Zusterzeel et al, suggesting that this particular haplotype may lead to an increased risk of preeclampsia.3 Further studies are called for to elucidate the influence of EPHX on the development of preeclampsia.