Introduction

Recurrent miscarriages (RM-OMIM: 614389) are considered as post-implantation failures in natural conception; and clinically described as three or more consecutive miscarriages before the 20th week of pregnancy. RM is a heterogeneous disease associated with various maternal and fetal factors.1 Pregnancy is considered as an unique immunological paradox in which a specific population of T lymphocyte described as CD4+CD25+ regulatory T (Treg) cells protect fetus during gestation by inhibiting maternal allo-immunity reactions in opposition to paternal antigens among fetal cells.2, 3 Treg cells regulate the effector function of T cell, for instance helper T-cells and Th17 cells.4 Th17 cells secrets interleukin (IL)-17, which is involved in the determination of inflammatory reactions, act together compassionately with helper T-cells and are probably associated with idiopathic recurrent miscarriages (IRM).5 As IRM women show a considerably decreased frequencies of Treg cells,6 and decreased inhibitory capability of regulatory T-cells have been reported in peripheral blood along with decidual tissues.7 CD28-mediated activation of NF-kappaB family of transcription factors initiate Treg cell development.8 Hence, we hypothesized that the decreased number and/or functional insufficiency of Treg cells is due to the genetic variations in CD28 and NFKB1 may increase the risk to IRM.

Human CD28 gene is positioned on the long arm q33.2 band of chromosome 2 with a 52.44 Kb nucleotide size, and comprises 4 exons and 3 introns. CD28 is recognized as a co-stimulatory molecule, which is constitutively expressed on naive and activated T-cells. The interaction of CD28 to their CD80 (B7.1)/CD86 (B7.2) ligand-mediated signals encourages differentiation and proliferation of T lymphocyte, and increases production of antibody through B lymphocytes.9, 10 Deficiency of co-stimulatory signal leads to T-lymphocyte tolerance and anergy.11 In vitro and in vivo studies suggested the deregulation in CD28 pathways result into the perturbed adaptive immune responses.12, 13 Thus, it may be postulated that the deregulation in CD28 pathways due to genetic variants may cause the predisposition to IRM.

Human NFKB1 maps to band q24 of chromosome 4 and constitute a size of ~115.97 kb having 25 exons and 24 introns. The NF-κB of mammalian family consists of five proteins namely; NF-κB1, NF-κB2, RelA, RelB and c-Rel.14 NF-kB has been considered as an essential transcription factor for sustaining normal immune homeostasis; an insufficient NF-kB stimulation may mediate inflammation.15 NF-κB1 suppresses NF-κB2 and act as a main player in the regulation of their activity. NFKB1 and NFKB2 genes encode the proteins NF-κB1 and NF-κB2, respectively. Considering the important role of NF-κB signaling pathway in the control of inflammatory responses, the genetic alterations in NFKB1 gene content may modulate the risk for IRM.

Although the research on CD28 and transcription factor NF-κB family of proteins in IRM are still in its infancy, however, few studies have provided the insight into their potential role in IRM. Interestingly, a study revealed significantly reduced CD28 expression in IRM women and recommended that a disproportion in the ratio of CTLA4/CD28 expression on the feto-maternal membrane inhibits the activity of T cell in pregnant women, which may corroborate increased risk to IRM.16 Further, a study conducted on animal model of early pregnancy has suggested stimulated NF-κB for the duration of uterine receptivity among equally the cyclic (breeding cycle) and pregnant endometrium.17 Similarly, in human it has been reported that placental NF-κB stimulated ~10-fold in women with pre-eclampsia.18 Therefore, in this study we made an assumption that the deregulation in the expression and activity of CD28 and NF-κB may be due to the genetic variations or single nucleotide polymorphisms (SNP) in these genes and hence, may be associated with increased risk to IRM.

CD28 gene polymorphisms have been linked with the autoimmunity disorders, like Bechet’s disease19 and rheumatoid arthritis.20 Recently, NFKB1 gene variants have been reported in relation to autoimmunity disorders, such as systemic lupus erythematosus (SLE),21 rheumatoid arthritis21 and extensive colitis in inflammatory bowel disease.22 However, CD28 and NFKB1 gene polymorphisms have not been investigated previously in IRM cases. Therefore, we conducted this study to fulfill these spaces.

Tag-SNP rs3116496 T/C exists in intronic region of CD28 gene. NFKB1 tag-SNPs namely; rs28362491 ins/del and rs696 A/G are located in the promoter and 3′ un-translated region (UTR), respectively. We aimed to investigate the association of functional variants of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) with IRM cases.

Materials and Methods

Subjects and recruitment criteria

This study cohort consists of cases that went under assessment for the RM in the out-patients department (OPD) Department of Medical Genetics, Sanjay Gandhi Post-Graduate Institute of Medical Sciences (SGPGIMS), Lucknow. RM cases included in this study were recruited from November, 2011–August, 2015. All the registered RM women were primary aborters with a history of at least three recurrent pregnancy losses. We used a well-structured study proforma to note the detailed clinical information of RM and control women before their recruitment in the present study. All recurrent miscarriages cases included in this study are of clinical intrauterine pregnancies. The exclusion and inclusion criteria for recurrent miscarriages and control women have been provided in the section below.

RM women were assessed for various known causes of pregnancy losses through proper investigations such as karyotypes of the couple; antiphospholipid antibodies including lupus anticoagulant (PLR, 0.8–1.05) and anticardiolipin antibodies (IgG 0–12 IgG anticardiolipin units, immunoglobulin M 0–5 IgM anticardiolipin units) and prothrombotic risk factors including factor V Leiden and prothrombin mutations; day 21 progesterone levels, prolactin level; glycaemic curve, thyroid hormone levels. The uterine anomalies and cervical incompetence were ruled out by history, ultrasonography and appropriate investigations.

Several genetic association studies have been conducted in IRM cases. Some studies have found genetic variants associated with IRM,23, 24, 25 whereas other studies have reported no association for genetic factors with IRM.26, 27 Thus, the field is still muddled and only the genetic studies conducted on IRM, which excluded the cases with abnormal parental karyotype will clarify this situation. Considering the facts that if a fetus was aneuploid, investigating the CD28 and NFKB1 tag-SNPs with clinical outcome of IRM would be illogical, hence, we excluded both the couples with aneuploid abortuses as well as couples having abnormal parental karyotypes.

Among initially screened 566 individuals, 35.34% (n=200) had unexplained origin of RM and were recruited for the present study. Our center is a tertiary care referral hospital, where RM women visited after third miscarriages, only with abortus material were included in this study and then the karyotyping was done on the abortus material. In case of women with more than three miscarriages included in this study, the karyotyping was done on the abortus material at the time of their inclusion in the study, but not on their previous miscarriages due to the unavailability of the samples. The following RM cases were excluded, which do not meet the inclusion criteria: secondary aborters (n=44), abnormal parental karyotype (n=38), abortus material having aneuploidy (n=52), maternal thrombophilias (n=56), endocrine defects (n=76), abnormal uterine anatomy (n=52) and plasma lupus anticoagulant (n=14); we have also excluded cases, which were not from the same linguistic group (n=34). Among cases abnormal parental karyotype (n=38) was seen in 14 male and 24 female partners. The 14 male cases revealed structural chromosomal abnormalities like (i) balanced translocations (n=10), (ii) balanced Robertsonian translocation (n=2) and (iii) inversion (n=2). In female cases, 8 had numerical and 16 had structural chromosomal abnormalities. The structural chromosomal abnormalities were either balanced translocations (n=12) or balanced Robertsonian translocation (n=4). The anueploidy of abortus material (n=52) included monosomy X (n=18), triploidy (n=10), trisomy of chromosome 13 (n=4), trisomy of chromosome 15 (n=2), trisomy of chromosome 16 (n=14), trisomy of chromosome 18 (n=2) and trisomy of chromosome 21 (n=2). After excluding all the known causes of recurrent miscarriages among 566 RM women recruited initially for the present study, we remained with only 200 RM women with no known cause of RM for the genetic analysis of CD28 and NFKB1 tag-SNPs.

We included 300 control women with age and ethnicity matched to IRM (Supplementary Table S1) with minimum two live births and with no history of miscarriages, pre-eclampsia, ectopic pregnancy or preterm delivery. Controls were randomly chosen from the same population (based on caste system) during the same period, matched by age (±3 years) and residential location with cases. Two independent IRM cases and controls sets were constructed. The first set of cases–controls comprised Hindu women (100 cases and 200 controls), which matched literally 1:2 as they are more frequent in North Indians. The second set of case–control cohort matched 1:1 and comprises North Indian Muslim women (100 cases and 100 controls) as their populations are comparatively lower. The ethnic classification was based on caste system among North Indian populations. The study cohort recruited in the present study belong to the Uttar Pradesh state of North India and fall within the same linguistic group (Indo-Aryan speakers), which indicates that IRM and control women were of the same ethnicity. Proper stratification of IRM and control women was performed using different Alu repeat markers as reported in our earlier study.28 Institutional Ethics Committee of SGPGIMS, Lucknow granted the permission for the present study and the study was conducted in accordance to the ethical standards laid down by the Declaration of Helsinki for performing medical research in human subjects. All subject provided written informed consent before their inclusion in the study.

Selection of tag-SNPs in CD28 and NFKB1 gene regions

We carried out selection of tag-SNPs in CD28 and NFKB1 gene regions by using International HapMap database (http://hapmap.ncbi.nlm.nih.gov/index.html.en) and Ensembl (http://www.ensembl.org/index.html) SNP databases. Tagger software was employed for tagging with the pairwise tagging algorithm r2 ⩾0.8 with a minor allele frequency (MAF) more than 5% available in each database.29 We selected a MAF of >5% to maintain the balance for the factors such as: (a) the existing sample size, (b) a reasonable power of the study (at least 80%) to detect the risk allele and (c) the number of tag-SNPs to be investigated. Additional criteria for the selection of tag-SNPs was based on published scientific reports to demonstrate their potential role in autoimmunity or inflammatory diseases in CD2819, 20 and NFKB1 gene region.22, 30 Consequently, in this study we selected three-tag SNPs in the gene regions of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) with minor allele frequencies >5% in Asian population.

Laboratory analysis

Three milliliters of venous blood was taken from each IRM and control women in collection vials coated with ethylene diamine tetra acetic acid and preserved at or below −20 °C until use. Extraction of genomic DNA was carried out by QIAGEN genomic DNA extraction kit (Brand GMbII and Co KG, Cat # 51104, Valencia, CA, USA).

Determination of genetic variants of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) gene regions were performed through polymerase chain reaction (PCR) pursued by restriction fragment length polymorphism (RFLP) analysis as mentioned in the earlier published reports.20, 31 The primers and restriction enzymes used for the determination of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) polymorphic markers have been provided in Supplementary Table S2. Detection of genotypes in each IRM and control women has been carried out in absence of prior information about their control or disease status. We constituted two independent set each with 100 cases and 150 controls. Two independent workers have carried out genotyping on each set independently without the knowledge of subject’s case or control status. On comparison we found 10% discrepancy in genotyping results of PCR–RFLP assay. These samples were again repeated blindly, and on comparison we found the concordance in the genotyping result. Further, Sanger sequencing was performed on nearly 20% of the randomly selected samples for the validation of PCR–RFLP genotyping analysis. The genotypes obtained from both Sanger sequencing and PCR–RFLP analysis were compared for quality control, which revealed 100% similar genotype call rate.

Statistical analysis

The estimation of sample size for IRM and control groups was carried out by using QUANTO version 1.2 (http://biostats.usc.edu/Quanto.html) program in the supervision of a statistician. We considered a power of 80% with 5% type 1 error (α=0.05) for sample size calculation. Chi-square (χ2) test was used for estimation of the Hardy–Weinberg equilibrium P-value for rs3116496, rs28362491 and rs696 tag-SNPs in both IRM and control cohort. We used logistic regression analysis for the estimation of odds ratios (OR) with 95% confidence intervals (CI) using genotypes as well as alleles of the rs3116496, rs28362491 and rs696 tag-SNPs as the independent factors, and the risk of IRM being the dependent variable. Several confounding variables like ethnicity and age of IRM women at the time of diagnosis were included in the analysis, but removed from the final model because of their minimal affect on disease causation (not altering OR>5%), therefore, unadjusted logistic regression analysis was used.

We address the issue of multiple testing for CD28 and NFKB1 tag-SNPs by using the Benjamini–Hochberg approach, which regulates the false discovery rate (FDR).32 The FDR-adjusted P-value has been computed by regulating FDR at 5% level to evaluate the statistical significance of studied CD28 and NFKB1 tag-SNPs following correction for multiple testing. After correction for multiple testing P-values less than or equal to 0.05 were considered as statistically significant. This study employed bootstrap re-sampling approach for the purpose of internal validation of the presented observations. We produced 500 bootstrapped samples from the genetic data of CD28 (rs3116496) and NFKB1 (rs28362491 and rs696) SNPs, and then computed genotype and allele specific P-value for three studied SNPs. Statistical tests were carried out by using the R language for statistical computing (https://cran.r-project.org/bin/windows/base/old/3.2.4/).

Multifactor dimensionality reduction (MDR) analysis was carried out by using MDR software version 1.2 (https://cran.r-project.org/web/packages/MDR/index.html). MDR analysis uses a non-parametric test to calculate all the possible one to three way SNP–SNP interactions. MDR approach has been explained in detail elsewhere,33 and applied in this study among IRM and control cohort. In MDR analysis, a permutation test was used to calculate the multiple tests-adjusted P-value.

Expressed quantitative trait locus (eQTL) mapping analysis

We determined the functional affects of CD28 and NFKB1 tag-SNPs using eQTL mapping analysis. The eQTL mapping analysis was performed for CD28 tag-SNP by using blood eQTL.34 Human genetic variation database (http://www.genome.med.kyoto-u.ac.jp/SnpDB/index.html) was used for the eQTL mapping analysis of NFKB1 tag-SNPs.

Results

Demographic clinical features of IRM and control women

Two hundred IRM cases and three hundred fertile control women were screened for the evaluation of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) polymorphic markers. We mentioned the demographic characteristics of cases and control women in Supplementary Table S1. We have selected age and ethnicity matched IRM (mean age: 28.6±3.2) and control (mean age: 29.2±6.8) women.

Distribution of CD28 and NFKB1 gene variants

Distribution of minor allele frequencies of the CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) tag-SNPs are shown in Table 1. CD28 and NFKB1 tag-SNPs included in this study were observed in Hardy–Weinberg equilibrium among women of IRM and control group in both combined data set and sub-group analysis of Hindu and Muslim populations (Supplementary Table S3). The combined data set consists of both IRM cases and controls from the sub-group of Hindu and Muslim populations (Table 1). In combined data analysis, we observed nearly three- to twofold increased risk in single marker analysis for CD28 rs3116496 T/C tag-SNP in additive and recessive models, and NFKB1 tag-SNPs rs696 A/G and rs28362491 ins/del in additive, dominant and recessive models in IRM cases (Table 1). Further, in sub-group analysis of Hindu and Muslim populations the minor allele homozygous genotype carriers of IRM women revealed an increased risk of almost threefold in additive model and twofold in recessive model for both NFKB1 tag-SNPs rs28362491 ins/del and rs696 A/G (Table 1). However, it is important to note that almost similar frequencies of minor allele homozygous genotypes of rs3116496 T/C (CD28) were found among IRM women (25% and 24.5%) and controls (19% and 19%) in sub-group analysis of both Hindu and Muslim populations, however, differences were significant only in cases of Hindu women but not in Muslim women (Table 1), this may be because of smaller sample size in later group. Interestingly, CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) tag-SNPs, which revealed significance after corrections for multiple testing in IRM cases were found significant in the bootstrap analysis presenting internal validation to our observations in single marker analysis in both combined data analysis and sub-group analysis.

Table 1 Distribution of CD28 (rs3116496) and NFKB1 (rs28362491 and rs696) gene variants among IRM cases and controls

SNP–SNP interaction analysis

We determined all the possible one to three way SNP–SNP interactions by using MDR model. MDR analysis revealed 10.5-fold increased risk under three-factor model of rs3116496 T/C, rs28362491 ins/del and rs696 A/G tag-SNPs (OR=10.5, 95% CI=6.50–16.94, P=1.000E−04). The three-tag SNPs model shows maximum testing balanced accuracy (TBA), highest (10-fold) cross validation consistency (CVC) and greater significance (Table 2). Meanwhile, in two-factor (rs28362491 ins/del and rs696 A/G) and one-factor (rs28362491 ins/del) models the TBA, CVC and significance were reduced, respectively (Table 2).

Table 2 Multifactor dimensionality reduction (MDR) models showing association of high-order SNP–SNP interactions with IRM cases

eQTL mapping analysis for CD28 and NFKB1 tag-SNPs

We performed eQTL mapping analysis to determine the functional affects of bi-allelic tag-SNPs of CD28 rs3116496 T/C, and NFKB1 rs696 A/G for which we observed the significant difference between IRM’s and control women at genomic level in this study. However, we did not included NFKB1 rs28362491 marker in eQTL mapping analysis because it is an insertion/deletion polymorphism and may not capture by the same probes used in eQTL mapping analysis for the bi-allelic SNPs. As tag-SNPs are situated in the 5′UTR/3′UTR, it has been reasonable that tag-SNPs may affect the abundance of transcript of the studied polymorphisms. Blood eQTL and Human Genetic Variation Database was used for eQTL mapping analysis for minor variant at 5′UTR of CD28 rs3116496 and 3′UTR of NFKB1 rs696 tag-SNP, respectively, which revealed that these tag-SNPs belong to Cis-eQTL and may be responsible for the changes in abundance of transcripts of CD28 and Family with sequence similarity 177 member A1 (FAM177A1) genes, respectively (Table 3). Interestingly, the eQTL analysis suggested that the genotype of rs696 tag-SNP was significantly associated with the changes in expression level of FAM177A1, but not NFKB1.

Table 3 eQTL mapping analysis for CD28 and NFKB1 tag-SNPs

Discussion

Co-stimulatory gene CD28 is vital for the production of immune responses. The binding of CD28 to CD80 (B7.1) and CD86 (B7.2) on antigen presenting cells, is arguably the most effective co-stimulatory molecule. Disturbance of CD28/B7 co-stimulation has been suggested to attenuate the severity of inflammation in some autoimmune disease models.35 The vital controllers of NF-kB stimulation in T-cells are antigenic T cell receptor and CD28 receptor.36 During inflammatory responses, NF-kB is important for the stimulation of T-cells, differentiation and survival. As CD28 and NFKB1 genes have an important function in the control of inflammatory responses, therefore, the genetic variation in these genes may be associated with etiology of IRM. This study has been designed to evaluate the relationship of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) gene variants with IRM cases.

This study found considerably increased prevalence of minor homozygous genotypes and minor allele frequencies of rs3116496 T/C, rs28362491 ins/del and rs696 A/G tag-SNPs in IRM cases as compared with controls, suggesting the risk association ranges from three- to twofolds for IRM (Table 1). Similarly, in sub-group analysis of Hindu and Muslim populations the minor allele homozygous genotype carriers of IRM women revealed an increased risk of almost threefold in additive model and twofold in recessive model for both NFKB1 tag-SNPs rs28362491 ins/del and rs696 A/G. Further, it has been worth to mention that almost similar frequencies of minor allele homozygous genotypes of rs3116496 T/C (CD28) were found among IRM women (25% and 24.5%) and controls (19% and 19%) in sub-group analysis of both Hindu and Muslim populations, however, differences were significant only in cases of Hindu women but not in Muslim women (Table 1), this may be because of smaller sample size in later group. Interestingly, rs3116496 T/C, rs28362491 ins/del and rs696 A/G tag-SNPs, which revealed significance after corrections for multiple testing in IRM cases stayed significant in bootstrap analysis presenting internal validation to our observations in both combined data analysis and sub-group analysis. MDR analysis was performed to estimate the possible SNP–SNP interactions in women of IRM and control groups. Interestingly, MDR analysis suggested 10.5-fold enhanced risk in three-factor model (rs3116496 T/C, rs28362491 ins/del and rs696 A/G) for IRM as compared with controls. The three-tag-SNP model revealed maximum TBA, greatest CVC and highest significance for possible prediction of IRM. The eQTL mapping analysis further strengthens our observations and revealed the influence of minor variants at CD28 rs3116496 T/C, and NFKB1 rs696 A/G tag-SNPs on the abundance of transcripts of CD28 and FAM177A1 genes, respectively (Table 3). It is important to consider that the genotype of rs696 tag-SNP was significantly associated with the changes in expression level of FAM177A1, but not NFKB1. We performed eQTL mapping analysis only for bi-allelic SNPs of CD28 rs3116496 T/C, and rs696 A/G but not for rs28362491 ins/del polymorphisms because eQTL for ins/del polymorphisms may require different probe set and may not be capture by the same probes used in eQTL mapping analysis for the bi-allelic SNPs. The earlier report may recommend the influence of CD28 gene variants on human diseases risk, potentially by altering CD28 function and/or expression.20 The findings of this study are in concordance with earlier studies that suggested a risk association for rs3116496 T/C tag-SNP of CD28 with human diseases like rheumatoid arthritis20 and Bechet’s disease.19 It has been shown that minor homozygous (C/C) genotype of rs3116496 T/C tag-SNP reduces the sCD28 levels.20 The down-regulation of sCD28 levels may affect CD28/B7 co-stimulation, and therefore, might be associated with the susceptibility to IRM.

Previous studies have reported that minor homozygous genotypes of NFKB1 namely; rs28362491 ins/del and rs696 A/G tag-SNPs are linked with augmented risk to endometriosis37 and extensive colitis in inflammatory bowel disease,22 respectively, which are consistent with the observations of this study among IRM women. The potential role of tag-SNPs of NFKB1 namely; rs28362491 ins/del and rs696 A/G in the correlation of disease risk appears to be associated with functional activity and gene expression of NFKB1, which have been considered vital in the regulation of crucial cellular mechanisms such as apoptosis and cell death independent of the NF-κB complex.38 The earlier studies suggested that rs28362491 ins/del tag-SNP in the promoter (5′UTR) element of NFKB1 has regulatory affect on gene expression of NFKB1, and for insertion (ATTG1) allele the functional activity has been found two times more than the deletion (ATTG2) allele.39 A functional report has proposed that minor alleles of 3′UTR of NFKB1 gene significantly decrease NFKB1 mRNA stability.31 Thus, it may be anticipated that the difference in the expression of NFKB1 gene due to the variants of 5′UTR and 3′UTR of this gene may modify the risk of IRM.

Limitation of this study is that we have not estimated the CD28 and NFKB1 at proteomic levels, hence, future studies need to accomplish this issue for the complete understanding of the function of CD28 and NFKB1 in the etiology of IRM. The evaluation of potential link between alleles of risk variant and protein concentrations of CD28 and NFKB1 in blood obtained from women with IRM may provide useful information in the identification of a suitable biomarker. The pharmacologic approaches that could modulate these concentrations might be evaluated for the identification of their potential role in the determination of IRM risk.

Taken together, this study suggests an association of CD28 (rs3116496 T/C) and NFKB1 (rs28362491 ins/del and rs696 A/G) gene variants with augmented risk to IRM. The eQTL mapping analyses performed in this study strengthen these observations, and suggested that the variation in the CD28 and NFKB1 gene content may affect the abundance of transcripts of CD28 and FAM177A1 genes, respectively. To the best of our understanding based on the available information, this is the first study, which has examined the role of CD28 and NFKB1 tag-SNPs in IRM women. This appears to be an initial report, which revealed significant findings, however, needs further validation at both global and functional level.