Introduction

Hashimoto’s thyroiditis (HT) is a chronic autoimmune disease and the incidence of HT has increased worldwide and is estimated to be 0.3–1.5 cases per 1000 people per year.1 The disease is characterized by abnormities of thyroid function as a result of the production of thyroid anti-antibodies (thyroid peroxidase and antithyroglobulin) and generation of thyroid antigen-specific lymphocytes, which in turn lead to gradual destruction of the gland by infiltrating the thyroid gland with immune effector cells.2 To date, previous studies have confirmed HT to be a multifactorial disorder, meaning that both genetic and environmental predisposing factors are involved in the pathogenesis of HT.3, 4, 5 In addition, a recent study has indicated that females, compared with males, have a much higher risk of autoimmune thyroid disease.6

As the most common type of chronic autoimmune thyroid disease,3, 7 variations in several candidate genetic loci have been identified as contributing to one’s risk of developing an HT phenotype by previous linkage and association studies.8, 9 Among these candidate genes, the SEPS1 gene has gained much attention. This gene encodes a member of the selenoprotein family, Selenoprotein S, which is widely expressed in many tissues. The organization of SEPS1 is not complex for encoding a 189-amino acid protein and only contains 7 exons spanning a 7-kb region located on human chromosome 15 at 15q26.3. The SEPS1 gene was first suggested to be related to the stress response process including immune and inflammatory processes through the study of Curran et al.10 In addition, several diseases, including inflammatory disorders such as insulin-dependent diabetes mellitus, Alzheimer’s disease and celiac disease, have been shown to be linked to the region.11, 12, 13, 14 Therefore, in the sense of function and position for involvement in inflammation-related disorders, it is reasonable for the SEPS1 gene to be considered as a potential candidate gene.

Recent studies have provided supportive evidence for the potential association between rs28665122 in the SEPS1 gene and a wide spectrum of disease,15, 16, 17, 18, 19, 20 including HT.21 However, a previous study by Santos et al.21 on the positive association of SEPS1 with HT was from a Portuguese population. Although an association study has been proved to be a powerful tool in unraveling the genetic mechanisms of several complex disorders,22, 23, 24, 25 it is clear that an association study has not explained most of the underlying genetic risks for many diseases. Identifying individual candidate genes/variants on disease risks remains important. Despite evidence of a strongly significant association in Europeans, the underlying biological mechanisms are largely unknown and the genetic loci that contribute to the disorder remain unknown. Therefore, it is still necessary to explore the potential association between SEPS1 polymorphisms and HT in additional genetically independent populations.

To confirm the association between the SEPS1 gene and HT, further studies in a larger sample size may be required. In the present study, we conducted the first large genetic association study of the SEPS1 gene containing a single-nucleotide polymorphism (SNP) of rs28665122, which is significantly associated with HT in Europeans, to determine whether the SEPS1 gene is associated with HT in the Han Chinese population.

Materials and Methods

Patients and controls

We implemented a case–control design for our study. A total of 4011 individuals, ranging from 21 to 68 years old, were randomly recruited for our study. We enrolled Han Chinese individuals with no migration history within the immediate three generations from Shaanxi Province. To avoid socio-economic status being a potential confounding factor, all of the participants had a similar socio-economic level. Living situation, annual income, basic education and subject characteristics served as the primary socio-economic indicators for this study. A face-to-face interview with a standard questionnaire was conducted.

A total of 1013 HT patients (791 females/222 males, range 23–66 years, mean age=35.13 years, s.d.=11.53) were diagnosed and recruited at the Second Affiliated Hospital of Xi’an Jiaotong University (Xi’an, China). These patients were enrolled between November 2011 and August 2014, and patients were selected based on an enlarged thyroid, characteristic ultrasound signs (hypoechogenicity and non-homogeneous texture) and a high level of either anti-thyroid peroxidase (normal range in the method used: 0–34 IU ml−1) or anti-thyroglobulin (normal range in the method used: 0–115 IU ml−1), with or without clinical and biochemical hypothyroidism. The Roche Diagnostics COBAS 6000 E601 Module Immunochemistry Analyzer (Roche, Basal, Switzerland) was applied for thyroid function tests. For suspected cases, the diagnoses were conducted by fine needle aspiration cytology (FNAC). To control potential confounding factors, participants with a reported history of thyroid cancer and/or previous thyroid surgery were excluded from the study. All patients underwent thyroid hormone replacement therapy.

We recruited 2998 unrelated healthy controls (2345 females/653 males, range 21–68 years, mean age=33.54 years, s.d.=12.51) from the Health Check-up Center of the same hospital, who did not have any personal or family history of thyroid disease. It is well known that especially in females, ~10% of euthyroid individuals show an elevated level of thyroid-specific autoantibody and they have the possibility of having autoimmune thyroiditis. Thus, the healthy controls collected in our study had normal thyroid functions and no ultrasound changes were found in the thyroid, and they were negative for thyroid autoantibodies (within the normal ranges mentioned above). Exclusion criteria also included the existence of any comorbid cardiac, autoimmune, infectious, musculoskeletal or malignant disease, or a recent history of operation or trauma. This study was conducted in accordance with the ethical guidelines of the Declaration of Helsinki (version 2002) and was approved by the Xi’an Jiaotong University Ethics Committee. All participants completed written informed consent forms. Baseline characteristics and primary clinical information for all subjects were obtained and are summarized in Table 1.

Table 1 Baseline characteristics and clinical parameters of patients and controls

SNP selection and genotyping

We have chosen our candidate SNPs through the dbSNP and HapMap databases. As an initial screen of the most common SNPs in the Han Chinese population, a minor allele frequency 0.05 was used as the cut-off value. We searched for all SNPs with minor allele frequency 0.05 between 5 kb upstream and 5 kb downstream of the SEPS1 gene in the CHB databases of HapMap and dbSNP. Based on the criteria, we finally selected seven SNPs (rs4965373, rs78817529, rs4965814, rs2306534, rs2009895, rs28665122 and rs34713741) within the 17-kb region containing the SEPS1 gene.

Peripheral venous blood samples were collected in plain tubes and were stored at −20 °C. Genomic DNA was isolated from peripheral blood leukocytes according to the manufacturer’s protocol (Genomic DNA kit, Axygen Scientific, Union City, CA, USA) and DNA samples were stored at − 20 °C for SNP analysis. All SNPs were genotyped using high-throughput, matrix-assisted laser desorption ionization–time-of-flight mass spectrometry. The resulting spectra were processed using Typer Analyzer software (Sequenom, San Diego, CA, USA) for final data acquisition.

Statistical analysis and power analysis

We used GENEPOP v4.0 (François Rousset, http://kimura.univ-montp2.fr/~rousset/Genepop.htm) to conduct the Hardy–Weinberg equilibrium test for each SNP in both the case and control groups. Allelic and genotypic association tests were performed using the genetic analysis software CLUMP v2.4 (Dave Curtis, http://www.davecurtis.net/software/dcurtis/clump24.zip). With 10 000 empirical Monte Carlo simulations implemented by this software, multiple comparison problems can be solved.26 Haploview v4.2 (Broad Institute of MIT and Harvard, http://www.broadinstitute.org/scientific-community/science/programs/medical-and-population-genetics/haploview/haploview) was used to investigate the linkage disequilibrium (LD) structure of the candidate markers. To further examine the LD structure around the significant SNP, we used HapMap data as a reference data set to generate a regional LD plot. Haplotype frequency was calculated using GENECOUNTING v2.2 software.27, 28 All haplotypes including rare ones (frequency <0.01) were selected for a significant test through a likelihood ratio test, followed by permutation testing.28, 29 Curtis and Xu30 reported that minor differences in haplotype frequency estimates can produce very large differences in heterogeneity test statistics. Thus, we also implemented score tests that account for the linkage phase ambiguity for the haplotype association.31 The score tests were originally derived from generalized linear models and have been used for global tests of association and haplotype-specific tests. Permutation was implemented to obtain the maximum haplotype-specific score statistics and its P-values. Haplo Stats package v1.6.11 (Sinnwell JP and Schaid DJ, http://www.mayo.edu/research/labs/statistical-genetics-genetic-epidemiology/software) was used for these analyses.31 We performed stratification analysis to investigate the potential effects of gender on the single SNP and haplotype association analyses. The global P-value from the haplotype analysis was calculated based on the differences of haplotypic frequency distribution among patients and controls. A P-value of 0.05 was chosen as a threshold for significance. In addition, in the study design stage, we calculated the statistical power using PGA v2.0 (Menashe et al.32). Our results showed that the power for SNP and haplotype association tests were 95% and 87%, respectively, at the false-positive rate of 0.05 and a presumed odds ratio of 2.

Results

Genotype data from 7 SNPs of the 17-kb genomic region containing the SEPS1 gene from 1013 HT cases and 2998 controls were released. Allelic and genotypic frequencies of all SNPs in the case and control groups, as well as the Hardy–Weinberg equilibrium test, are shown in Table 2. All SNPs were highly polymorphic in the case and control groups, and both groups were in Hardy–Weinberg equilibrium. Significant allelic association of rs28665122 (P=0.000518; odds ratio =1.28; 95% confidence interval 1.11–1.47) was identified in combined analysis. Genotypic association analysis confirmed this result with a significant P-value of 0.002731. There were no significant differences in the other six SNPs in both allelic and genotypic analyses (Table 2). To perform haplotype-based association analyses, we examined the LD structure within the genotype data of seven SNPs and identified a haplotype block. An LD pattern is suggested in Figure 1a, which presents the results of the LD tests between the pairs of seven SNP markers across the SEPS1 gene. According to Figure 1a, we observed a two-SNP LD structure, which was also confirmed in a regional LD structure plot shown in Figure 1b. Several strongly correlated common SNPs with rs28665122 were identified in the plot. Haplotypic association analyses were followed to test this LD block. A significant P-value (global P=0.0036) was obtained through association test of the haplotype consisting of two SNPs (rs2009895 and rs28665122; Table 3). Three haplotypes, the HAP2 (TG), the HAP3 (TA) and the HAP4 (CA), were significantly associated with HT, especially the HAP4, of which the frequency increased threefold in the case group (P=0.027), while HAP1 (CG) showed no significance (P=0.452; Table 3). The haplotype frequencies estimated by GENECOUNTING (Jinghua Zhao, http://legacy.mrc-epid.cam.ac.uk/~jinghua.zhao/software.htm) and HaploStats (Sinnwell JP and Schaid DJ, http://www.mayo.edu/research/labs/statistical-genetics-genetic-epidemiology/software) were very similar. Thus, the potential bias reported by Curtis and Xu30 that minor differences in haplotype frequency estimates could produce very large differences in heterogeneity test statistics may not affect our analysis.

Table 2 Allele and genotype frequency of single SNP association analysis
Figure 1
figure 1

(a) Estimation of LD between each pair of seven SNPs genotyped in SEPS1 gene in Han Chinese population. LD structure (D’) between marker pairs was indicated by the shaded matrices. (b) Regional LD plot of rs28665122 based on the HapMap CHB+JPT data. The r2 of other SNPs versus rs28665122 was indicated by diamond shape. Gene regions were indicated at the bottom of the plot.

Table 3 Haplotypes frequency and association analysis

To investigate the potential effects of gender in our association analyses, we stratified our analyses by males and females. We found that the rs28665122 SNP showed both allelic (P=0.002644) and genotypic (P=0.010326) association with HT in females, whereas it only showed marginal significance in males (Table 4). Similar to single SNP analysis, haplotypic analysis in females showed a significant association (global P=0.0162, Table 5). The HAP3 (TA) showed significant differences for its frequency in the female case and control groups, and an almost twofold increase was observed in female cases with a P-value of 0.004. We observed a protective effect implicated by HAP2 (TG) (P=0.024; Table 5) and this may be a result of the higher frequencies in female controls. Both haplotypic association patterns failed to be identified in males.

Table 4 Gender-specific allele and genotype association analysis in female and male
Table 5 Gender-specific haplotypes frequency and association analysis in female and male

Discussion

HT is a complex disease and chronic inflammation has been considered as an important factor in the etiopathogenesis, while a wide range of environmental factors can trigger HT and cause disease progression. As one of the novel candidate genes in association studies of diseases related to chronic inflammation, the SEPS1 gene has been the focus of genetic dissection in the past several years. A previous study has shown that the SEPS1 gene is a significant risk factor for preeclampsia in Norwegian populations,19 and another study based on a Finnish cohort has also indicated the potential relationship between the SEPS1 gene and coronary heart disease or ischemic stroke.20 However, a German study failed to identify association signals between the SEPS1 gene and inflammatory bowel disease susceptibility. In that study, the polymorphisms of the SEPS1 gene did not contribute to either a certain disease phenotype or increased tumor necrosis factor-α levels in inflammatory bowel disease patients.33 Another study in Italian and German populations also failed to confirm any significant association of the SEPS1 gene with stroke in young patients and controls.34

The purpose of our study was to investigate the potential association between seven common SNPs in the SEPS1 gene and HT in the Han Chinese population. To the best of our knowledge, we have carried out the first genetic association study of the SEPS1 gene in Han Chinese individuals. To examine whether common risk variants exist in different populations or not, we compared our study with that of Santos et al.21 In the two studies, a significant association signal was found between rs28665122 and HT, and the A allele was considered to be the risk allele. However, we also observed some differences between the two studies. The odds ratio of rs28665122 was 1.28 in our data set, compared with 2.22 in that reported by Santos et al.21 The A allele frequency of rs28665122 in the cases of HT was much lower in our study than in the study of Santos et al.21 (0.1613 in our samples and 0.2651 in the Santos et al.21 samples). In addition, we also found a sex-specific association of rs28665122 that did not exist in the previous report.21 Indeed, it is not enough to come to a conclusion simply from the analysis of some SNPs.35 In our study, two specific haplotypes, HAP3 (TA) and HAP4 (CA), have been demonstrated to be two main risk factors for HT. However, the frequency of HAP4 (CA) was too low to conclude that it was associated with HT. In addition, we found that the association signal of rs28665122 was stronger than that of the haplotypes. Considering that rs2009895 was not associated with HT, it is reasonable to assume that the association in the haplotypes may be caused by rs28665122. One previous study has reported the gender-specific effect of the SEPS1 polymorphism with coronary heart disease samples. The authors of that paper proposed that the gender differences might be a result of the differences in disease etiology or in hormonal milieu for men and women.20 Although the coronary heart disease involved in their study is different from ours, similar results in both studies indicated the gender-specific effect in the SEPS1 gene. In fact, it has been reported that gender is believed to be related to HT in several epidemiological studies.6 However, our data should be interpreted with caution, because confirmation in a larger sample that includes more male samples is necessary to properly detect whether possible gender-specific interactions exist or not.

In spite of some similarities in general association patterns between these studies, the possible reasons for these discrepancies mentioned above might be differences in ethnicity, genetic heterogeneity, type of polymorphism evaluated, sex and age of participants. However, our results are unlikely to be an artifact. First, significant signals were confirmed by both single SNP and haplotype-based analyses. Second, for Han Chinese, matching of the geographic population and genetic background could help us to avoid a potential population stratification problem.36 Thus, population stratification is an unlikely explanation, because we recruited all subjects from the same geographical population. Furthermore, similar conclusions were reached in two ethnic populations (Han Chinese and European) that reaffirmed the association observed from those studies. Notably, two potential limitations of our study should be kept in mind. One is that our study results need to be replicated and validated in other ethnic groups, because SNPs and haplotype structures can vary among different ethnic groups, and statistically significant results may occur by chance. The other is that we could not conclusively rule out some other potential confounding factors, such as smoking and alcohol use, and the possibility that there are some other genes that may affect the onset of HT. As is known, SEPS1 polymorphism can affect the circulating levels of pro-inflammatory cytokines. Furthermore, the human biological mechanism of inflammation is very complex and hundreds of genes may be involved in the process of inflammation. Therefore, a comprehensive analysis with all of these related genes may be significant in unraveling the effects of these polymorphisms in the pathogenesis of HT.

In the present study, our results suggested that the A allele may serve as a risk allele for HT. However, the molecular mechanism by which the SEPS1 gene has a role in the pathogenesis of HT remains unknown and needs to be elucidated. Previous studies have shown that SNP rs28665122 is associated with circulating levels of the inflammatory cytokines interleukin-1, interleukin-6 and tumor necrosis factor-α, and its minor allele was related to a decrease in SEPS1 gene expression. Impairment of SEPS1 is furthermore associated with increased interleukin-6 and tumor necrosis factor-α secretion in macrophages.10 The GG genotype of the SNP rs28665122 was also shown to be significantly correlated with thioredoxin reductase activity and with serum selenium; furthermore, the A allele induced lower selenoprotein S activity, while the G allele of this SNP seemed to be necessary to complete enough promoter activity in a stressful environment.37 This finding was consistent with the functional studies of SEPS1 polymorphisms mentioned above. Therefore, it is reasonable to hypothesize that A allele carriers may have a higher risk of developing HT, given the above-mentioned role of SEPS1 in inflammation. Another observation was that the significant SNP identified in our study is located in the promoter region of the SEPS1 gene, which indicated that this region may be a harbor for functionally relevant variants that may be of interest for future studies.

In conclusion, our work provides further supporting evidence for association of the SEPS1 gene with HT. Moreover, as an intriguing new insight into the pathogenesis of HT, we have also confirmed previous reports that suggest that the gene is responsible for having an important role in the etiology of HT. Given the complex patterns of findings from association studies focusing on HT and its underlying genetic heterogeneity and mechanism of chronic inflammation, further inquiries and wider replications are required, especially within different ethnic samples.