Skip to content
BY-NC-ND 3.0 license Open Access Published by De Gruyter Open Access December 16, 2016

Presence of sequence and SNP variation in the IRF6 gene in healthy residents of Guangdong Province

  • Wenli Wu , Liang Hua , Hongtao Wang EMAIL logo , Jiansuo Hao , Yiyang Chen , Fan Li and Jiayu Liu
From the journal Open Life Sciences

Abstract

Objective

This study was to investigate the single nucleotide polymorphism (SNP) in the interferon regulatory factor 6 (IRF6) gene in healthy residents of Guangdong Province, China, for further analysis of their associations with the development of cleft lip with or without palate (CL/P).

Methodology

DNA was extracted from blood samples of 13 healthy residents. IRF6 genes were sequenced and analyzed by alignment to the reference sequences in GenBank.

Results

The IRF6 genes containing 45.21% GC were 25016–25046 bp in length, and had 2215-bp exons and a 1404-bp coding region. There were 65 SNPs, including 58 SNPs in the 5′-untranslated region or introns and 7 SNPs in the exons. These sites had two alleles, including 39 transition sites and 26 transversion sites. Five novel SNPs were identified. c. 459G>T and c. 820G>A in the coding region represented silent and Val274Ile mutations, respectively. The SNPs made two sequences (GenBank HQ875393 and JF346417).

Conclusions

The sequences and SNP profiles of the IRF6 gene in healthy residents of Guangdong Province are consistent with the one in GenBank but with slight variations. Our findings form a basis for further investigation of the association between IRF6 gene polymorphisms and CL/P in residents of Guangdong Province.

1 Introduction

The interferon regulatory factor 6 (IRF6) gene is located on chromosome 1q32-41 and contains 10 exons. The coding region of IRF6 includes exons 3–8 and part of exon 9 [1]. The IRF6 protein belongs to the IRF transcription factor family. It contains two conserved regions, a helix-turn-helix DNA-binding domain and a Smad interferon regulatory (SMIR) factor protein binding domain [2].

The IRF6 gene is the most studied candidate that causes cleft lip with or without palate (CL/P), and the IRF6 protein has been considered a key element for oral and maxillofacial development [3]. In the process of embryonic palate fusion, the levels of IRF6 expression at the edge of the palatal midline are high [1, 4]. In addition, high levels of IRF6 expression can be found in hair follicles, palate wrinkles, teeth, the thyroglossal duct, external genitalia, and the body skin [1]. It has been shown that an IRF6 mutation can cause Van der Woude syndrome (VWS) and popliteal pterygium syndrome (PPS) [1]. Rahimov et al. compared the conserved sequences of the IRF6 gene across multiple species and have shown the presence of the single nucleotide polymorphism (SNP) rs642961, G>A on an enhancer of the IRF6 gene. This SNP can alter the binding of TFAP2A to the IRF6 gene and is associated with the incidence of a simple cleft lip [5]. In addition, some scholars believe that this IRF6 gene polymorphism is significantly correlated with the development of nonsyndromic cleft lip with or without palate (NSCL/P). Variation of the IRF6 gene accounts for 12% of the genetic factors that cause CL/P. The probability of recurrence of CL/P in a family with the IRF6 gene mutation is three times more likely than in normal people [2].

A study by Pan et al. has shown that the rs642961 A and rs2235371 C alleles of the IRF6 gene increase the risk for the development of nonsyndromic orofacial clefts [6]. In contrast, a study by Paranaiba et al. has shown that these IRF6 gene polymorphisms (rs642961 and rs2235371) are not associated with NSCL/P in a Brazilian population [7]. The IRF6 rs642961 polymorphism gene locus is associated with NSCL +/- P in a northern Chinese population [8], Asians, and Caucasians [9]. It can even be considered as a marker of CL/P severity [10]. However, the IRF6 rs642961 polymorphism is not a risk factor for NSCL/P in an Iranian population [11]. These studies suggest that the association of IRF6 genetic variation with the development of CL/P depends on ethnicity. CL/P is an incomplete dominant disease with genetic heterogeneity. Guangdong Province is representative of the southern provinces of China. Its unique geography and climate result in distinct characteristics in many diseases, including CL/P in this region. The IRF6 gene sequences of healthy residents of Guangdong Province can be used to study the association between IRF6 gene variation and CL/P in people from this region. Therefore, the aim of this study was to investigate the presence of sequence and SNP variation in the IRF6 gene in healthy residents of Guangdong Province. This is the first study to investigate the sequences and SNP characteristics of the IRF6 gene in healthy residents of Guangdong Province.

2 Methods

2.1 Ethics statement

This study was conducted in accordance with the principles stated in the Declaration of Helsinki and was approved by the Guangzhou Women and Children’s Medical Center Ethics Committee. Written informed consent was obtained from each subject.

2.2 Sample Collection

Peripheral venous blood samples (3 mL) were collected from 13 healthy people, including 7 males and 6 females, who had fasted overnight. The criteria for selecting healthy subjects for this study were as follows: (1) All subjects belonged to the Han ethnic group and were native to Guangdong Province. There was no kinship among them. (2) No family history of CL/P. (3) No other congenital disease or malformation, or major organ disease.

Blood samples were stored at 4°C in tubes containing the anticoagulant EDTA-K2 (Huasheng Biotechnology Co., Ltd., Beijing). Genomic DNA was extracted from white blood cells using a kit for whole blood genomic DNA extraction (Xinjie Biotechnology Co., Ltd. Hangzhou), according to the manufacturer’s protocol.

2.3 Primer design and IRF gene amplification

A total of 26 pairs of primers (Table 1) for IRF gene sequencing were designed using Primer 5.0, according to the IRF6 gene sequence in GenBank (ACCESSION: NG_007081 VERSION: NG_007081.1, total 25218 bp). A 400–600-bp DNA region could be effectively sequenced using these primers. A total of seven primer pairs (Table 2) for sequencing the coding region were designed. The primers were synthesized by Shanghai Sangon Company.

Table 1

PCR primers for amplification of the IRF6 genomic sequence.

No.PrimersPrimer Positions (GenBank: NG 007081.1)Amplicon (bp)
1F: TCTGAAATCGACACATTGTC44–631304
R: GTGGGACGTACATTTCAGCA1328–1347
2F: CTTTGGGAGGAGTCCCAGTT1121–11401298
R: AGAGTCAAGTGCCCTGGATG2399–2418
3F: GGCCGCAGTTTTCTCATCTA2221–2240675
R: GCCTCTGTCTCAACCACTGC2876–2895
4F: CCTTACCCCTCATCCCTCTC2706–27251204
R: AGCTTTTTAGGCCCTGGTGT3890–3909
5F: TACTGTTGCACAGCCTCCTG3587–36061120
R: CTTCACGGCCCATAATTACC4687–4706
6F: GATGAGGAAACTGAAACCAAGA4450–44711280
R: TAGCTCCGCTCCTTGGAAAT5710–5729
7F: CGTCAACAACCCCACTATCC5463–54821298
R: TCTGCGCACCAAGAAAGTAA6741–6760
8F: GACACAGGCCTTACATTTTGC6524–65441280
R: CTGTGTTTGGGCCTGTCAGA7785–7804
9F: TCCTTCCATGGGGCTGAG7536–75531280
R: GAGCCAGCATGGGTAAAGTC8796–8815
10F: TTCCAGGCATCTCATTCTCA8591–86101289
R: TCCTCTTCTTGTTGAGGGCTA9859–9879
11F: GAAAGGTGGCTGGGAAGG9642–96591252
R: C AC AAC AAGCCTGTCCACCT10874–10893
12F: GTGATCCTCCCTCCTTAGCC10597–106161295
R: GCCAGGTCTTCCAGTTCTTG11872–11891
13F: ACCAAGCTCTGATGGCCTTT11615–116341295
R: GCCTAACTGCTGCTCCATTC12890–12909
14F: CTCTGGTTCTGGGTTGGTCT12630–12649602
R: CAACGTTTTCAAGCCAACAG13212–13231
15F: AAATGGAGGGACCCAAATCT12979–129981259
R: GCACCAAAGTGCAGATGGTA14218–14237
16F: CTGCACCCAGGTTACAATTC14033–140521194
R: AAATAGGAAAGCTGAGATCCCATA15203–15226
17F: GCCTCTTTTGGGATTCAAGA15044–150631325
R: GCTGGATTCAAAAGCCCATA16349–16368
18F: TTTTCGAGTTACGGGCAAAC16184–162031218
R: AAGGAGGCCCCAGTGTCTAT17382–17404
19F: CCATCAATGGTGGACTGGAT17157–171761236
R: TTGGCCCCAGCAATTAAATA18373–18392
20F: TTCTTTCCCTGGAGGAGTCA18204–182231213
R: TAACCTGGAGCTGGAGCTGT19397–19416
21F: CCCATGGTCCCTGTTACTTG1920–6192251215
R: CTCAGGACCTGGGAATTT20403–20420
22F: ACAGGGATGAACAGGCAGAG20188–202071239
R: CCATTCTTCCCCAAAGCATA21407–21426
23F: GCACATGTGGCTAGTGGCTA21244–212631358
R: ATCCTTGATGTCTGGGGTTG22582–22601
24F: CAGAATGGGGTCTTCCTCAG22376–223951392
R: TTCGTGCTAGGTGAGCCTTT23748–23767
25F: GGAGAGGGGGTAGGTGACAG23525–235441249
R: TTGGCACTTTTCCAATACCC24754–24773
26F: CCAGCTTCGAGGAAAAGTTG24474–24493729
R: CCAACACTTGCTCTCCCAGT25183–25202
Table 2

PCR primers for amplification of the IRF6 coding sequence.

ExonPrimers(5′→3′)Primer positions (GenBank: NG_007081.1Amplicon (bp)
3E3F: GAAAGGTGGCTGGGAAGG9642–96591252
E3R: CACAACAAGCCTGTCCACCT10874–10893
4E4F: CTGCACCCAGGTTACAATTC14033–140521194
E4R: AAATAGGAAAGCTGAGATCCCATA15203–15226
5E5F: GCCTCTTTTGGGATTCAAGA15044–150631325
E5R: GCTGGATTCAAAAGCCCATA16349–16368
6E6F: TTCTTTCCCTGGAGGAGTCA18204–182231213
E6R: TAACCTGGAGCTGGAGCTGT19397–19416
7E7F:ACAGGGATGAACAGGCAGAG20188–202071239
E7R: CCATTCTTCCCCAAAGCATA21407–21426
8E8F: GCACATGTGGCTAGTGGCTA21244–212631358
E8R: ATCCTTGATGTCTGGGGTTG22582–22601
9E9F: CAGAATGGGGTCTTCCTCAG22376–223951392
E9R: TTCGTGCTAGGTGAGCCTTT23748–23767

PCR amplification for genomic DNA and coding region sequencing were carried out in 20 μL and 25 μL volumes, respectively, containing 2 x LA Taq buffer (5 mM Mg2+), 2.5 mM dNTP mixture, 2.5 units of LA Taq DNA polymerase (TaKaRa), and 100 ng of genomic DNA. The final concentrations of each primer were 10 μM and 1 μM, respectively. The PCR was performed using a T GRADIENT PCR instrument (Biometra, Germany) with the following program: 94°C for 5 min; 35 cycles of 94°C for 40 s, 58–60°C for 50 s, and 72°C for 90 s; and a final extension at 72°C for 10 min.

2.4 IRF gene sequencing and analysis

The PCR product (3 μL) was taken and analyzed using 0.8% agarose gel electrophoresis on DYY-4C and DYCP-31DN systems (Beijing, China). The target products were purified with a GeneJETTM Gel Extraction Kit (Fermentas, Thermo Scientific, USA), according to the manufacturer’s protocol, and then used for sequencing on an ABI3730XL instrument (USA) at Shanghai Megiddo Biological Pharmaceutical Co., Ltd. The sequencing results were aligned with the IRF6 gene sequence in GenBank (ACCESSION: NG_007081 VERSION: NG_007081.1) (http://www.ncbi.nlm.nih.gov/nuccore/160358359). CExpress software was used to assemble the IRF6 gene sequences. On-line analysis software NetGene2 (http://www.cbs.dtu.dk/services/NetGene2/) and Splice View (http://zeus2.itb.cnr.it/~webgene/wwwspliceview_ex.html) were used to analyze the sequencing results, including splice site analysis and coding region prediction (initiation codon ATG, and termination codons TAG, TGA, and TAA).

2.5 Analysis of IRF6 SNP

Pertinent IRF6 SNP profiles were obtained through http://www.ncbi.nlm.nih.gov/SNP/. SNPs of the IRF gene sequences in healthy residents of Guangdong Province were analyzed using MegAlign software (DNAstar V7.10).

3 Results

3.1 Make-up of IRF6 genes

To determine the presence of sequence and SNP variation in the IRF6 gene in healthy residents of Guangdong Province, we sequenced the genomic and the coding regions of the IRF6 gene in white blood cell samples. The results showed that the IRF6 gene was 25016–25046 bp in length, with 45.21% GC content. The exons were 2215 bp in length, and the coding region was 1404 bp. There was a slight difference in the length of the introns due to the distinct length of the DNA repeat (Fig. 1).

Figure 1 Exons and introns in the IRF6 gene. Box and number in it: exon and its position in the mRNA coding sequence. Solid line: intron. Double vertical lines: the downstream DNA was not sequenced. DNA BD: DNA-binding domain. PBD: protein binding domain. SMIR: Smad interferon regulatory factor. This figure is drawn according to the information on the IRF6 gene in GenBank (Accession Numbers: HQ875393 and JF346417) and our sequencing results in this study.
Figure 1

Exons and introns in the IRF6 gene. Box and number in it: exon and its position in the mRNA coding sequence. Solid line: intron. Double vertical lines: the downstream DNA was not sequenced. DNA BD: DNA-binding domain. PBD: protein binding domain. SMIR: Smad interferon regulatory factor. This figure is drawn according to the information on the IRF6 gene in GenBank (Accession Numbers: HQ875393 and JF346417) and our sequencing results in this study.

3.2 SNP profiles of the IRF6 gene

We analyzed the SNPs of the IRF6 gene sequences in healthy residents of Guangdong Province by alignment using MegAlign software (DNAstar V7. 10). The results showed the presence of 65 SNPs in the IRF6 genes of 13 healthy residents of Guangdong Province (Table 3), including 18 SNPs in 5′UTR and 10, 14, 1, 5, 3, 2, and 5 SNPs in introns 1, 3, 4, 5, 6, 7, and 8, respectively. There were a total of 7 SNPs in exons 2, 5, 7, 9, and 10. Two SNPs were found in the coding region. All variable sites had two alleles, including 39 transition sites (22 C/T and 17 G/A) and 26 transversion sites (7 C/A, 6 G/T, 10 C/G, and 3 A/T).

Table 3

SNPs in healthy residents of Guangdong Province.

No.SNP IDPositionLocation (NG_007081)Allele
a1a2a1:a2
1rs20690685′UTRg. 467CA9/4
2rs111193475′UTRg. 564AG11/2
3rs111193465′UTRg. 580GA11/2
4rs7640935′UTRg. 1149TC10/3
5rs170152595′UTRg. 1557CT10/3
6rs170152555′UTRg. 1742TC11/2
7rs6327465′UTRg. 1965GA8/5
8rs66593675′UTRg. 2072CA11/2
9rs66968255′UTRg. 2108TC11/2
10rs6309845′UTRg. 2187CT8/5
11rs65405605′UTRg. 2407GA11/2
12rs65405595′UTRg. 2455CT8/5
13rs10052875′UTRg. 3723CT8/5
14rs23572295′UTRg. 3991CA8/5
15rs66595495′UTRg. 4325AC9/4
16rs8468085′UTRg. 4380AG12/1
17rs75455425′UTRg. 4845GA9/4
18rs75455385′UTRg. 4867GC11/2
19rs12403599Intron 1g. 5466CG7/6
20rs682461Intron 1g. 5527GC0/13
21rs17015250Intron 1g. 5703AC10/3
22rs3753517Intron 1g. 6382GA12/1
23rs3753518Intron 1g. 6516CG11/2
24rs12405750Intron 1g. 6636GA11/2
25rs861020Intron 1g. 7369TC3/10
26rs2073487Intron 1g. 7834AG11/2
27rs2073486Intron 1g. 8265CT7/6
28rs2235377Intron 1g. 9088AG11/2
29rs861019Exon 2g. 9094TC8/5
30rs622832Intron 3g. 10248TG3/10
31rs1160411Intron 3g. 10558AG6/7
32rs2294408Intron 3g. 10931CT7/6
33rs4844494Intron 3g. 12282CT12/1
34rs2005982Intron 3g. 12405CT4/9
35rs2236909Intron 3g. 12825TC7/6
36rs2236908Intron 3g. 12840CG7/6
37rs2236907Intron 3g. 12852GT7/6
38rs2236906Intron 3g. 12995AC7/6
39rs58637469Intron 3g. 13000AT11/2
40rs59043219Intron 3g. 13870CT7/6
41rs7555285Intron 3g. 14125CG4/9
42rs17015226Intron 3g. 14384GC10/2
43rs7552506Intron 3g. 14578CG11/2
44rs17015224Intron 4g. 15562CT12/1
45rs2013162Exon 5g. 15796GT7/6
46rs6685182Intron 5g. 16161TG2/11
47rs79307481Intron 5g. 16440GA10/3
48rs75594643Intron 5g. 16829TG10/3
49rs2179254Intron 5g. 17100AG3/10
50rs595918Intron 5g. 17637TC8/5
51rs17015217Intron 5g. 17851CT12/1
52rs2235375Intron 6g. 18893CG7/6
53rs926348Intron 6g. 19197CG12/1
54rs674433Intron 6g. 19605CA4/9
55rs2235371Exon 7g. 20400GA9/4
56rs2235373Intron 7g. 20677CT7/6
57rs2073485Intron 8g. 21686CT12/1
58rs7522250Intron 8g. 21941AG10/3
59rs614662Intron 8g. 22061GA8/5
60rs926346Intron 8g. 22276TA10/2
61rs742215Intron 8g. 23457AT12/1
62rs742214Exon 9g. 23555AG7/6
63rs599021Exon 9g. 23558TG6/7
64rs2235372Exon 9g. 24044CT7/6
65rs680331Exon 10g. 24608CT3/10

Five SNPs that had not been reported previously were observed. At site g. 3420 located in 5′ UTR, the sequence TACAAA was missing in eight samples, aligned to the GenBank sequence that was found in five samples (Fig. 2). At site g. 4475 located in 5′ UTR, the sequence CAAGAAGC was missing in four samples, aligned to the GenBank sequence that was found in nine samples (Fig. 3). At site g. 5527 located in intron 1, -76+339G>C (rs682461) was found in 13 samples aligned to the GenBank sequence (Fig. 4). At site g. 12951–12965 located in intron 3, the sequence TCAAACTTCTTCATT was replaced with C in one sample, aligned to the GenBank sequence with g. 12951–12965 TCAAACTTCTTCATT in seven samples (Fig. 5A and 5B). No clearly separated peak downstream of T was observed in five samples (Fig. 5C). At site g. 21778 located in intron 8, the sequence CACACA was missing in five samples, aligned to the GenBank sequence that was found in eight samples (Fig. 6).

Figure 2 SNP at site g. 3420 located in 5′UTR. (A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) found in five samples. (B) The sequence TACAAA is missing in eight samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.
Figure 2

SNP at site g. 3420 located in 5′UTR. (A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) found in five samples. (B) The sequence TACAAA is missing in eight samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.

Figure 3 SNP at site g. 4475 located in 5′UTR.(A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) found in nine samples. (B) The sequence CAAGAAGC is missing in four samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.
Figure 3

SNP at site g. 4475 located in 5′UTR.(A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) found in nine samples. (B) The sequence CAAGAAGC is missing in four samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.

Figure 4 SNP at site g. 5527 located in intron 1. Aligned with GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1), the mutation-76+339G>C (rs682461) was found in 13 samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.
Figure 4

SNP at site g. 5527 located in intron 1. Aligned with GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1), the mutation-76+339G>C (rs682461) was found in 13 samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.

Figure 5 SNP at site g. 12951–12965 located in intron 3.(A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) with g. 12951-12965 TCAAACTTCTTCATT in seven samples. (B) Replacement of TCAAACTTCTTCATT with C in one sample. (C) No clearly separated peak downstream of T was observed in five samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.
Figure 5

SNP at site g. 12951–12965 located in intron 3.(A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) with g. 12951-12965 TCAAACTTCTTCATT in seven samples. (B) Replacement of TCAAACTTCTTCATT with C in one sample. (C) No clearly separated peak downstream of T was observed in five samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.

Figure 6 SNP at site g. 21778 located in intron 8. (A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) found in eight samples. (B) The sequence CACACA is missing in five samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.
Figure 6

SNP at site g. 21778 located in intron 8. (A) The GenBank sequence (ACCESSION: NG_007081 VERSION: NG_007081.1) found in eight samples. (B) The sequence CACACA is missing in five samples. DNA samples were extracted from white blood cell samples. The target region in the IRF6 gene was amplified using PCR. The PCR product was sequenced by Shanghai Megiddo Biological Pharmaceutical Co., Ltd.

The results also showed the presence of two different sequences through the alignment of the SNPs in the IRF6 genes of 13 samples. Most of the SNP sites (48/65) were located in these two different sequences. They were submitted to GenBank with access numbers HQ875393 (25046 bp) and JF346417 (25016 bp).

3.3 SNP sites in the coding region of the IRF6 gene

Two mutation sites in the coding region were identified to be c. 459G>T and c. 820G>A. Transversion c. 459G>T occurred in the third nucleotide of the codon, e. g., TCG → TCT. Both TCG and TCT encode serine. Therefore, it is a silent mutation and there was no change in the protein sequence. However, the transition c. 820G>A occurred in the first nucleotide of the codon, e. g., GTC → ATC, leading to replacement of valine by an isoleucine (p. Val274Ile), which is a missense mutation.

4 Discussion

In order to study the association between IRF6 gene variation and CL/P in Guangdong Province, in the current study, we investigated the presence of sequence and SNP variation in IRF6 genes in healthy residents of Guangdong Province. We found that the IRF6 gene is 25016–25046 bp in length, with 45.21% GC content. The exons are 2215 bp in length, and the coding region is 1404 bp. There is a slight difference in the length of the introns due to the distinct length of the DNA repeat. The lengths of these sequences are consistent with the IRF6 sequence (ACCESSION: NG_007081 VERSION: NG_007081.1, 25218 bp) in GenBank. Moreover, our results are consistent with those reported by Ben et al. [12].

This is the first study to investigate the sequences and SNP characteristics of the IRF6 gene in healthy residents of Guangdong Province. A total of 65 SNPs were found in the IRF6 gene. Most of them (48/65) are located in two different sequences. Furthermore, there is a slight difference in the length of the introns due to the distinct length of the DNA repeat. It is likely that there are two distinct IRF6 gene sequences (HQ875393 and JF346417, GenBank) in healthy residents of Guangdong Province. Whether there is any difference in the sequences and functions of mRNA and protein from these two distinct IRF6 gene sequences, and whether they are associated with NSCL/P remain to be investigated.

Most SNPs found in the IRF6 gene sequences of healthy residents of Guangdong Province in the current study are NCBI dbSNP (http://www.ncbi.nlm.nih.gov/SNP/snpblastByChr.html). While some well-studied SNPs such as rs642961 were not found in the healthy residents of Guangdong Province in the current study, some of the identified 65 SNPs have been reported to be predictors for risk of NSCL/P. For example, rs2235371 (g. 20400) AA and GA have lower risks of NSCL/P than GG genotypes in Asians, but these genotypes are not significant in Caucasians. rs2013162 (g. 15796) AA carriers have a significantly lower risk than CC carriers for Caucasians but not for Asians [9]. The IRF6 rs2235371 G/A haplotype increases the risk for oral cleft in a Brazilian population [13]. Genetic polymorphisms rs7545538 (g. 4867), rs2235377 (g. 9088), rs2235371 (g. 20400), and rs2235373 (g. 20677) of the IRF6 gene are associated with an increased risk of nsCLP in a Xinjiang Uyghur population [14]. In addition to the known SNPs, five SNPs in healthy residents of Guangdong Province, at sites g. 3420, g. 4475, g. 5527, g. 12951–12965, and g. 21778, have not been reported previously. These SNPs are located in 5′ UTR (2 SNPs) and introns 1, 3, and 8, respectively (Fig. 26). All variable sites have two alleles. Whether these newly found SNPs are related to CL/P remains to be investigated in a future study.

There are two SNPs in the coding region of the IRF6 gene in healthy residents of Guangdong Province: c. 459G>T, a silent mutation, and c. 820G>A. These mutations are different from the mutations c. 145C>T (p. Q49X), c. 171T>G (p. F57L), and 1306C>G (p. L436V), which were found in a Thai family with VWS [15]; the mutations c. 251G>T (R84L) and c. 1271C>T (S424L), which were discovered in a Japanese family with PPS [16]; and the frameshift mutations c. 568delG, c. 295G>A, and c. 1219T>C, which were observed in a Pakistani family with VWS [17]. However, the c. 820G>A mutation found in the residents of Guangdong Province results in the replacement of a valine residue with an isoleucine residue. This site is located in the SMIR domain and is commonly seen in 3% of healthy European and 22% of healthy Asian populations [1]. In addition, this mutation has been shown to be associated with NSCL/P in a western Chinese population [18, 19]. We speculate that this missense mutation is probably related to nonsyndromic CL/P in Guangdong Province.

A limitation of this study is the sample size, which may cause a biased result for the population of Guangdong Province. The precise frequency of genotypes, haplotypes, and association with nonsyndromic CL/P should be further investigated by including more samples.

5 Conclusion

The sequences and SNP profiles of the IRF6 gene in healthy residents of Guangdong Province are consistent with the one in GenBank but with slight variations. Our findings form a basis for further investigation of the association between IRF6 gene polymorphisms and CL/P in residents of Guangdong Province.


These authors contributed equally to this work.


List of abbreviations

SNP

single nucleotide polymorphism

UTR

5′-untranslated region (UTR)

IRF6

The interferon regulatory factor 6

SMIR

Smad interferon regulatory

  1. Conflict of interest statement: Authors declare no conflicts of interest.

  2. Author contributions: Hongtao Wang designed the study, conducted all searches, appraised all potential studies, and wrote and revised the draft manuscript and subsequent manuscripts. Wenli Wu and Liang Hua revised the draft manuscript and subsequent manuscripts. Jiansuo Hao and Yiyang Chen assisted with the presentation of findings and assisted with drafting and revising the manuscript. Fan Li and Jiayu Liu conceived and designed the study, assisted with searches, appraised relevant studies, and assisted with drafting and revising the manuscript. All authors read and approved the final manuscript.

References

[1] Kondo S., Schutte B.C., Richardson R.J., Bjork B.C., Knight A.S., Watanabe Y., et al., Mutations in IRF6 cause Van der Woude and popliteal pterygium syndromes, Nat. Genet., 2002, 32, 285-910.1038/ng985Search in Google Scholar PubMed PubMed Central

[2] Zucchero T.M., Cooper M.E., Maher B.S., Daack-Hirsch S., Nepomuceno B., Ribeiro L., et al., Interferon regulatory factor 6 (IRF6) gene variants and the risk of isolated cleft lip or palate, N. Engl. J. Med., 2004, 351, 769-8010.1056/NEJMoa032909Search in Google Scholar PubMed

[3] Bailey C.M., Hendrix M.J., IRF6 in development and disease: a mediator of quiescence and differentiation, Cell Cycle, 2008, 7, 1925-3010.4161/cc.7.13.6221Search in Google Scholar PubMed PubMed Central

[4] Knight A.S., Schutte B.C., Jiang R., Dixon M.J., Developmental expression analysis of the mouse and chick orthologues of IRF6: the gene mutated in Van der Woude syndrome, Dev. Dyn., 2006, 235, 1441-710.1002/dvdy.20598Search in Google Scholar PubMed

[5] Rahimov F., Marazita M.L., Visel A., Cooper M.E., Hitchler M.J., Rubini M., et al., Disruption of an AP-2alpha binding site in an IRF6 enhancer is associated with cleft lip, Nat. Genet., 2008, 40, 1341-710.1038/ng.242Search in Google Scholar PubMed PubMed Central

[6] Pan Y., Ma J., Zhang W., Du Y., Niu Y., Wang M., et al., IRF6 polymorphisms are associated with nonsyndromic orofacial clefts in a Chinese Han population, Am. J. Med. Genet. A, 2010, 152A, 2505-1110.1002/ajmg.a.33624Search in Google Scholar PubMed

[7] Paranaiba L.M., Bufalino A., Martelli-Junior H., de Barros L.M., Graner E., Coletta R.D., Lack of association between IRF6 polymorphisms (rs2235371 and rs642961) and non-syndromic cleft lip and/or palate in a Brazilian population, Oral Dis., 2010, 16, 193-710.1111/j.1601-0825.2009.01627.xSearch in Google Scholar PubMed

[8] Tian J., Wei Y., Wang L., Guo J., Zhou Q., Zhu W., [Relationship between genetic polymorphisms of IRF6 rs642961 and nonsysdromic cleft lip with or without cleft palate], Wei Sheng Yan Jiu, 2015, 44, 543-8 (in Chinese)Search in Google Scholar

[9] Wattanawong K., Rattanasiri S., McEvoy M., Attia J., Thakkinstian A., Association between IRF6 and 8q24 polymorphisms and nonsyndromic cleft lip with or without cleft palate: Systematic review and meta-analysis, Birth Defects Res. A Clin. Mol. Teratol., 2016, 106 (9), 773-8810.1002/bdra.23540Search in Google Scholar PubMed PubMed Central

[10] Kerameddin S., Namipashaki A., Ebrahimi S., Ansari-Pour N., IRF6 Is a Marker of Severity in Nonsyndromic Cleft Lip/Palate, J. Dent. Res., 2015, 94, 226S-32S10.1177/0022034515581013Search in Google Scholar PubMed

[11] Nouri N., Memarzadeh M., Carinci F., Cura F., Scapoli L., Jafary F., et al., Family-based association analysis between nonsyndromic cleft lip with or without cleft palate and IRF6 polymorphism in an Iranian population, Clin. Oral. Investig., 2015, 19, 891-410.1007/s00784-014-1305-3Search in Google Scholar PubMed

[12] Ben J., Jabs E.W., Chong S.S., Genomic, cDNA and embryonic expression analysis of zebrafish IRF6, the gene mutated in the human oral clefting disorders Van der Woude and popliteal pterygium syndromes, Gene Expr. Patterns, 2005, 5, 629-3810.1016/j.modgep.2005.03.002Search in Google Scholar PubMed

[13] de Souza L.T., Kowalski T.W., Ferrari J., Monlleo I.L., Ribeiro E.M., de Souza J., et al., Study of IRF6 and 8q24 region in non-syndromic oral clefts in the Brazilian population, Oral Dis., 2016, 22, 241-510.1111/odi.12432Search in Google Scholar PubMed

[14] Mijiti A., Ling W., Guli, Moming A., Association of single-nucleotide polymorphisms in the IRF6 gene with non-syndromic cleft lip with or without cleft palate in the Xinjiang Uyghur population, Br. J. Oral Maxillofac. Surg., 2015, 53, 268-7410.1016/j.bjoms.2014.12.008Search in Google Scholar PubMed

[15] Yeetong P., Mahatumarat C., Siriwan P., Rojvachiranonda N., Suphapeetiporn K., Shotelersuk V., Three novel mutations of the IRF6 gene with one associated with an unusual feature in Van der Woude syndrome, Am. J. Med. Genet. A, 2009, 149A, 2489-9210.1002/ajmg.a.33048Search in Google Scholar PubMed

[16] Matsuzawa N., Kondo S., Shimozato K., Nagao T., Nakano M., Tsuda M., et al., Two missense mutations of the IRF6 gene in two Japanese families with popliteal pterygium syndrome, Am. J. Med. Genet. A, 2010, 152A, 2262-710.1002/ajmg.a.33338Search in Google Scholar PubMed

[17] Malik S., Kakar N., Hasnain S., Ahmad J., Wilcox E.R., Naz S., Epidemiology of Van der Woude syndrome from mutational analyses in affected patients from Pakistan, Clin. Genet., 2010, 78, 247-5610.1111/j.1399-0004.2010.01375.xSearch in Google Scholar PubMed

[18] Tang W., Du X., Feng F., Long J., Lin Y., Li P., et al., Association analysis between the IRF6 G820A polymorphism and nonsyndromic cleft lip and/or cleft palate in a Chinese population, Cleft Palate Craniofac. J., 2009, 46, 89-9210.1597/07-131.1Search in Google Scholar PubMed

[19] Huang Y., Wu J., Ma J., Beaty T.H., Sull J.W., Zhu L., et al., Association between IRF6 SNPs and oral clefts in West China, J. Dent. Res,. 2009, 88, 715-810.1177/0022034509341040Search in Google Scholar PubMed PubMed Central

Received: 2016-1-8
Accepted: 2016-10-6
Published Online: 2016-12-16
Published in Print: 2016-1-1

© 2016 Wenli Wu et al.

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Downloaded on 19.5.2024 from https://www.degruyter.com/document/doi/10.1515/biol-2016-0062/html
Scroll to top button