Data for identification of porcine X-chromosome inactivation center, XIC, by genomic comparison with human and mouse XIC

The data included in this article shows homologies of genes in porcine X-chromosome inactivation center, XIC, to each orthologue in human and mouse XIC. Open sequences of XIC-linked genes in human and mouse were compared to porcine genome and sequence homology of each orthologue to porcine genome was calculated. Sequence information of porcine genes encoded in the genomic regions having sequence homology with the human XIC-linked genes and their 2 Kb upstream regions were downloaded. Obtained information was used to design primer pairs for expression and methylation pattern analyses of XIC-linked genes in pigs. The data represented in here is related and applied to the research article entitled “Dosage compensation of X-chromosome inactivation center, XIC,-linked genes in porcine preimplantation embryos: Non-chromosome wide initiation of X-chromosome inactivation in blastocysts”, published in Mechanisms of Development Hwang et al., 2015 [1].


a b s t r a c t
The data included in this article shows homologies of genes in porcine X-chromosome inactivation center, XIC, to each orthologue in human and mouse XIC. Open sequences of XIC-linked genes in human and mouse were compared to porcine genome and sequence homology of each orthologue to porcine genome was calculated. Sequence information of porcine genes encoded in the genomic regions having sequence homology with the human XIClinked genes and their 2 Kb upstream regions were downloaded. Obtained information was used to design primer pairs for expression and methylation pattern analyses of XIC-linked genes in pigs. The data represented in here is related and applied to the research article entitled "Dosage compensation of X-chromosome inactivation center, XIC,-linked genes in porcine preimplantation embryos: Non-chromosome wide initiation of X-chromosome inactivation in blastocysts", published in Mechanisms of Development

Value of the data
The data is an example for representing severe sequence diversity of non-coding RNA genes in Xchromosome inactivation center, XIC, in eutherians.
Methods for calculating sequence homology and searching homolog region could provide proper strategies for identifying XIC in eutherians.
Current data could be used for comparative genomic analyses dealing evolution of XIC in eutherians.
Designed primer pairs could be efficiently used to analyze expression pattern and methylation pattern of XIC-linked genes and their CpG sites, respectively, in pigs.

Data
Data shared in the article is sequence homologies of human and mouse X-chromosome inactivation center, XIC, -linked genes to porcine genome and primer pairs for amplifying XIC-linked gene in pigs. And the article contains information such as distribution of CpG sites of four XIC-linked genes and primer pairs examining methylation pattern of the genomic regions in pigs.

Sequence homology of human and mouse XIC-linked genes to pig genome
Sequence homology of each gene encoded in human and mouse XIC to porcine genome was examined as follows. Genomic and transcript sequences of the human and mouse genes encoded in genomic region ranging between CDX4 and RLIM, which were determined to be the XIC synteny in this study, were downloaded from GenBank and aligned to the pig genome using BLAST. The coverage of individual blast hits (A) was calculated first as follows: The sequence homology rate of each blast hit (B) from alignment of one query was calculated by magnifying the sequence identity between blast hits and their searched counterpart region in pig: Homology rate of individual blast hit B ð Þ ¼ Qurey coverage of blast hit A ð Þ Â Identity After calculating the homology rate of each blast hit (B) from the alignment of one query, their sum was considered to be the homology rate of the query (C), which is a partial fragment of the genomic sequence or transcript sequence of human and mouse XIC-linked genes.

Homology rate of query C ð Þ ¼ Sum of Homology rate of individual blast hit B ð Þ
The homology rate of XIC-linked genes (D) of which partial fragment sequences were used for query, was calculated by averaging the homology rate of each query (C): Homologyrateofgene D ð Þ ¼ AverageofHomologyrateofquery C ð Þ Sequence homologies of human and mouse XIC-linked genes to porcine genome were calculated according to equations (Figs. 1 and 2, and Supplementary materials 1-4).

Primer pairs detecting porcine XIC-linked genes
Primer pairs for detecting and analyzing expression of the porcine XIC-linked genes were designed by using Primer 3 plus web-service [2]. Primer pairs for RT-PCR (Table 1) and Realtime PCR ( Table 2) was prepared one another. The lengths of RT-PCR products ranged approximately from 450 bp to 700 bp, except ACTB (131 bp), which were used for reference genes, and uncharacterized gene, LOC102166613 (115 bp). Primer pairs for Realtime PCR were designed to amplify PCR product ranging  in length between 90 bp and 183 bp. Designed primer pairs were validated by sequencing amplicons and used for expression analyses in the related study [1]. The primer pairs will be used for the expression analyses of porcine XIC-linked genes.

Distribution of CpG sites of XIC-linked genes and primer pairs for bisulfite sequencing
Distribution of CG dinucleotide were searched manually around the transcription starting sites, TSSs, of four XIC-linked genes, CHIC1, XIST, uncharacterized gene LOC102165544 and RLIM (Fig. 3) to design primer pairs for examining promoter methylation patterns of the genes. Primer pairs which could amplify the genomic region containing CG dinucleotide were prepared ( Table 3). The length of PCR product amplified by primer pairs targeting CpG sites of CHIC1, XIST, LOC102165544 and RLIM were 360 bp, 429 bp, 290 bp and 348 bp, respectively. And also, the numbers of CG dinucleotide in the four amplicon were 17, 13, 16, and 31, respectively. Primer pairs amplifying CpG sites of XIST gene was one of the primer set designed previously [3] and primers targeting genomic sites of remaining three genes were designed newly in here. Efficient working of the primer pairs was validated in the related study [1].