HEpD: A database describing epigenetic differences between Thoroughbred and Jeju horses
Introduction
DNA methylation at the cytosine of CpG dinucleotides is a biochemical modification that regulates gene expression in eukaryotic genomes. It is involved in many biological processes like genomic imprinting, development, and cell differentiation by epigenetic modification. Abnormal DNA methylation has been implicated in diseases including cancer (Bird, 2002, Robertson, 2005, Klose and Bird, 2006, Illingworth et al., 2008). With the advent of next generation sequencing (NGS) technologies, genome-wide DNA methylation data is now publicly available in many human disease samples. The ENCODE project provided researchers with the function of each human and mouse genomic region. In other species including economic animals, raw genome-wide DNA methylation data has been provided (Li et al., 2011, Sati et al., 2012), but these data are difficult to analyze and information is scattered across a wide number of sources. In order to overcome this deficiency, we have developed a DNA methylation database for Thoroughbred and Jeju horse cerebrum, lung, heart, and skeletal muscle tissues. The Horse Epigenetic Database (HEpD) provides the methylation peaks of each genomic region, and differentially methylated regions (DMRs) in four tissues of the two horse breeds.
Many bioinformatics tools provide methylation patterns for each genomic region in humans. Abnormal methylation patterns were confirmed by many databases based on the results of whole-genome methylation analysis (He et al., 2008, Hackenberg et al., 2011, Lv et al., 2012). Tissue-specific methylation patterns were also provided in humans (Xin et al., 2012, James et al., 2013). However, these databases concentrated solely on the human genome. Although genome-wide DNA methylation data in organisms, besides humans and plants, have been made accessible to the public, there have been few freely available databases until now.
Thoroughbred horses are intentionally selected for speed, stamina, and agility. Therefore, Thoroughbred horses have many specific genetic characteristics related to horse economic traits, like genotype, SNPs, and transcripts (McGivney et al., 2009, Hill et al., 2010a, Hill et al., 2010b, Park et al., 2012). Jeju horses are the descendants of 160 Mongolian horses that have inhabited and been bred since 1276 on Jeju Island, South Korea (Nam, 1969). Compared to the Thoroughbred, the Jeju horse is hardy, small, and often used for racing or as a food source. Analysis of the Jeju horse is valuable for identifying Jeju-specific traits and conservation of the Korean genetic resource. This study is to provide information about DMRs between four tissues as well as between two different horse breeds.
Transposable elements (TEs) make up approximately 30–50% of the mammalian genome. The expression of TEs induces genome instability, thus the host genome maintains DNA methylation in order to inhibit TE expression (Carnell and Goodman, 2003, Girardot et al., 2006). Variation of DNA methylation state in TE regions could induce a change in the genome defense system. Comparing DMRs between the TEs could provide interesting research topics. This database provides TE location and insertion direction along with their DNA methylation state.
Providing genomic information and constructing a web-based database of DNA methylation patterns is generally useful for performing further functional studies as well as for elucidating tissue-specific or breed-specific traits. This database provides the relatively highly-methylated regions in the whole genome, and an additional selection option helps to compare the methylation of two equine breeds in four tissues. We also provide the ‘Gene index’ option for easily comparing gene regions between tissues of the two equine breeds. In order to enhance the further studies of database users, we have linked our results pages to the related UCSC Genome Browser and Ensembl database for each genomic region. Results are presented as a web page or in Microsoft Excel format. The HEpD will provide insight for the functional study of methylation and potential roles for finding tissue-specific and breed-specific methylation patterns.
Section snippets
Sample information
We performed methylated DNA immunoprecipitation followed by next-generation sequencing (MeDIP-Seq) in four tissues (cerebrum, lung, heart, and skeletal muscle) derived from genomic DNA from each individual of Thoroughbred and Jeju horses (Lee et al., 2014) (Table S1). All animal protocols (ethical procedures and scientific care) used in this study were reviewed and approved by the Pusan National University-Institutional Animal Care and Use Committee (PNU-IACUC; Approval Number PNU-2013-0411).
Data source generation of datasets
A
Results and discussion
In MeDIP-Seq analysis, a total of 21 to 24 million reads from eight samples were sequenced, and these data size was produced as 1.08 to 1.2 Gb (Table S2). After filtration, high-quality reads were mapped and aligned against the reference genome (Table S3). In order to confirm the DNA methylation pattern, analysis of methylated peaks in MeDIP-Seq is essential. Therefore, HEpD provides the methylation peaks from primary MeDIP-Seq data formatted as *.csv files. Each dataset was converted to a MySQL
Conclusion
HEpD provides methylation data for four tissues from two horse breeds. This system helps one to select the highly-methylated genomic regions beyond a user-set threshold, presents the methylation state of a specific genomic region, compares the methylation state between two breeds, and uses a gene index for comparing the methylation state in a gene region. Users can see the detailed methylation state in tissues of two horse breeds displayed on the results page via four methods. Our work could
Acknowledgment
This work was supported by a grant from the Next Generation BioGreen 21 Program (No. PJ0081062011), Rural Development Administration, Republic of Korea.
References (26)
- et al.
Acute exercise remodels promoter methylation in human skeletal muscle
Cell Metab.
(2012) - et al.
Genomic DNA methylation: the mark and its mediators
Trends Biochem. Sci.
(2006) DNA methylation patterns and epigenetic memory
Genes Dev.
(2002)- et al.
The long (LINEs) and the short (SINEs) of it: altered methylation as a precursor to toxicity
Toxicol. Sci.
(2003) - et al.
NGSmethDB: an updated genome resource for high quality, single-cytosine resolution methylomes
Nucleic Acids Res.
(2014) - et al.
The insertion of a full-length Bos taurus LINE element is responsible for a transcriptional deregulation of the Normande Agouti gene
Pigment Cell Res.
(2006) - et al.
The effect of age on thoroughbred racing performance
J. Equine Sci.
(2010) - et al.
NGSmethDB: a database for next-generation sequencing single-cytosine-resolution DNA methylation data
Nucleic Acids Res.
(2011) - et al.
MethyCancer: the database of human DNA methylation and cancer
Nucleic Acids Res.
(2008) - et al.
A sequence polymorphism in MSTN predicts sprinting ability and racing stamina in thoroughbred horses
PLoS One
(2010)
Targets of selection in the Thoroughbred genome contain exercise-relevant gene SNPs associated with elite racecourse performance
Anim. Genet.
A novel CpG island set identifies tissue-specific methylation at developmental gene loci
PLoS Biol.
MethFinder — a software package for prediction of human tissue-specific methylation status of CpG islands
Bioinformation
Cited by (0)
- 1
These authors contributed equally to this work.