Abstract
Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. Ginseng is typically classified according to its cultivation environment, such as field-cultivated ginseng (FCG) and mountain-cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between FCG and MCG. In order to investigate the whole transcriptome landscape of ginseng, we employed high-throughput sequencing technologies using the Illumina HiSeqTM2500 system, and generated a large amount of transcriptome data from ginseng roots. Approximately 77 and 87 million high-quality reads were produced via FCG and MCG root transcriptome analysis, respectively, and we obtained 256,032 assembled unigenes with an average length of 1171 bp by de novo assembly methods. Functional annotation of the unigenes was performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes pathway database. A total of 4207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.
Similar content being viewed by others
Abbreviations
- P. ginseng :
-
Panax ginseng C.A. Meyer
- FCG:
-
Field-cultivated ginseng
- MCG:
-
Mountain-cultivated ginseng
- NGS:
-
Next-generation sequencing
- EST:
-
Expressed sequence tag
- CDS:
-
Coding sequence
- GO:
-
Gene Ontology
- DEG:
-
Differentially expressed genes
- KEGG:
-
Kyoto Encyclopedia of Genes and Genomes
References
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Alvord G, Roayaei J, Stephens R, Baseler MW, Lane HC, Lempicki RA (2007) The DAVID gene functional classification tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol 8:183
Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11:106
Birol I et al (2009) De novo transcriptome assembly with ABySS. Bioinformatics 25:2872–2877. doi:10.1093/bioinformatics/btp367
Blanco E, Parra G, Guigó R (2007) Using geneid to identify genes. Curr Protoc Bioinform. doi:10.1002/0471250953.bi0403s18
Choi D-W, Jung J, Im Ha Y, Park H-W, In DS, Chung H-J, Liu JR (2005) Analysis of transcripts in methyl jasmonate-treated ginseng hairy roots to identify genes involved in the biosynthesis of ginsenosides and other secondary metabolites. Plant Cell Rep 23:557–566
Choi Y-E, Kim Y-S, Yi M-J, Park W-G, Yi J-S, Chun S-R, Han S-S, Lee S-J (2007) Physiogical and chemical characteristics of field- and mountain-cultivated ginseng roots. J Plant Biol 50:198–205
Coruzzi G, Bush DR (2001) Nitrogen and carbon nutrient and metabolite signaling in plants. Plant Physiol 125:61–64
Dennis G Jr, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA (2003) DAVID: database for annotation, visualization, and integrated discovery. Genome Biol 4:P3
Götz S et al (2008) High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res 36:3420–3435
Grabherr MG et al (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652
Gutiérrez RA, Lejay LV, Dean A, Chiaromonte F, Shasha DE, Coruzzi GM (2007) Qualitative network models and genome-wide expression data define carbon/nitrogen-responsive molecular machines in Arabidopsis. Genome Biol 8:R7
Haas BJ et al (2013) De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis. Nat Protoc 8:1494–1512
Huang X, Madan A (1999) CAP3: a DNA sequence assembly program. Genome Res 9:868–877
Jung C-H, Seog H-M, Choi I-W, Cho H-Y (2005) Antioxidant activities of cultivated and wild Korean ginseng leaves. Food Chem 92:535–540
Kadota K, Nishiyama T, Shimizu K (2012) A normalization strategy for comparing tag count data. Algorithms Mol Biol 7:5. doi:10.1186/1748-7188-7-5
Kanehisa M, Goto S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27–30
Kim J-Y, Mahé A, Brangeon J, Prioul J-L (2000) A maize vacuolar invertase, IVR2, is induced by water stress organ/tissue specificity diurnal modulation of expression. Plant Physiol 124:71–84
Li B, Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12:323. doi:10.1186/1471-2105-12-323
Martin JA, Wang Z (2011) Next-generation transcriptome assembly. Nat Rev Genetics 12:671–682
Nam KY (2002) Clinical applications and efficacy of Korean ginseng (Panax ginseng C.A. Meyer. J Ginseng Res 26:111–131
O’Hara M, Kiefer D, Farrell K, Kemper K (1998) A review of 12 commonly used medicinal herbs. Arch Family Med 7:523–536
Pertea G et al (2003) TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19:651–652
Sun J, Nishiyama T, Shimizu K, Kadota K (2013) TCC: an R package for comparing tag count data with robust normalization strategies. BMC Bioinformatics 14:219. doi:10.1186/1471-2105-14-219
Takahashi H et al (2009) Pleiotropic modulation of carbon and nitrogen metabolism in Arabidopsis plants overexpressing the NAD kinase2 gene. Plant Physiol 151:100–113
Trapnell C et al (2010) Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 28:511–515. doi:10.1038/nbt.1621
Wang J-L, Turgeon R, Carr JP, Berry JO (1993) Carbon sink-to-source transition is coordinated with establishment of cell-specific gene expression in a C4 plant. Plant Cell 5:289–296
Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10:57–63
Yang Y, Smith SA (2013) Optimizing de novo assembly of short-read RNA-seq data for phylogenomics. BMC Genom 14:328. doi:10.1186/1471-2164-14-328
Yun TK, Lee YS, Lee YH, Kim SI, Yun HY (2001) Anticarcinogenic effect of Panax ginseng C.A. Meyer and identification of active compounds. J Korean Med Sci 16:6–18
Acknowledgements
We would like to thank Dr. Woo Jun Sul of the Department of Systems Biotechnology at Chung-Ang University for their constructive comments on the initial manuscript. We would also like to thank Dr. Junsu Ko of Theragen BiO Institute at TheragenEtex for RNA-SEq. library construction.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Yang, B.W., Hahm, Y.T. Transcriptome analysis using de novo RNA-seq to compare ginseng roots cultivated in different environments. Plant Growth Regul 84, 149–157 (2018). https://doi.org/10.1007/s10725-017-0328-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10725-017-0328-6