Single molecule real-time sequencing data sets of Hypericum perforatum L. cell suspension and shoot cultures
Here, we report, for the first time, the establishment of a robust reference library of H. perforatum using single-molecule real-time (SMRT) sequencing. Transcripts with an average length of 2 kb were obtained from high-quality RNA extracted from cell suspension cultures and plantlets. Sequencing of cell suspension cultures yielded more than 33,000 high-quality transcripts from 20 Gb of raw data, while sequencing of plantlets yielded more than 55,000 high-quality transcripts from 35 Gb of raw data. Alternative splicing events and repeat sequences, including transposable elements and simple repeats, were identified. The PacBio library was validated by comparative expression analysis, in which Illumina-based transcripts from cell suspension cultures and plants were mapped onto the generated library, and by quantitative real-time PCR analysis of genes predicted to participate in hypericin biosynthesis.