The transcriptome data from the leaves of four Papaver species captured at the plant's three developmental life cycles

The plants in the Papaver genus are widely known as Poppies, which is used for ornamental and medicinal purposes, to utilize its plants derived alkaloids and attractive flowers. From this genus, we have sequenced the transcriptomes of four species's (Papaver rhoeas (two cultivar), Papaver nudicaule (five cultivar), Papaver fauriei, and Papaver somniferum) leaves at three developmental stages (i.e., leaf rosette (30 days), elongation and branching (60 days), and blossom and seed formations (90 days)), to elucidate the secondary metabolite biosynthesis gene expression profiles at respective plant stages.


Data
The dataset present in this article is a transcriptome from the leaves of four Papaver species and its subspecies classified upon their flower colour, as shown in Fig. 1. The tables in this article are as follows: Table 1 explains the sampling time points of Papaver plant from its three different growth stages, and Table 2 explains the quality of the transcriptome data and the sequences mapped to the draft genome and the reference transcriptome. Totally, 590 Gb of transcriptome sequences are generated from 84 sequence libraries (i.e., 28 sampling points with three biological replicates) using Illumina Hi-Seq 4000 equipment and 481 Mb of long reads from 2 libraries using PacBio, iso-seq method. Among those, the short reads, 568.4 GB (96.2%) of bases remained after the pre-processing, as explained in the previous articles [1,2]. Complete reference transcriptome has been employed for the de-novo transcriptome assemblies, as explained in the previous articles [1,2]. Further, the pre-processed reads are mapped to the transcript references, which were obtained from the de-novo assemblies [1,2] and Papaver somniferum draft genome [3]. The coverage of sequence transcriptome is 77X per sample, which was calculated with the reference of transcripts obtained from the draft genome of Papaver somniferum. Part of this transcriptomic data was assessed to catalogue the available secondary metabolite biosynthesis transcripts and the cytochrome multi-family transcripts to the KEGG and cytochrome P450 engineering database (CYPED) [1,2]. Moreover, the differential expression profiles of those transcripts were assessed into two data models, i.e., between the stages of the developmental life cycle and between the Papaver species systematically [1,2]. Moreover, as the genome sequence has been utilized to explain the evolutionary history of morphine pathway [4], and to elucidate their core functions that exist in Papaver plant which can adapt to the whole plant community, as it is selfincompatibility to various environments [5]; hence, this data set could be valuable to assess the genetics behind the Papaver plant functions.
Specifications Table   Subject Biology Specific subject area Transcriptomics Type of data

Value of the Data
This transcriptome data can be useful to elucidate the transcriptome-wide association SNP markers and to assess the differences in the quantity of secondary metabolites, among and within Papaver species and subspecies. The phenotypic data (Flower colour, petal arrangements, number of petals) can be useful to identify the associated SNP markers for more detailed characterizations. The iso-seq data from two samples may help to improve the existing gene annotation of the representative Papaver somniferum genome.

Plant samples
Five Papaver nudicaule varieties with different colours of flowers, i.e., white, yellow, pink, orange, and scarlet have grown individually in multiple pots and maintained at 30 C for 3 months. For the mRNA sequencing, leaf samples were obtained from three developmental stages (i.e., 30, 60, and 90 days). Another four Papaver species (i.e., P. rhoeas, P. nudicaule, P. somniferum, and P. fauriei) have been sampled with a similar procedure that belongs to this project [1,2]. The samples collected for transcriptomic analysis was immediately frozen in the liquid nitrogen and stored in a deep freezer at À70 C. For each species, the experiments were repeated in triplicates (under the same conditions). Phenotypic differences among these plants, i.e., flower colour, leaves, and the visual appearance of the plant with flowers, are shown in Fig. 1.

Transcriptome sequencing
The complete sequence library preparation and sequencing experiments for the Illumina protocols were conducted by Macrogen Inc. (Seoul, Korea) (http://www.macrogen.com), the authorized sequence service providers for every individual sample. Illumina Hi-Seq 4000 system has been used to sequence all the individual samples. The details on the RNA library construction was given in the   published articles [1,2]. Total raw Illumina short reads from each sample underwent the pre-processing steps, in order to remove the adapter, and low-quality reads using Trimmomatic v0.36 [6]. The processed short reads were then mapped to the assembled transcriptome using Salmon v0.9.1 [7].

Dataset
The complete sequences generated in this article have been submitted to the GenBank sequence read archive (SRA) under the bio-project ID PRJNA476004, as given in Table 2.