Dataset from de novo transcriptome assembly of Nephelium lappaceum aril

Nephelium lappaceum (Rambutan), is one of tropical fruit in which - cultivated widely in Indonesia and has good taste and aroma. However, the transcriptomic study of rambutan has limited. In this study, we performed transcriptome assembly using paired-end Illumina technology. The assembled transcriptome was constructed using Trinity and after filtering and removal sequences redundancy produced 36,303 contigs. The contig ranged 201–11,770 bp and N50 has 1327 bp. The contig was annotated with several databases such as SwissProt, TrEMBL, and nr/nt of NCBI databases. The raw reads are deposited in the DDBJ with DRA accession number, DRA007359: https://www.ncbi.nlm.nih.gov/sra/?term=DRA007359. The assembled contigs of transcriptome are deposited in the DDBJ TSA repository with accession number IADQ01000001–IADQ01036303: ftp://ftp.ddbj.nig.ac.jp/ddbj_database/tsa/IADQ.gz and also can be accessed at http://rujakbase.id.


a b s t r a c t
Nephelium lappaceum (Rambutan), is one of tropical fruit in whichcultivated widely in Indonesia and has good taste and aroma. However, the transcriptomic study of rambutan has limited. In this study, we performed transcriptome assembly using paired-end Illumina technology. The assembled transcriptome was constructed using Trinity and after filtering and removal sequences redundancy produced 36,303 contigs. The contig ranged 201-11,770 bp and N50 has 1327 bp. The contig was annotated with several databases such as SwissProt, TrEMBL, and nr/nt of NCBI databases.

Value of the data
This data provides transcriptome for the first time of Nephelium Lappaceum from aril fruits. This data will be useful to obtain molecular markers such as microsatellite and single nucleotide polymorphisms for breeding and selection of new cultivars in Nephelium Lappaceum and relatedgenus.
This data will further be valuable for gene expression analysis using treatments among their species and related-genus.

Data
Nephelium lappaceum L. (rambutan) is originated from Indonesia and Malay Peninsula [1]. In this study, a de novo transcriptome assembly of Nephelium lappaceum has been reported. The transcriptome data were obtained from the aril part of the fruit. The aril tissue was collected, and the high quality of RNA was extracted for paired-end sequencing technology of Illumina. The high quality of reads was obtained, and de novo assembly was performed using Trinity v.2.4.0 [2]. All statistics of reads and assembled sequence were determined ( Table 1). The contigs were reconstructed using CAP3 [3] and CD-HIT-EST v.4.6.8 [4] to remove redundant contigs and then the contigs were filtering and clustering using Corset v.1.06 [5]. The contigs were annotated with several databases using the BLASTþ v.2.7.1 program [6]. An overview of the sequencing assembly of Nephelium lappaceum transcriptome data is presented in Table 2.

Experimental design, materials, and methods
Rambutan var. Binjai were collected from Mekarsari Fruit Garden at ripening stage. The flesh aril was used for RNA extraction. The total RNA was extracted using ISOLATE RNA (Bioline) following the protocol. The quality and quantity of DNA were checked by P360 Nanophotometer (Implen, München, Germany). The extracted RNA was subjected to preparation of a paired-end library for RNA sequencing using the Illumina Hiseq X Ten (BGI, Hongkong). After sequencing, the raw reads were filtered. Data filtering includes removing adaptor sequences, contamination and low-quality read from raw reads. The high quality of reads used to construct assembled transcriptome using Trinity package with default parameters and minimum length of 200 bp. The assembled contigs were performed by CAP3 ( À p 90), and CD-HIT-EST ( À c 0.90 À M 0 À T 0) and clustering with Corset after filtering low expression reads below 1 CPM. Several databases such as nt and nr databases from NCBI and Swis-sProt and TrEMBL databases from UniProt were used to annotate the contigs using the BLASTþ program with the cut-off of 10 À 5 [7].