RNA-seq data of Elaeidobius kamerunicus from North Sumatera and Central Kalimantan in Indonesia

Since its introduction from West Africa about 40 years ago, Elaeidobius kamerunicus Faust (Coleoptera: Curculionidae) has remained the main pollinating agent in oil palm plantations in Indonesia. However, pollination success varies among regions, for example between Sumatra and Kalimantan: oil palm fruit set in Kalimantan tends to be lower than in Sumatra. Preliminary studies show that weevils from Kalimantan visit female flowers less often than those from Sumatra. The molecular mechanisms that regulate insect behavior, especially in relation to the weevils' role as pollinating agents, are not yet clearly understood. Therefore, a transcriptomic study was carried out to obtain raw data for determining gene expression differences related to the behavior of the same weevil species from two different regions. Here, we present two data sets of RNA-seq reads, which are available in the NCBI Sequence Read Archive (SRA) database under accession numbers SRR21521626 and SRR21521625 for weevils from North Sumatra and Central Kalimantan, respectively.



Value of the Data
• Elaeidobius kamerunicus is the main oil palm pollinator in Indonesia [1][2][3]; however, its effectiveness differs among regions, especially between Sumatra and Kalimantan [4][5][6].
• Some studies have reported genomic data for E. kamerunicus [7][8][9], but no transcriptomic analysis is available yet.
• Two sets of raw FASTQ transcriptome data, EkNS and EkCK, are reported here to support the understanding of the molecular mechanisms underlying the behavior of the weevil as an oil palm pollinator.
• The transcriptomic data can be further analyzed for differential gene expression, which is essential for determining the major genes involved in the behavior of the insect and may ultimately support the efficiency of oil palm pollination.

Objectives
The primary objective of the reported data was to support the understanding of the molecular mechanisms underlying the behavior of E. kamerunicus as an oil palm pollinator in Indonesia. The second objective was to determine differential gene expression between the weevils from the two regions in Indonesia, EkNS and EkCK, especially for genes involved in their behavior as oil palm pollinating insects.

Data Description
The difference in oil palm fruit set between Sumatra and Kalimantan has been attributed to differences in the behavior of the E. kamerunicus weevil, the main oil palm pollinating insect. Raw FASTQ data files generated from the transcriptomes of the two weevil samples, EkNS and EkCK, have been deposited in the NCBI SRA database under accession numbers SRR21521626 and SRR21521625, respectively. These data could be used as guidance for understanding the molecular mechanisms behind the behavior of the weevils. Descriptive statistics on the RNA-seq data of the EkNS and EkCK weevils are given in Table 1.

Insect sample collection
The collected samples were weevils that had just emerged from the male inflorescence 4-5 days through anthesis [10]. The oil palm male inflorescence is the only breeding site for this insect, from egg and larva to pupa [11, 12]. The weevils originated from two places: Marihat, Simalungun, North Sumatra, and Pundu, East Kotawaringin, Central Kalimantan. At each place, 3 male inflorescences were taken from 3 locations in one estate, and 10 male and 20 female newly emerged weevils were then selected from each inflorescence. The sample for each region was a mixture of male and female weevils from the three sampling locations. The live weevil samples were immersed in RNAlater (Sigma) in 1.5 mL tubes and then stored at -20 °C until RNA extraction.

RNA isolation, library preparation and RNA-seq
The RNA of EkNS and EkCK weevils (whole body) was extracted using the Quick-RNA Miniprep Plus Kit (Zymo Research) according to the manufacturer's instructions. The quantity and quality of the total RNA were validated using a NanoDrop spectrophotometer (Thermo Scientific, USA) for the purity of the RNA samples, and an Agilent 2100 Bioanalyzer (Agilent RNA 6000 Nano Kit) for the RNA integrity (RIN value), the 28S/18S ratio and the fragment length distribution. The samples were then sequenced on the Illumina NovaSeq 6000 PE150 platform in the following steps: a) mRNA enrichment, b) double-stranded cDNA synthesis, c) end repair, addition of an A overhang and adaptor ligation, d) fragment selection and PCR amplification, e) library quality test, f) sequencing on the NovaSeq 6000 PE150 platform. The workflow for processing the RNA-seq data is illustrated in Fig. 1, with the data stored in FASTQ format. Clean reads were obtained by removing adaptor sequences, reads with more than 10 % unknown bases (N) and low-quality reads. The data were then ready for further bioinformatics processing.
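The read-filtering criteria described above can be illustrated with a minimal Python sketch. It is not the pipeline used to produce the deposited data. The 10 % limit on unknown bases comes from the text; the definition of a low-quality read (more than half of the bases below Phred 20), the Phred+33 encoding, and all function names are assumptions made for illustration only. Adaptor removal is not covered.

```python
from typing import List, Tuple

Read = Tuple[str, str, str]  # (read id, sequence, quality string)


def n_fraction(seq: str) -> float:
    """Fraction of bases in a read that are unknown (N)."""
    if not seq:
        return 1.0
    return seq.upper().count("N") / len(seq)


def low_quality_fraction(qual: str, min_phred: int = 20, offset: int = 33) -> float:
    """Fraction of bases whose Phred score is below min_phred.

    Assumes Phred+33 encoded quality strings (an assumption, not from the source).
    """
    if not qual:
        return 1.0
    low = sum(1 for ch in qual if ord(ch) - offset < min_phred)
    return low / len(qual)


def is_clean(seq: str, qual: str, max_n: float = 0.10,
             min_phred: int = 20, max_low: float = 0.5) -> bool:
    """Keep a read unless it has more than 10 % N bases or is low quality.

    max_n follows the text; min_phred and max_low are illustrative assumptions.
    """
    return (n_fraction(seq) <= max_n
            and low_quality_fraction(qual, min_phred) <= max_low)


def filter_reads(reads: List[Read]) -> List[Read]:
    """Return only the reads that pass both filters."""
    return [r for r in reads if is_clean(r[1], r[2])]


example = [
    ("r1", "ACGTACGTAC", "IIIIIIIIII"),  # no N, high quality: kept
    ("r2", "ANNTACGTAC", "IIIIIIIIII"),  # 20 % N: removed
    ("r3", "ACGTACGTAC", "!!!!!!IIII"),  # 60 % low-quality bases: removed
]
kept_ids = [r[0] for r in filter_reads(example)]
```

For paired-end data such as PE150, a pair would typically be discarded when either mate fails the filter; that extension is left out here for brevity.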

Table 1
Descriptive information on the RNA-seq raw data for two samples of Elaeidobius kamerunicus from North Sumatra (EkNS) and Central Kalimantan (EkCK).
Clean Reads Q20 (%): the rate of bases with a quality value greater than 20 in clean reads. Clean Reads Q30 (%): the rate of bases with a quality value greater than 30 in clean reads. Clean Reads Ratio (%): the ratio of the amount of clean reads.
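As a hedged illustration of how the Q20 and Q30 rates defined above could be computed from FASTQ quality strings, the following Python sketch counts the bases whose Phred score exceeds each threshold. The Phred+33 encoding and the function name q_rates are assumptions. The strict "greater than" comparison follows the wording of the table note; some tools use "greater than or equal to" instead.

```python
from typing import Iterable, Tuple


def q_rates(quals: Iterable[str], offset: int = 33) -> Tuple[float, float]:
    """Return (Q20 %, Q30 %) over all bases in the given quality strings.

    Assumes Phred+33 encoding. A base counts toward Q20 (Q30) when its
    Phred score is strictly greater than 20 (30), as worded in the table note.
    """
    total = above20 = above30 = 0
    for qual in quals:
        for ch in qual:
            score = ord(ch) - offset
            total += 1
            if score > 20:
                above20 += 1
            if score > 30:
                above30 += 1
    if total == 0:
        return 0.0, 0.0
    return 100.0 * above20 / total, 100.0 * above30 / total


# Phred scores: 'I' = 40, ':' = 25, '+' = 10, '!' = 0, '5' = 20, '?' = 30
q20, q30 = q_rates(["II:+", "!5?I"])
```

In this toy input, five of the eight bases score above 20 and three score above 30.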