16S rRNA metagenomic dataset on endophytic bacterial community of the cashew plant (Anacardium occidentale L.) grown in Dak Lak Province of Vietnam

Vietnam is currently one of the largest producers and exporters of cashew nuts in the world. Cashew (Anacardium occidentale L.) is one of the main industrial crops cultivated in Dak Lak Province of Vietnam. Understanding the endophytic bacteria of this plant can support the development of a new biofertilizer for sustainable cashew nut production. In this report, a pooled cashew root sample was collected from cashew fields in Dak Lak in 2021. The DNeasy PowerSoil kit was used to extract the genomic DNA of endophytic bacteria from the root sample. The 16S rRNA genes (V1–V9 regions) were amplified by PCR, and amplicon libraries were prepared using the Swift amplicon 16S plus ITS panel kit. The amplicon libraries were sequenced on the Illumina MiSeq platform for 16S rRNA metagenomic analysis. Taxonomic analyses showed that Gammaproteobacteria (38.77 %) and Alphaproteobacteria (37.76 %) were the predominant classes among the endophytic bacteria. Functional analyses revealed that biosynthesis (72.78 %) was the primary function of the endophytic bacterial community. Raw sequences (Fastq files) have been deposited in Mendeley Data [1]. The data provide insight into the endophytic bacterial community of cashews cultivated in Dak Lak Province of Vietnam and are valuable for developing a new biofertilizer for cashew nut production based on endophytic bacteria. To the best of our knowledge, this is the first report on the endophytic bacterial communities of cashews cultivated in this province or elsewhere in the Central Highlands of Vietnam.



Value of the Data
• The data provide taxonomic and functional profiles of the endophytic bacteria of cashew plants cultivated in Dak Lak Province, Vietnam.
• The data can be used to compare the endophytic bacteria of cashew plants grown in Dak Lak with those of cashew plants grown in other regions.
• The data can support the development of a new biofertilizer, based on endophytic bacteria, for sustainable cashew nut production.

Background
Vietnam was one of the world's top 10 producers and exporters of cashew nuts from 2011 to 2022. Cashew is one of the main perennial industrial crops cultivated in the Central Highlands region of Vietnam. In 2022, Vietnam had 322,300 hectares planted with cashew and produced 341,700 tons, of which the Central Highlands contributed 83,900 hectares and 33,560 tons, respectively. Among the five provinces in this region, Dak Lak was the biggest producer of cashew nuts [2]. Currently, chemical fertilizers are commonly used for cashew nut production in the province. However, chemical fertilizers can harm the environment, reduce the quality of cashew nuts, and increase farmers' input costs. Hence, using indigenous bacteria is thought to be the best strategy for producing cashews sustainably. Data on the endophytic and rhizospheric bacteria of black pepper, coffee, and sugarcane plants cultivated in Dak Lak have been explored to develop new cultivation techniques for the sustainable production of these crops [3–6]; however, to the best of our knowledge, no data on the endophytic microorganisms of cashew plants grown in the province, or elsewhere in the Central Highlands, have been reported. This work aimed to establish a dataset on the endophytic bacteria of the cashew plant grown in Dak Lak Province, in the Central Highlands of Vietnam.

Sampling
Five cashew root samples (5 to 30 cm below the soil surface) were collected from five cashew fields on 30 October 2021 in Hoa Phu Commune, Buon Ma Thuot City, Dak Lak Province, Vietnam. The samples were then combined to create a single representative sample. Root sampling, treatment, and storage were conducted as described by Tran et al. [6].

Genomic DNA extraction, library preparation, and sequencing
Genomic DNA extraction, library preparation, and sequencing were conducted as described previously [3–6]. Briefly, metagenomic DNA was extracted from 0.3 g of the root sample using the DNeasy PowerSoil kit (Qiagen, USA). The 16S rRNA gene amplicon sequencing libraries were prepared using the Swift amplicon 16S plus internal transcribed spacer (ITS) panel (Swift Biosciences, USA). The libraries were then sequenced on the Illumina MiSeq platform (2 × 150 bp, paired-end).
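For readers who download the deposited Fastq files, a basic sanity check before any downstream analysis is to confirm that the forward and reverse files contain the same number of reads and that no read exceeds the 150-cycle length implied by the 2 × 150 paired-end run. The following is a minimal sketch using only the Python standard library. It is not part of the authors' pipeline; the function names and the file names in the usage note are hypothetical, and the layout of the files in the Mendeley Data deposit is an assumption.

```python
import gzip
from collections import Counter
from pathlib import Path


def open_fastq(path):
    """Open a FASTQ file as text; files ending in .gz are decompressed."""
    path = Path(path)
    if path.suffix == ".gz":
        return gzip.open(path, "rt")
    return open(path, "rt")


def read_lengths(path):
    """Return a Counter mapping read length -> number of reads.

    Assumes the common four-line FASTQ layout (header, sequence, '+',
    quality), which is what Illumina instruments normally produce.
    """
    lengths = Counter()
    with open_fastq(path) as handle:
        while True:
            header = handle.readline()
            if not header:
                break  # end of file
            if not header.strip():
                continue  # tolerate stray blank lines
            sequence = handle.readline().rstrip("\r\n")
            plus = handle.readline()
            quality = handle.readline().rstrip("\r\n")
            if not header.startswith("@") or not plus.startswith("+"):
                raise ValueError(f"Malformed FASTQ record in {path}")
            if len(sequence) != len(quality):
                raise ValueError(f"Sequence/quality length mismatch in {path}")
            lengths[len(sequence)] += 1
    return lengths


def summarize_pair(r1_path, r2_path, max_len=150):
    """Summarize a forward/reverse FASTQ pair.

    max_len=150 reflects the 2 x 150 run described in the text; reads may
    be shorter if they were trimmed before deposition.
    """
    r1 = read_lengths(r1_path)
    r2 = read_lengths(r2_path)
    reads_r1 = sum(r1.values())
    reads_r2 = sum(r2.values())
    longest = max(max(r1, default=0), max(r2, default=0))
    return {
        "reads_r1": reads_r1,
        "reads_r2": reads_r2,
        "pairs_match": reads_r1 == reads_r2,
        "max_length": longest,
        "within_cycle_limit": longest <= max_len,
    }
```

A call such as summarize_pair("cashew_root_R1.fastq.gz", "cashew_root_R2.fastq.gz"), with the hypothetical names replaced by the actual downloaded files, returns a small dictionary. If "pairs_match" or "within_cycle_limit" is False, the download or file pairing should be checked before taxonomic or functional profiling.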

Fig. 1. Krona chart representation of the taxonomic classification of the cashew root endophytic bacteria.

Fig. 2. Krona chart representation of the functional profiles of the cashew root endophytic bacteria.