Genome sequence data of Bacillus velezensis BP1.2A and BT2.4

Here, we report the complete genome sequence data of the biocontrol strains Bacillus velezensis BP1.2A and BT2.4 isolated from Vietnamese crop plants. The size of the genomes is 3,916,868 bp (BP1.2A), and 3,922,686 bp (BT2.4), respectively. The BioProjects have been deposited at NCBI GenBank. The GenBank accession numbers for the B. velezensis strains are PRJNA634914 (BP1.2A) and PRJNA634832 (BT2.4) for the BioProjects, CP085504 (BP1.2A) and CP085505 (BT2.4) for the chromosomes, GCA_013284785.2 (BP2.1A), and GCA_013284785.2 (BT2.4) for GenBank assembly accessions, and SAMN15012571 (BP1.2A) and SAMN15009897 (BT2.4) for the BioSamples. Both genomes were closely related to FZB42, the model strain for plant growth promoting bacilli.


Specifications
Biological sciences Specific subject area Molecular Phylogenetics Type of data

Value of the Data
• The data of this article demonstrate that it is possible, to isolate closely related Bacillus strains from remote geographical regions with different climatic conditions • BP1.2A, and BT2.4 share 99.99% identical residues with the model strain FZB42 ( Table 3 ).
The high similarity of the two novel strains with the biocontrol strain FZB42, encourages the development of the strains as promising biocontrol agents used in sustainable agriculture in temperate and subtropical zones, as well. • The data demonstrate that gene clusters involved in non-ribosomal and ribosomal synthesis of antibacterial and antifungal secondary metabolites are highly conserved in different representatives of B. velezensis , despite of their geographical distribution. • For the scientific community, the genome data presented here, extend the resources for comparative genomic analysis among the members of the Bacillus amyloliquefaciens operational group, including Bacillus velezensis , at present the most important species used in biological plant protection. • Furthermore, extended genomic analyses performed between closely related bacteria should elucidate regions and/or genes with different variability and might identify regions (genes) with an enhanced mutation bias.

Data Description
The draft genome sequences of 59 Gram-positive bacterial strains that were isolated from Vietnamese crop plants have been already reported [1] . Two of these strains, B. velezensis BP1.2A,  4), respectively. Automatic genome annotation was performed using the RAST (Rapid Annotation using Subsystems Technology) server [3] , and the NCBI Genome Automatic Annotation Pipeline (PGAP) [4] for the general genome annotation deposited in NCBI.
As shown in Table 1 , subsystem proteins distribution [5] of the two strains is very similar to FZB42 [6] indicating their close relationship. Genome mining of B. velezensis performed with antiSMASH version 6.0 [7] extracted the complete set of gene clusters and genes involved in non-ribosomal and ribosomal synthesis of secondary metabolites previously identified in FZB42  ( Fig. 1 ). Differences to B. velezensis FZB42 were not detected when the genomes were pairwise compared using ANIb [11] ( Fig. 2 ) indicating their close relationship, despite that the sites of their isolation (Vietnam and Germany) are very remote from each other. Table 3 and the Venn diagram presented in Fig. 3 summarize the comparison of the whole genome sequences of BP1.2A, and BT2.4 with FZB42. The three strains share a core genome of 3633 CDS. There is only one additional CDS (encoding a hypothetical protein) in BP1.2A, when compared with BT2.4 suggesting that both strains are identical or nearly identical clones, and  [11] .

Table 3
Sequence comparison of BP1.2A, and BT2.4 with FZB4242 using blastn, and ANIb [11] . The italic numbers set in brackets indicate the overlap of the sequences used in the comparison. Analysis of singletons was performed with the EDGAR software package [12] .  Table 3 ). The slight differences to the numbers given in the Venn diagram ( Fig. 3 ) are due to the different methods applied, as explained in the legend to Fig. 3 .  Table 3 ). The Venn diagram constructed with EDGAR shows the number of best hits between subsets of genomes. But: A gene without reciprocal best hit to another genome is not necessarily a singleton [12] .

Strain growth conditions and DNA isolation
Cultivation of the Bacillus strains and DNA isolation have been previously described [1] .

Phylogenomics
The genome sequence data were uploaded to the Type (Strain) Genome Server (TYGS) for a whole genome-based analysis [10] . All pairwise comparisons were conducted using GBDP, and 100 distance replicates were calculated each. The resulting intergenomic distances were used to infer a balanced minimum evolution tree via FASTME 2.1.6.1 [9] . The tree was visualized with iTOL ( https://itol.embl.de/# ).

Ethics Statements
This work did not contain human subjects, animals, cell lines or endangered species.

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.