Insights into Brevibacillus borstelensis AK1 through Whole Genome Sequencing: A Thermophilic Bacterium Isolated from a Hot Spring in Saudi Arabia

Brevibacillus borstelensis AK1 is a thermophile which grows between the temperatures of 45°C and 70°C. The present study is an extended genome report of B. borstelensis AK1 along with the morphological characterization. The strain is isolated from a hot spring in Saudi Arabia (southeast of the city Gazan). It is observed that the strain AK1 is rod-shaped, motile, and strictly aerobic bacterium. The whole genome sequence resulted in 29 contigs with a total length of 5,155,092 bp. In total, 3,946 protein-coding genes and 139 RNA genes were identified. Comparison with the previously submitted strains of B. borstelensis strains illustrates that strain AK1 has a small genome size but high GC content. The strain possesses putative genes for degradation of a wide range of substrates including polyethylene (plastic) and long-chain hydrocarbons. These genomic features may be useful for future environmental/biotechnological applications.


Introduction
Thermophiles are a group of heat-loving microorganisms which have an optimum growth temperature of at least 50 ∘ C [1]. The genus Brevibacillus (family Paenibacillaceae and class Bacilli) was initially described as Bacillus brevis in 1900 by Migula [2]. In later years, many new strains were placed in the same group (e.g., B. brevis) including the strains that were not following the sensu stricto criteria that challenged the overall classification. This included discrepancies in maximum growth temperatures and a wide range of GC values, hence confirming B. brevis as a heterogeneous group [3,4]. To resolve these ambiguities, a new genus named Brevibacillus was proposed in 1996 that resulted in reclassification of nearly 10 Bacillus species, based on 16S rDNA analysis [5,6]. Since then, many strains have been reclassified as novel species of the genera originally reported as members of Bacillus brevis [6][7][8][9]. The genus Brevibacillus comprised environmental bacteria that have been observed in diverse habitats including agricultural soil, wastewaters, and hot springs [10,11].
Earlier studies have shown the importance of B. borstelensis in different spheres of industry and the environment [23][24][25][26]. For instance, Arya et al. [25] reported that B. borstelensis possess great potential to degrade the fungicide carbendazim from agricultural fields at high rates, especially when coupled with Streptomyces albogriseolus. Hadad et al. [24] showed that inert polyethylene could be degraded by B. borstelensis strain 707. Tsai et al. [26] evinced that the lipolytic B. borstelensis strain SH168 can enhance the transformation of food wastes into biofertilizer. Baek et al. [23] reported that B. borstelensis strain BCS-1 could produce D-amino acid amidases, a catalyst for the formation of optically pure D-amino acids; these are the intermediates of pharmaceuticals production, food additives, insecticides, synthetic sweeteners, and agrochemicals   [27]. Furthermore, an alkaline pectin lyase applicable in fruit juice and oil extract has also been isolated from B. borstelensis P35. The reassociation analyses, cellular fatty acid profile, and isoprenoid quinone composition analysis confirmed that B. borstelensis owns these properties based on unique DNA base compositions [6,7]. . In this study, we report the genomic insights of B. borstelensis strain AK1 that may help unravel the potential importance of the species in biotechnology. Additionally, since the strain has been previously reported for potential polyethylene (plastic) degradation, we identified the putative genes/enzymes based on KEGG orthology.

Sampling.
The sampling was performed in January 2012 at a hot spring "Al-Ain Alhara", located in the southeast of Gazan city in Saudi Arabia (16 ∘ 56 N, 43 ∘ 15 E). Water samples were taken in sterile thermal glass containers while maintaining the physicochemical quality parameters.

Growth Conditions and Genomic DNA Preparation.
A 5 ml aliquot of the water sample was inoculated in 250 ml of tetrathionate (TT) broth (ATCC medium 697). Incubation was performed at 55 ∘ C for 24 to 48 h at shaking speed of 300 rpm. The bacterial cells were then harvested by centrifugation (14,000 g) and the pellet was spread over TT agar media. Statistically relevant numbers of distinctive colonies were picked and the procedure was repeated thrice to purify the strain for subsequent genomic analysis.

Morphological Characterization.
For morphological characterization, bacterial cells were obtained from stationary phase and subjected to SEM microscopy. Briefly, fixation was performed in 2.5% glutaraldehyde, which was buffered at pH 7.2 with phosphate buffer saline (PBS). Subsequently, the mixture was placed over ice for 2 h followed by washing of bacterial cells for 20 min at room temperature. The post-fixation was performed in osmium tetraoxide (1%) leading to dehydration in a graded ethanol series up to 100% concentration. Evidence codes -IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements. The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Lastly, the specimens were gold-plated (10 nm) and studied under a field emission SEM.

DNA Extraction.
Total bacterial DNA extraction was performed using Genomic DNA isolation kit (Norgen Biotek) as per the manufacturer's guidelines. DNA yield was measured using Qbit Assay that was approximately 40 g, while the quality was determined using Bioanalyzer and Agarose gel before proceeding to library preparation for sequencing.

Genome Sequencing, Assembly, and Functional
Annotation. The genome sequencing was performed using a 454genome sequencer (FLX titanium, Roche), based at Bioscience Core Laboratory of King Abdullah University of Science and Technology (KAUST) in Saudi Arabia. Briefly, 500 ng of genomic DNA was used to construct the fragment library using Rapid Library Preparation Kit. Average insert sizes of the yielded libraries were 750 bp in length. These libraries were then quantified by qPCR. The sequences were subjected to quality control to trim low-quality reads. The filtered sequences were assembled into contigs using Newbler assembler v2.5.3. The automated assembly algorithm yielded over 100 contigs. Out of these contigs, 29 were selected based on high contig length as they represented the expected whole genome length. The assembly was then annotated using Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) available at NCBI. The annotation yielded a total of 5,090 coding genes and out of these 4,951 appeared to be protein-coding genes. Additionally, gene prediction analyses were completed within Integrated Microbial Genomes Expert Review (IMG-ER). The circular visualization of the assembled contigs was generated using an online web-server Circos plotting tool (ClicOFS) [22]. The project summary, as well as associated MIGS information, is shown in Table 1.

Phylogenetic Analysis.
For phylogenetic analysis, the sequence of the 16S rRNA gene was aligned with equivalent 16S rRNA genes of closely related strains as appeared in BLAST search. The tree was calculated with an improved neighbor-joining algorithm known as BioNJ.

Accession Numbers.
The sequenced genome is deposited in GenBank with master record accession no. APBN00000000: the associated 29 contigs are assigned IDs according to this record (accession numbers APBN01000001 to APBN01000029).

Morphological Features and Classification.
Morphological analysis revealed that, in the stationary phase, average diameter of B. borstelensis AK1 ranged from 0.2 to 0.5 m whereas the length varied between 2.0 m and 15.0 m (Figure 1). The cells appeared to be fast growing, forming colonies of 3 mm diameter within a period of 24 hours. The average generation time was recorded to be 30 min at previously described conditions. The colonies were yellowpigmented and had smooth margins while the strain was able to withstand a pH range of 5.5-8.5. The general feature information as suggested by Minimum Information about the Genome Sequence (MIGS) is presented in Table 2. Likewise, phylogenetic analysis displayed that the strain was closely related to B. borstelensis AK2 and Brevibacillus sp. ODC-8 ( Figure 2).  (Figure 3; Table 3). More precisely, this comprised 5,090 predicted genes, of which 4,951 (97.27%) were protein-coding genes while 139 were RNA genes (2.73%). Among these RNA genes, 22 belonged to rRNA that contained eleven 16S, ten 23S, and one 5S gene; on the other hand, 117 genes appeared to be tRNA genes. The putative function was assigned to 3,921 genes (77.03%) whereas the remaining genes were annotated as hypothetical proteins. Protein-coding genes connected to KEGG pathways were 1,348 (26.48%) while genes associated with COG categories appeared to be 3,231 (63.48%). The genes distribution into COGs functional categories is displayed in Table 4.   genes/enzymes of the strain AK1 based on KEGG orthology, KO (i.e., enzyme commission classification), that may possess a potential in degradation services including polyethylene and hydrocarbons. We found presence of 1 cutinase, 67 lipases, 99 hydroxylases, 2 laccases, 1 polyphenol oxidase, and 116 proteases KO database (Supplementary Data) (available here). All of them have been previously reported as potential plastic degrading genes in different bacteria [28][29][30][31][32][33][34].

Genomic Insights and Comparative
Similarly, 159 monooxygenases, 136 dioxygenases, and 118 lyases are identified (Supplementary Data). These enzymes appeared to have a potential role in a number of bioremediation studies including catabolic expression of CYP family [35][36][37]. In any case, the results strengthen the significance of strain AK1 for future biotechnological services that come with advantage of extremophile properties (i.e., high GC content).

Conclusions
Thermophilic organisms are not only of industrial importance, but they can also be exploited in pollutant degradation services. B. borstelensis AK1 was selected based on such a potential and the whole genome sequencing was performed to unravel genomic insights. A general comparison with previously sequenced strains of the same species revealed that the strain AK1 has the smallest genome size but highest GC content. Nevertheless, the presence of putative biodegradation related genes supports the idea of exploiting the species in future environmental/biotechnological services.

Conflicts of Interest
The authors declare that they have no conflicts of interest.