Complete Genome Sequence of Geobacillus sp. Strain E55-1, Isolated from Mine Geyser in Japan.

We report here the complete genome sequence of Geobacillus sp. strain E55-1, isolated from the hot sediments of Mine Geyser in Japan. This strain exhibited ∼85% average nucleotide identity and ∼98.5% 16S rRNA sequence identity with the most closely related Geobacillus species.

T he genus Geobacillus comprises a group of Gram-positive thermophilic bacterial species (1) that were originally classified as group 5 within the genus Bacillus but were later reclassified as a new genus, Geobacillus (2). Researchers are interested in Geobacillus from a biotechnological point of view, particularly as a source of thermophilic enzymes (3).
We collected hot sediment samples at Mine Geyser in Japan (4, 5), which were spread over Lennox-LB agar (1.5% [wt/vol]) plates. After overnight incubation at 55°C, several single colonies were isolated. Colony PCR was conducted to amplify the nearly full-length 16S rRNA gene using a set of primers, Bac8f(C) and UN1541r(U) (6). DNA sequencing analysis suggested that many of them were affiliated with one of the following thermophilic bacilli: Bacillus, Geobacillus, or Anoxybacillus. One of the strains, designated E55-1, showed 98.5% identity with the 16S rRNA genes of a Geobacillus species; this strain was subjected to whole-genome analysis by combining Oxford Nanopore Technologies (ONT) and Illumina sequencing technologies. All software analyses were implemented with default settings.
Cells were grown in Lennox-LB broth at 55°C for 18 h. Genomic DNA was extracted from pelleted cells using lysozyme, proteinase K, and achromopeptidase for enhanced cell lysis efficiency (7). For long-read sequencing, genomic DNA was treated with the Short Read Eliminator XS kit (Circulomics). The resulting DNA was used to construct a library using a ligation sequencing kit (SQK-LSK109; ONT). Sequencing was performed using a GridION X5 system with a FLO-MIN106 R9.41 flow cell (ONT) for 6 h. Base calling was performed using Guppy v.3.0.3 (ONT) to generate 68,238 reads corresponding to 781 Mb of genome with an average length of 11,447 bases. The raw reads were filtered (quality [Q], м10; read length, м1,000 bases) using NanoFilt v.2.3.0 (8). The longest read length was 198,777 bases.
For short-read sequencing, a library with ϳ350-bp inserts was generated using the Nextera DNA Flex library prep kit (Illumina), which was subjected to 156-bp paired-end sequencing on the Illumina MiSeq platform. Adapter sequences and low-quality data were trimmed (Q, м30; read length, м10 bases) using fastp v.0.20.0 (9), yielding 1.03 million paired-end reads, spanning 308 Mb with an average length of 149 bases.

ACKNOWLEDGMENT
This work was partly supported by two grants awarded to K.M. from the Japan Society for the Promotion of Science (JSPS), a Grant-in-Aid for Scientific Research (A; 19H00936) and a Grant-in-Aid for Challenging Research (Pioneering; 19H05538).