Draft genome sequence data of the anaerobic, thermophilic, chitinolytic bacterium strain UUS1-1 belonging to genus Hydrogenispora of the uncultured taxonomic OPB54 cluster

Strain UUS1-1 (=JCM33882 =DSM111537) is a novel chitinolytic, thermophilic, anaerobic bacterium belonging to the genus Hydrogenispora of the uncultured taxonomic OPB54 cluster within the phylum Firmicutes. Strain UUS1-1 has a unique, long, hair-like rod morphology and a strong ability to degrade crystalline chitin. The whole genome of strain UUS1-1 was sequenced on an Ion GeneStudio S5 system, which yielded 86 contigs comprising 2,482,547 bp, 2235 protein-coding sequences, and a G+C content of 52.1 mol%. Strain UUS1-1 is the second cultivable isolate, besides H. ethanolica, within the OPB54 cluster and may be classified as a novel species. The genomic data have been deposited at the National Center for Biotechnology Information (NCBI) under accession number JAAKDE00000000.


a b s t r a c t
Strain UUS1-1 ( = JCM33882 = DSM111537) is a novel chitinolytic, thermophilic, anaerobic bacterium belonging to the genus Hydrogenispora of the uncultured taxonomic OPB54 cluster within the phylum Firmicutes. Strain UUS1-1 has a unique, long, hair-like rod morphology and a strong ability to degrade crystalline chitin. The whole genome of strain UUS1-1 was sequenced on an Ion GeneStudio S5 system, which yielded 86 contigs comprising 2,482,547 bp, 2235 proteincoding sequences, and a G + C content of 52.1 mol%. Strain UUS1-1 is the second cultivable isolate, besides H. ethanolica , within the OPB54 cluster and may be classified as a novel species. The genomic data have been deposited at the National Center for Biotechnology Information (NCBI) under accession number JAAKDE0 0 0 0 0 0 0 0.  Table   Subject Microbiology Specific subject area Bacteriology, Genomics Type of data Table, Figure  How data were acquired Whole-genome sequencing using the Ion GeneStudio S5 system Data format Raw, Analyzed Parameters for data collection Genomic DNA was extracted from a pure culture of strain UUS1-1 (JCM 33882), which belongs to the genus Hydrogenispora . The genome of strain UUS1-1 was sequenced using the Ion GeneStudio S5 system, assembled de novo using the CLC Genomics Workbench 20.0.1, and annotated using the NCBI Prokaryotic Genome Annotation Pipeline. Description of data collection Genomic DNA was extracted from strain UUS1-1. Following whole-genome sequencing, the genome was assembled and annotated. Data source location Japan International Research Center for Agricultural Sciences (JIRCAS), Tsukuba, Ibaraki, Japan Data accessibility Repository name: NCBI Data identification number: JAAKDE0 0 0 0 0 0 0 0 0. The version described in this paper is JAAKDE010 0 0 0 0 0 0.1 Direct URL to data: https://www.ncbi.nlm.nih.gov/nuccore/JAAKDE0 0 0 0 0 0 0 0 0.

Value of the Data
• Strain UUS1-1 is a novel, anaerobic, thermophilic, chitinolytic bacterium belonging to the genus Hydrogenispora of the uncultured taxonomic OPB54 cluster. • The data from this draft genome can help the scientific community understand the genetic properties of this unique bacterial taxon which currently contains only one other cultivated isolate ( Hydrogenispora ethanolica ). In addition, these data will be valuable to engineering researchers that study thermophilic and chitinolytic enzymes. • The data from this draft genome provide new insights into crystalline chitin degradation ability in thermophilic conditions and properties of the genus Hydrogenispora .

Data Description
Chitin is an insoluble linear chain of β-1,4-N -acetylglucosamine (GlcNAc). This crystalline polymer is the major component of the exoskeletons of arthropods, such as shrimps and insects [ 1 , 2 ]. Chitin and its derivatives can be transformed into highly valuable compounds that have many industrial applications. Chitinase-producing microorganisms have been isolated from multiple environments. Bacterial chitinases hold promise for use in several commercial applications, such as production of GlcNAc and chitosan. Thermostable chitinases from bacteria may have numerous applications because such chitinases would likely reduce industrial costs [ 3 , 4 ]; however, the chitinases currently used for industrial applications are mainly from mesophilic bacteria [5] . Thus, screening, identifying, and studying the functional properties of thermophilic chitinases is crucial. Currently, there is limited information describing the participation of anaerobic thermophilic bacteria in insoluble chitin degradation systems. Strain UUS1-1, classified as belonging to the genus Hydrogenispora and the uncultured taxonomic OPB54 cluster, was successfully isolated as a pure culture from a bacterial community. Strain UUS1-1, deposited at the RIKEN BioResource Research Center as JCM33882 and the German Collection of Microorganisms and Cell Cultures GmbH (DSMZ) as DSM111537, possesses unique features such as a hair-like rod morphology, and high chitinase activity in thermophilic conditions. Strain UUS1-1 is the first anaerobic, thermophilic, chitinolytic bacterium, and the second cultivated isolate besides H. ethanolica that belongs to the OPB54 cluster [6] .
We sequenced the genome of strain UUS1-1 to obtain new information on anaerobic, thermophilic chitin degradation systems within the genus Hydrogenispora and the taxonomic cluster OPB54, which belongs to an unidentified taxon at the order-or class-level. Features of the genome are shown in Table 1 . DNA sequencing, performed using the Ion GeneStudio S5 system, generated 11,760,377 single reads with an average length of 187 bp. The genome was assembled de novo using CLC Genomics Workbench 20.0.1 (CLC Bio, Qiagen, Valencia, CA), which resulted in 86 contigs with an N50 of 117,588 bp and a maximum size of 238,885 bp. The genome of strain UUS1-1 comprised 2,482,547 bp and had a G + C content of 52.1 mol%. Interestingly, the genome of the most closely related strain in this genus, H. ethanolica strain LX-B, was nearly identical in its G + C content but 2.4 times the size (5,983,461 bp, G + C content 54.2 mol%).

Genomic DNA extraction and sequencing
Genomic DNA of strain UUS1-1 was prepared by phenol/chloroform extraction from cells grown at 60 °C with glucose as the carbon source [10] . Fragmentation and library preparation of DNA was performed using an Ion Xpress Plus Fragment Library Kit (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer's protocol, which generated fragments with an average length of 400 bp. Fragments of approximately 200 to 300 bp were size-selected by electrophoresis on E-Gel SizeSelect II agarose gels (Invitrogen, Thermo Fisher Scientific, Waltham, MA, USA) before library preparation. The genomic DNA sequences of strain UUS1-1 were obtained using an Ion GeneStudio S5 system and processed.

Genome assembly and annotation
De novo genome assembly was performed using CLC Genomics Workbench version 20.0.1. after removal of low-quality reads. The genome was annotated using the NCBI PGAP [9] . The prediction of function of each annotated gene was executed within the Integrated Microbial Genomes-Expert Review system developed by the Joint Genome Institute (Walnut Creek, CA, USA) [11] .

Genomic ANI
Pairwise ANI values from the whole-genome sequences of two Hydrogenispora strains, two Desulfofundulus strains, three Desulfotomaculum strains, one Pelotomaculum strain, and one Moorella strain were calculated using GENETYX NGS version 4.1.1 with the BLASTALL algorithm. The matrix generated from the ANI values among these nine strains and strain UUS1-1 was converted into a genetic dendrogram using algorithms such as the unweighted pair group method with arithmetic means and the single-linkage clustering method in the R statistical program [7] . ment (SATREPS) (grant number JPMJSA1801 ) of the JST/ Japan International Cooperation Agency (JICA). We thank James Allen, DPhil, from Edanz Group ( https://en-author-services.edanzgroup. com/ac ) for editing a draft of this manuscript.

Supplementary Materials
Supplementary material associated with this article can be found in the online version at doi: 10.1016/j.dib.2020.106528 .