Draft genome sequence data of Paenibacillus cisolokensis strain LC2-13A and Xylanibacillus composti strain K-13

To discover more efficient degradation processes of lignocellulosic biomass, it is still important to analyze genomic and enzymatic data from bacteria that have strong xylanolytic ability. Here, we present the draft genome sequences of the xylanolytic bacteria Paenibacillus cisolokensis strain LC2-13A and Xylanibacillus composti strain K-13 that are closest to Paenibacillus sp. strain DA-C8, which has strong xylan degradation ability under anaerobic growth conditions. Whole-genome sequencing on the Ion GeneStudio S5 System yielded 277 contigs with total size 5,305,208 bp and G+C content 52.3 mol% for strain LC2-13A and 115 contigs with total size 4,652,266 bp and G+C content of 56.2 mol% for strain K-13. The LC2-13A genome had 5,744 protein-coding sequences (CDSs), 57 tRNAs, and 4 clustered regularly interspaced short palindromic repeats (CRISPRs), and the K-13 genome had 4,388 CDSs, 1 rRNA gene, 45 tRNAs, and 5 CRISPRs. The CDSs of LC2-13A and K-13 encoded the following carbohydrate-active enzymes: 98 and 67 glycoside hydrolases, 31 and 29 glycosyl transferases, 23 and 17 carbohydrate esterases, and 13 and 37 carbohydrate-binding modules, respectively. The whole-genome sequences of LC2-13A and K-13 have been deposited in DDBJ/ENA/GenBank under accession numbers BOVK00000000 and BOVJ00000000. The versions described in this paper are version 1.


a b s t r a c t
To discover more efficient degradation processes of lignocellulosic biomass, it is still important to analyze genomic and enzymatic data from bacteria that have strong xylanolytic ability. Here, we present the draft genome sequences of the xylanolytic bacteria Paenibacillus cisolokensis strain LC2-13A and Xylanibacillus composti strain K-13 that are closest to Paenibacillus sp. strain DA-C8, which has strong xylan degradation ability under anaerobic growth conditions. Wholegenome sequencing on the Ion GeneStudio S5 System yielded 277 contigs with total size 5,305,208 bp and G + C content 52.3 mol% for strain LC2-13A and 115 contigs with total size 4,652,266 bp and G + C content of 56.2 mol% for strain K-13. The LC2-13A genome had 5,744 protein-coding sequences (CDSs), 57 tRNAs, and 4 clustered regularly interspaced short palindromic repeats (CRISPRs), and the K-13 genome had 4,388 CDSs, 1 rRNA gene, 45 tRNAs, and 5 CRISPRs. The CDSs of LC2-13A and K-13 encoded the following carbohydrate-active enzymes: 98 and 67 glycoside hydrolases, 31 and 29 glycosyl transferases, 23 and 17 carbohydrate esterases, and 13 and 37 carbohydrate-binding modules, respectively. The whole-genome sequences of LC2-13A and K-13 have been deposited in DDBJ/ENA/GenBank under accession numbers BOVK0 0 0 0 0 0 0 0 and BOVJ0 0 0 0 0 0 0 0. The versions described in this paper are version 1. ©

Value of the Data
• Genome data from xylanolytic bacteria can be used to design methods for the efficient biological saccharification of lignocellulosic biomass. • The Paenibacillus cisolokensis and Xylanibacillus composti genome data can be used to understand the taxonomy and systematics of xylanolytic Paenibacillus species. • The genome data from P . cisolokensis , X. composti can be compared with those of closely related Paenibacillus species to better understand lignocellulose degradation and improve its efficiency in xylanolytic Paenibacillus species.

Data Description
Plant biomass is composed mainly of three different polymers, i.e., cellulose, hemicelluloses, and lignin. Xylan is the major component of hemicellulose, which is one of the most  [1] . Many of the characterized xylanases belong to glycoside hydrolase families 10 and 11, according to the classification in the Carbohydrate-Active Enzymes (CAZy) database ( http://www.cazy.org ). Among the xylanolytic bacteria, species in genus Paenibacillus are known to produce a variety of xylan degradation enzymes, such as β-1,4-xylanases, β-xylosidases, and α-L-arabinofuranosidase, with potential applications in industrial manufacturing processes [2] . Here, we present the first draft genome assemblies with annotation data of Paenibacillus cisolokensis strain LC2-13A (DSM 101873, NCBI Reference Sequence: NR_151901.1) [3] and Xylanibacillus composti strain K-13 (DSM 29793, NCBI Reference Sequence: NR_159901.1) [4] , which were isolated from the Cisolok geyser (west Java, Indonesia) and a manure compost pile in Hungary, respectively. P. cisolokensis LC2-13A and X. composti K-13 are closely related to the thermophilic, facultatively anaerobic, xylanolytic bacterium Paenibacillus sp. strain DA-C8 (Accession number: BMAQ0 0 0 0 0 0 0 0.1), as was previously reported [5] . Strain DA-C8 shows strong xylan degradation ability under anaerobic growth conditions. Thus, comparisons of genomic information among these three bacteria will help in understanding the differences in their xylan degradation abilities and properties.

Bacterial strain
LC2-13A was cultured in official DSMZ medium 220:caso agar, which consisted of 15.0 g/L peptone from casein, 5.0 g/L peptone from soymeal, and 5.0 g/L sodium chloride, with pH adjusted to 7.3, and K-13 was cultured in DSMZ medium 92:trypticase soy yeast extract, which consisted of 30.0 g/L trypticase soy broth and 3.0 g/L yeast extract.

Genomic DNA purification and sequencing
The cells were cultivated for 2 days under anaerobic conditions at 45 °C, then genomic DNA was extracted using NucleoBond® AXG Columns and NucleoBond® Buffer Set III (Macherey-Nagel, TaKaRa Bio Inc., Kusatsu, Japan) following the manufacturer's protocols. The quantity and purity of the genomic DNA were determined using a NanoDrop One UV-Vis Spectrophotometer and a Qubit 4 Fluorometer (Thermo Fisher Scientific, Waltham, MA, USA), respectively. To construct the libraries, the genomic DNA was fragmented using an Ion Xpress TM Plus Fragment Library Kit (cat. no. #4471269, Thermo Fisher Scientific) following the manufacturer's protocols. The fragmented libraries, which were approximately 20 0-30 0 bp in size, were collected by electrophoresis on Invitrogen E-Gel TM SizeSelect TM II Agarose Gels, 2% (cat. no. #G661012, Thermo Fisher Scientific). Each library was diluted to 25 pM and processed using Ion Chef Systems with the Ion 510, Ion 520, and Ion 530 Kit (cat. no. #A34019, Thermo Fisher Scientific). The LC2-13A and K-13 libraries were sequenced using an Ion 530 Chip with an Ion GeneStudio S5 System.

Ethics Statement
This research and analysis did not involve the use of human subjects or animal experiments.

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that have or could be perceived to have influenced the work reported in this article.