Draft Genome Sequence of a New Oscillospiraceae Bacterium Isolated from Anaerobic Digestion of Biomass

Here, we present the genome sequence and annotation of the novel bacterial strain HV4-5-C5C, which may represent a new genus within the family Oscillospiraceae (order Eubacteriales). This strain is a potential keystone species in the hydrolysis of complex polymers during anaerobic digestion of biomass.

In the past few years, efforts have been undertaken to characterize new species isolated from anaerobic digesters (1,2). However, few articles have focused on microorganisms isolated from acidic pretreatment stages. We present here the draft genome sequence of the bacterial strain HV4-5-C5C, which was isolated from the acidification stage of a mesophilic two-stage laboratory-scale leach bed system fed with freshly cut grass as the sole substrate. The grass was taken from a meadow in Jena, Germany (50°51′55.4″N, 11°35′56.1″E). Isolation of the strain was performed after the diluted hydrolysate was reincubated with microcrystalline cellulose as the sole carbon source. After incubation, the hydrolysate was diluted 10^5-fold, plated on BBL Columbia agar (BD Biosciences) supplemented with 5% laked horse blood, and cultivated under anoxic conditions at 37°C. For purification, single colonies were picked and restreaked several times.
After cultivation in brain heart infusion broth (Carl Roth) supplemented with yeast extract, DNA was extracted and purified using the Gentra Puregene Yeast/Bact. kit (Qiagen) and the NucleoSpin genomic DNA (gDNA) cleanup kit (Macherey-Nagel). We constructed a Nextera XT library from the total genomic DNA and sequenced it using the Illumina NextSeq 500 platform (150-bp paired-end reads). The raw reads were filtered (quality [Q], >20; minimum length, >50 nucleotides [nt]) with BBTools v37.10, yielding 23.48 million paired-end sequences with a mean Q value of 32.93. Genome assembly was conducted with the software SPAdes v3.13.0 (3). A total of 72 contigs were obtained (length, ≥300 nt), covering a total genome size of 2,867,854 nt with an estimated GC content of 53.25%. The largest contig was 296,629 nt, and the N50 value was 134,989 nt. The final coverage of the genome was 2,457×.
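For readers unfamiliar with the assembly statistics reported above, the following Python sketch illustrates how an N50 value and an approximate sequencing depth can be derived. It is not the pipeline used in this study. The function names and the toy contig lengths are hypothetical, and the depth estimate assumes that every filtered read keeps its full 150-nt length, which quality trimming would reduce in practice.

```python
def n50(contig_lengths):
    """Return the N50: the length of the contig at which the running total,
    taken from the longest contig downward, first reaches half the assembly."""
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    raise ValueError("contig_lengths must not be empty")


def estimated_coverage(read_pairs, read_length, genome_size):
    """Approximate depth as total sequenced bases divided by genome size.
    Assumes two reads per pair, each of full length (an upper-bound simplification)."""
    return read_pairs * 2 * read_length / genome_size


# Hypothetical toy assembly, not the contigs of strain HV4-5-C5C.
toy_contigs = [100, 80, 60, 40, 20]
print(n50(toy_contigs))  # 80: the two longest contigs (180 nt) cover half of 300 nt

# Values reported in the text: 23.48 million read pairs, 150-nt reads, 2,867,854-nt assembly.
depth = estimated_coverage(23.48e6, 150, 2_867_854)
print(round(depth))  # about 2,456, in line with the reported 2,457x
```

Under these assumptions, the reported read count and assembly size reproduce the stated coverage to within rounding.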
The genome harbors 88 different glycoside hydrolases and 2 polysaccharide lyases (7). Strain HV4-5-C5C might be able to carry out alcoholic fermentation as well as the synthesis of lactate, formate, and acetate (Table 1). Therefore, strain HV4-5-C5C may be a keystone species in the hydrolysis of complex polymers as well as in the acidogenesis and acetogenesis steps.
The average amino acid identity (11) and the percentage of conserved proteins (12) calculated for strain HV4-5-C5C and M. indolicus CCUG 59143, the type species of the genus Mageeibacillus, were 15.16% and 25.13%, respectively. These low values suggest that strain HV4-5-C5C may represent a new genus within the family Oscillospiraceae (order Eubacteriales) (11,12). Default parameters were used for all software unless otherwise specified.
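As an illustration of the second metric, the percentage of conserved proteins (POCP) is commonly defined as (C1 + C2)/(T1 + T2) × 100, where C1 and C2 are the numbers of conserved proteins found in each genome when compared against the other, and T1 and T2 are the total numbers of proteins in each genome. A POCP value below about 50% is generally taken to indicate that two strains belong to different genera. The Python sketch below applies this formula. The function name and the protein counts are hypothetical and are not the values underlying the 25.13% reported here.

```python
def pocp(conserved_1, conserved_2, total_1, total_2):
    """Percentage of conserved proteins between two genomes:
    (C1 + C2) / (T1 + T2) * 100."""
    if total_1 <= 0 or total_2 <= 0:
        raise ValueError("total protein counts must be positive")
    return (conserved_1 + conserved_2) / (total_1 + total_2) * 100


# Hypothetical counts for illustration only (not data from this study).
value = pocp(conserved_1=300, conserved_2=320, total_1=2600, total_2=1200)

# Commonly used genus-level boundary; treated here as an assumption.
GENUS_THRESHOLD = 50.0
print(f"POCP = {value:.2f}%, below genus threshold: {value < GENUS_THRESHOLD}")
```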
Data availability. Strain HV4-5-C5C was deposited at the German Collection of Microorganisms and Cell Cultures under the designation DSM 103941. This whole-genome sequencing (WGS) project has been deposited at DDBJ/ENA/GenBank under the accession number JAAVLZ000000000.1. The raw sequence reads are deposited under SRA accession number SRR11413021. The WGS and SRA records are associated with BioProject accession number PRJNA614915.