Genomics and cellulolytic, hemicellulolytic, and amylolytic potential of Iocasia fonsfrigidae strain SP3-1 for polysaccharide degradation

Background Cellulolytic, hemicellulolytic, and amylolytic (CHA) enzyme-producing halophiles are understudied. The recently defined taxon Iocasia fonsfrigidae consists of one well-described anaerobic bacterial strain: NS-1T. Prior to characterization of strain NS-1T, an isolate designated Halocella sp. SP3-1 was isolated and its genome was published. Based on physiological and genetic comparisons, it was suggested that Halocella sp. SP3-1 may be another isolate of I. fronsfrigidae. Despite being geographic variants of the same species, data indicate that strain SP3-1 exhibits genetic, genomic, and physiological characteristics that distinguish it from strain NS-1T. In this study, we examine the halophilic and alkaliphilic nature of strain SP3-1 and the genetic substrates underlying phenotypic differences between strains SP3-1 and NS-1T with focus on sugar metabolism and CHA enzyme expression. Methods Standard methods in anaerobic cell culture were used to grow strains SP3-1 as well as other comparator species. Morphological characterization was done via electron microscopy and Schaeffer-Fulton staining. Data for sequence comparisons (e.g., 16S rRNA) were retrieved via BLAST and EzBioCloud. Alignments and phylogenetic trees were generated via CLUTAL_X and neighbor joining functions in MEGA (version 11). Genomes were assembled/annotated via the Prokka annotation pipeline. Clusters of Orthologous Groups (COGs) were defined by eegNOG 4.5. DNA-DNA hybridization calculations were performed by the ANI Calculator web service. Results Cells of strain SP3-1 are rods. SP3-1 cells grow at NaCl concentrations of 5-30% (w/v). Optimal growth occurs at 37 °C, pH 8.0, and 20% NaCl (w/v). Although phylogenetic analysis based on 16S rRNA gene indicates that strain SP3-1 belongs to the genus Iocasia with 99.58% average nucleotide sequence identity to Iocasia fonsfrigida NS-1T, strain SP3-1 is uniquely an extreme haloalkaliphile. Moreover, strain SP3-1 ferments D-glucose to acetate, butyrate, carbon dioxide, hydrogen, ethanol, and butanol and will grow on L-arabinose, D-fructose, D-galactose, D-glucose, D-mannose, D-raffinose, D-xylose, cellobiose, lactose, maltose, sucrose, starch, xylan and phosphoric acid swollen cellulose (PASC). D-rhamnose, alginate, and lignin do not serve as suitable culture substrates for strain SP3-1. Thus, the carbon utilization profile of strain SP3-1 differs from that of I. fronsfrigidae strain NS-1T. Differences between these two strains are also noted in their lipid composition. Genomic data reveal key differences between the genetic profiles of strain SP3-1 and NS-1T that likely account for differences in morphology, sugar metabolism, and CHA-enzyme potential. Important to this study, I. fonsfrigidae SP3-1 produces and extracellularly secretes CHA enzymes at different levels and composition than type strain NS-1T. The high salt tolerance and pH range of SP3-1 makes it an ideal candidate for salt and pH tolerant enzyme discovery.


INTRODUCTION
Starch-based biomass, such as brewery-spent grains, cassava pulp, rice bran, sago pith residues, and wheat bran, is a by-product of agro-industrial and agricultural operations (Hoang & Nghiem, 2021). Starch-based biomass is produced in significant amounts around the world. For example, more than 174.1 MT/year of sugarcane, cassava, rice, and palm are produced in Thailand alone (Jusakulvijit, Bezama & Thrän, 2021). These feedstocks are composed of polysaccharides, including: cellulose, hemicellulose, and starch, which serve as low-price raw materials for bioproducts. Such starch-based biomass is hydrolyzed by cellulolytic, hemicellulolytic, and amylolytic (CHA) enzymes to yield monosaccharides and oligosaccharides (Cheawchanlertfa et al., 2021). Monosaccharides can be converted to bioethanol, organic acids, or other value-added products while oligosaccharides can be used as prebiotics (Phakeenuya et al., 2020).
Anaerobic bacteria are a proven natural source for the identification and isolation of novel CHA enzymes (Cheawchanlertfa et al., 2021). The bioprospecting of extremophiles, including halophilic anaerobic bacteria, has also yielded novel enzymes with unique properties for commercial applications (Kivistö & Karp, 2011). In this study, we examine CHA enzymes derived from the halophilic anaerobic bacterium designated as Halocella sp. SP3-1 (Heng et al., 2019), which is renamed as described below.
Iocasia was recently proposed as a new genus with I. fonsfrigidae NS-1 T as the archetype (Zhang et al., 2021). This strain was isolated from cold seep sediment of the South China Sea. I. fonsfrigidae NS-1 T was shown to metabolize several carbohydrates, including: starch, xylan, alginate, carboxymethyl cellulose, and a polymer of the aromatic compound lignin (Zhang et al., 2021). I. fonsfrigidae NS-1 T is a moderate halophile, which will readily grow in 1.25-15.0% NaCl.
Prior to reporting the discovery of I. fonsfrigidae NS-1 T , our laboratory published the complete genome of an isolate originally designated as Halocella sp. SP3-1 (Heng et al., 2019), which we identify and rename in the present study as Iocasia fonsfrigidae strain SP3-1. The strain SP3-1 was isolated from the soil of a salt evaporation pond (13 • 28 37.55 N; 100 • 7 8.27 E) in the Samut Sakhon province of Thailand (Heng et al., 2019). SP3-1 readily grows on cellulose, hemicellulose, or starch under higher salt content (5-30% NaCl) than strain NS-1 T . The complete genome sequences for I. fonsfrigidae strains SP3-1 and NS-1 T were analyzed. The average nucleotide identity (ANI) between the two strains is 97.64% (Zhang et al., 2021), supporting the ''same species'' determination based on a suggested ANI threshold of 95-96% (Richter & Rosselló-Móra, 2009). Although, we conclude that the SP3-1 isolate is a strain of I. fonsfrigidae, genetic, physiological, and biochemical properties between SP3-1 and NS-1 T vary. For example, NS-1 T encodes several genes not found in SP3-1, including genes encoding for proteins related to carbohydrate metabolism, ABC transporters, PTS sugar transporters, type II secretion systems, type I-B and type III-B CRISPR associated proteins, and clusters of proteins related to ethanolamine and propanediol metabolism (Zhang et al., 2021). Conversely, strain SP3-1 exhibits higher salt tolerance and encodes genes not found in strain NS-1 T . These include genes that code for endo-β-1,4-galactanase, xylan-α-1,2-glucuronosidase, β-xylosidase, α-Larabinofuranosidase, and β-L-arabinofuranosidase (i.e., hemicellulolytic enzymes) and oligo-α-1,6-glucosidase (amylolytic enzymes). The unique CHA enzymes constituency of the strain SP3-1 proteome motivate this study. Specifically, we distinguish strains SP3-1 and NS-1 T based on phylogenetics, physiology, and biochemistry with particular interest in differentiating the respective proteomes based on enzymes content. Our analysis demonstrates that I. fonsfrigidae strain SP3-1 expresses a suite of CHA enzymes with potential for optimal functionality under high salinity conditions.

Bacterial strains and media
Strain SP3-1 was isolated from the soil sample (Heng et al., 2019), whereas Halocella cellulosilytica DSM 7362 T was purchased from the Leibniz Institute, DSMZ-German Collection of Microorganisms and Cell Cultures GmbH. Strain SP3-1 was deposited at Thailand Institute of Scientific and Technological Research Culture Collection (TISTR) and Korean Collection for Type Cultures (KCTC) under accession numbers TISTR 2992 and KCTC 25333, respectively. Strain SP3-1 was cultured in the basal medium, pH 8.0 (BM) composed of (per liter): 200 g NaCl, 1.5 g KH 2 PO 4 , 2.9 g K 2 HPO 4 , 2.1 g urea, 4.5 g yeast extract, 0.5 g cysteine-HCl, 0.001 g resazurin, and 200 µL mineral solution (25.0 g/L MgCl 2 . 6H 2 O, 37.5 g/L CaCl 2 . 2H 2 O and 0.3 g/L FeSO 4 . 6H 2 O). For shorter term storage, strain SP3-1 was preserved at −20 • C in liquid media with 25% of glycerol. Two methods were used for longer term storage: storages at −80 • C in liquid media with 25% of glycerol and via lyophilization. Strain H. cellulosilytica DSM7362 T was cultured in DSMZ medium 702 (pH 7.0) (https://www.dsmz.de/microorganisms/medium/pdf/DSMZ_Medium702.pdf). Both media were anaerobically prepared in bottles sealed with butyl rubber stoppers under an atmosphere of high-purity N 2 and sterilized by autoclaving at 121 • C for 15 min. Phosphoric acid swollen cellulose (PASC), prepared from Avicel PH-101, as previously described by Zhang et al. (2006), xylan, and starch were used as the sole carbon source to observe the ability of bacteria to degrade cellulose, hemicellulose (i.e., xylan), and starch, respectively.

16S rRNA gene analysis and phylogenetic tree
Genomic DNA was extracted as previously described by Heng et al. (2019). The 16S rRNA gene was amplified via PCR with the universal primers 8F (5 -AGAGTTTGATCCTGGCTCAG-3 ) and 1492R (5 -GGTTACCTTGTTACGACTT-3 ) (Chimtong et al., 2014). The PCR amplification was performed with 1 µL of a DNA template, followed by 5 µL of 10X Ex-Taq buffer, 1 µL of 10 mM dNTPs, 1 µL of each 10 µM primer, and 0.25 µL of Ex-Taq DNA polymerase (Promega Corp., Madison, WI, USA). PCR conditions consisted of an initial denaturation step at 95 • C for 30 s, followed by 30 cycles at 95 • C for 30 s, annealing at 68 • C for 30 s, and extension at 68 • C for 1 min. The final extension step was 10 min at 68 • C. PCR product was purified by a QIAquick PCR purification kit (QIAGEN, Hilden, Germany). The nearly complete 16S rRNA gene sequence was compiled with the BioEdit software (Hall, Biosciences & Carlsbad, 2011). The 16S rRNA gene sequence of strain SP3-1 was compared to taxa which were retrieved through BLAST (Altschul et al., 1997) and EzBioCloud databases (Yoon et al., 2017). The 16S rRNA sequence of strain SP3-1 and correlated taxa were aligned by via CLUTAL_X in MEGA software version 11 and phylogenetic trees were constructed using the neighbor-joining (NJ) method (Tamura, Stecher & Kumar, 2021). Confidence values for phylogenetic tree branches were determined via bootstrap analyses on 1,000 replicates (Rzhetsky & Nei, 1992).

Chemotaxonomic analysis
The cell wall peptidoglycan was determined as described by Komagata & Suzuki (1987). Cellular fatty acids were extracted, methylated, and analyzed using the standard microbial identification system protocol (Sherlock Microbial Identification System, version 6.1), whereas fatty acids were identified using the TSBA6 database of the microbial identification system (Sasser, 1990). Polar lipids were analyzed from freeze-dried cells by two-dimensional thin-layer chromatography (TLC), as described by Minnikin et al. (1984). Appropriate detection reagents were used for visualizing TLC bands: phosphomolybdic acid reagent 5% (w/v) solution in ethanol (Sigma-Aldrich, Saint Louis, MO, USA) was used to detect total polar lipids; ninhydrin reagent (0.2% solution; Sigma-Aldrich Saint Louis, MO, USA) was used to detect amino lipids; the Dittmer and Lester reagent (molybdenum blue, 1.3%; Sigma-Aldrich Saint Louis, MO, USA) was used to detect phospholipids; and, Dragendorff's reagent (Sigma-Aldrich Saint Louis, MO, USA) was used to detect phosphatidylcholine.

Cultivation and enzyme production
Strain SP3-1 was cultivated in 1 L of BM containing 0.5% (w/v) PASC for 3 days at 37 • C, pH 8.0 under static conditions in an anaerobic chamber (Bactron II, USA). The culture supernatant was collected by centrifugation at 10,000×g for 15 min at 4 • C. Culture supernatant was subsequently concentrated using a hollow fiber cartridge with a 10 kDa cutoff membrane (GE Healthcare, USA). The retentate (approximately 40-times concentrated) was then used as the crude enzyme.

Enzyme assays and protein determination of crude enzyme of I. fonsfrigidae strain SP3-1
Enzyme activity was determined using 50 µL of enzyme (containing 250 µg protein) mixed with 50 µL of substrate in a 50 mM sodium phosphate buffer (pH 7.0) incubated at 50 • C for 15 min. Enzymatic activity on 1% (w/v) PASC, birchwood xylan, or soluble starch were assayed by determining the amount of reducing sugar by the DNS method (Hu et al., 2008). One unit (U) of enzyme activity is defined as the amount of enzyme releasing 1 µmol of reducing sugar in 1 min. The Lowry method was used for measurement of the protein concentration and using bovine serum albumin as a standard (Lowry et al., 1951).

Library preparation and genome sequencing of I. fonsfrigidae strain SP3-1
Genome sequencing and library preparation were performed as described in Heng et al. (2019). Briefly, strain SP3-1 was cultured at 37 • C in BM containing 1% (w/v) cellobiose and 20% (w/v) NaCl until late exponential growth phase under anaerobic conditions. Cell culture was collected and used for genomic DNA extraction via the DNeasy blood and tissue kit (Qiagen, Hilden, Germany). Subsequently, the SMRTbell template prep kit 1.0 (Pacific Biosciences, Menlo Park, CA, USA) was used to construct sequencing libraries. Polymerase reads were trimmed using high-quality regions, with a minimum subread length (500 bp), a minimum polymerase read quality (0.80), and a minimum polymerase read length (100 bp).
Comparative genome analysis with comparison of CHA enzyme and carbohydrate-binding module (CBM) genes between I. fonsfrigidae strain SP3-1 and I. fonsfrigidae NS-1 T The I. fonsfrigidae SP3-1 genome (NCBI GenBank accession number CP032760) was compared with closely related strain I. fonsfrigidae NS-1 T (NCBI GenBank accession number CP046640). The polysaccharide-degrading enzyme-related genes of strain SP3-1 were grouped into CHA enzyme-encoding and CBM-encoding families using HMMER hmmsearch with Pfam_Is HMMs (full-length models) to identify complete matches to the family, which were named per the CAZy nomenclature scheme (Cantarel et al., 2009). All hits with E-values below 10 −4 were calculated and their sequences were further analyzed. For CHA enzymes and CBM families, which currently do not have the Pfam HMM, representative sequences were selected from the CAZy website per Warnecke et al. (2007). In this case, BLAST (http://www.ncbi.nlm.nih.gov/BLAST/) was used to identify these CHA enzymes, CBM families, and percent nucleotide sequence identity of all genes.
Analysis of 16S rRNA genes from isolates SP3-1, NS-1 T , and other species in the family Halanaerobiaceae indicate that isolate SP3-1 may be the same species as strain NS-1 T based on a 98.65% threshold for species determination (Kim et al., 2014). This agrees with a recent report noting isolates SP3-1 and NS-1 T as geographic variants of the same species (Zhang et al., 2021). Although 16S rRNA is used to determine relationships between genera, 16S analysis alone is not sufficient for making a same species call in microbiology (Lee et al., 1998).

Comparative genome analysis of I. fonsfrigidae NS-1 T and isolate SP3-1 suggests they are geographic variants of the same species
Whole genome sequences were used to examine relatedness between isolates SP3-1 and NS-1 T . Whole genomic sequence analysis shows 97.64% of ANI, which corresponds to a previously reported value (Zhang et al., 2021). DNA-DNA hybridization (DDH) was also conducted to further explore the relatedness between isolate SP3-1 and NS-1 T . The in-silico DDH values between NS-1 T and isolate SP3-1 (79.9%) exceeded the accepted threshold of 70% for a distinct species call. Based on sequence similarity (Kim et al., 2014) and DDH (Qin et al., 2014), our results also indicate that NS-1 T and SP3-1 are the same species. Further data support these results. Specifically, I. fonsfrigidae NS-1 T was reported to have a 3,926,493 base pair (bp) genome and a total G+C content of 35.72 mol% (Zhang et al., 2021). A complete, gapless, circular genome assembly was generated for isolate SP3-1 yielding a 4,035,760 bp genome (Fig. S1, Table S1) with a total G+C content of 35.1 mol% (Heng et al., 2019). Predictions from annotated genome assemblies suggest that strain NS-1 T encodes 3,671 proteins and SP3-1 encodes 3,729 proteins. Results from an extensive genome comparison between isolates SP3-1 and NS-1 T reveal nucleotide sequence identities between homologous genes at: 20-30% for 200 genes; 40-50% for 122 genes; 60-70% for 76 genes; 80-90% for 2,477 genes; and 100% for 470 genes (see Table 1 and Table S2). No plasmids were detected and the origin of duplication in isolate SP3-1 was determined based on GC skew analysis. Despite the determination that isolate SP3-1 and strain NS-1 T both fall under the taxon I. fonsfrigidae, significant differences exist beyond their genomic profiles. Cell morphology, physiology, and biochemistry reveal differences between I. fonsfrigidae NS-1 T and isolate SP3-1 suggesting that they are distinct strains of the same species On solid media, isolate SP3-1 forms small, white colonies with a well-bounded smooth surface. SEM of SP3-1 reveals rod-shaped cells approximately 0.4 µm dia. × 1.3 µm in length (Fig. 2). In contrast electron microscopy of strain NS-1 T reveals longer rod-shape cells approximately 0.2-0.3 µm dia. × 6.0-10.0 µm in length. Strain NS-1 T also exhibits multiple, long (5-10 µm) flagella extending from one end (i.e., unipolar) of the major axis of the cell (Zhang et al., 2021). Both I. fonsfrigidae NS-1 T and SP3-1 are Gram-negative and do not form non-endospores. When cultured in BM with 1% (w/v) cellobiose as a carbon source but with different conditions of pH (5.0-10.0) and temperature (25-70 • C), isolate SP3-1 exhibits a wider range of viable growth compared to strain NS-1 T (see Table 2). Although optimal growth of isolate SP3-1 occurs at pH 8.0, 37 • C, it readily grows at pH 10.0, confirming that it is a mesophilic alkaliphile. Beyond growth optima, the viable temperature range for isolate SP3-1 is 25-55 • C and the viable pH range is 5.5-10.0 (Table  2). Comparatively, I. fonsfrigidae NS-1 T grows between 20-45 • C at pH values between 6.5-8.0 with optima at 37 • C and pH 7.0 (Zhang et al., 2021). Moreover, SP3-1 grows on all of the same carbon sources as strain NS-1 T , except D-rhamnose and alginate, and Table 1 All genes indicated high similarity and coverage between strain SP3-1 and NS-1 T . The percentage identity of genes>70% indicated high similarity and high coverage. lignin. Suitable carbon sources include: L-arabinose, D-fructose, D-galactose, D-glucose, D-mannose, D-raffinose, D-xylose, cellobiose, lactose, maltose, sucrose, xylan, and starch (Table 2). We note that strain SP3-1 also readily grows on PASC.
Despite genetic similarity indicating that I. fonsfrigidae NS-1 T and isolate SP3-1 are the same species, the numerous macromorphological, physiological, and biochemical differences suggest that isolate SP3-1 is not merely a geographic variant but indeed a distinct strain of I. fonsfrigidae.
These cellular fatty acids and polar lipid profiles support the suggestion that strain SP3-1 is the same species as NS-1 T while highlighting major biochemical differences between these two different strains. Significantly different polar lipid profiles between members of related (or the same) taxa often indicated adaptation to different niches and thus different functionality (Dlugosch et al., 2022).

Functional genomics analysis of I. fonsfrigidae strain SP3-1 indicates a complex carbohydrate metabolism
To elucidate differences in metabolic potential between I. fonsfrigidae NS-1 T and strain SP3-1 fully annotated genomes were analyzed to detect clusters of orthologous genes that drive distinct metabolic functions. From the ∼4.0 Mbp genome, 4,044 genes are predicted with 3,875 protein-coding sequences (CDS), 12 rRNA sequences, and 59 tRNA sequences (Table S1). In comparison, for the ∼3.9 Mbp genome of I. fonsfrigidae NS-1 T 3,774 genes were predicted with 3,671 protein-coding sequences (CDS), 12 rRNA, and 58 tRNA sequences (Zhang et al., 2021). Notably, the NCBI Prokaryotic Genome Annotation Pipeline (version 4.6) was also employed and predicted 3,885 total genes for strain SP3-1 with 3,729 CDS (see Table S1). These differences are due to the distinct algorithms used in the in-house versus the NCBI pipeline. For this study, we used values from the in-house pipeline (i.e., 3,875 CDS) for further analyses.

I. fonsfrigidae strain SP3-1 encodes for and produces CHA enzymes with a notable range of cellulolytic, hemicellulolytic, and amylolytic activities
Both the range of carbon sources promoting viable growth of strain SP3-1 as well as the detection of known (and expected) carbohydrate metabolism-related COGs identified in the strain SP3-1 genome justify further examination of the CHA enzymes potential of strain SP3-1. To gauge the extent to which strain SP3-1 serves as a suitable prospect for unique carbohydrate deconstruction enzyme discovery, more detailed analyses of genes (and proteins) involved in carbohydrate metabolism was conducted. Because strain SP3-1 grows rapidly on PASC (as well as xylan and starch), a quantitative comparison was performed between strain SP3-1 and an available relative, H. cellulosilytica (DSM 7362 T ).
H. cellulosilytica DSM 7362 T grows slowly on PASC and starch. It does not grow on xylan.
To determine the fecundity of I. fonsfrigidae strain SP3-1 was cultivated in the BM containing PASC (0.5% w/v) as a carbon source at pH 8.0 and 37 • C. Culture supernatant was harvested after 3 days (at late exponential growth phase) and concentrated. The crude supernatant concentrates exhibit cellulase activity of 5.86 U/g protein on PASC. This CHA gene expression appears to be inducible. In addition, xylanase (4.43 U/g protein) and amylase (4.71 U/g protein) activities were also detected (Table 4). These are likely constitutive enzymes involved in the breakdown of xylan and starch, respectively. Thus, supernatant concentrate derived from strain SP3-1 cultures contains CHA enzymes which readily degrade PASC, xylan, and starch, which are present in the starch-based biomass.

DISCUSSION
Starch-based biomass is a by-product of agro-industrial operations worldwide with millions of tons produced annually (Ceballos, 2017). This biomass consists predominantly of polysaccharides such as cellulose, hemicellulose, and starch-all of which can be used as low-price raw materials to produce secondary products. These substrates can be degraded by enzymes. Enzymatic deconstruction of these polysaccharides can be an ''environmentfriendly'' approach to process feedstock by reducing the use of hazardous chemicals (e.g., strong acids and bases) for biomass conversion (Ceballos et al., 2015;Ceballos, 2017). Specifically, cellulose, hemicellulose, and starch are hydrolyzed by CHA enzymes to yield simple sugars (i.e., monosaccharides and oligosaccharides). Thus, this type of biomass may be used as a renewable resource to produce high-value-added products (Cheawchanlertfa et al., 2021) as long as effective, low-cost enzymes and enzyme technologies are available (Ceballos et al., 2014). Bioprospecting for enzymes capable of efficiently degrading starchbased biomass is an ongoing scientific endeavor and extremophiles are one focus for novel enzyme discovery. Recently, a bacterium strain NS-1 T was isolated from deep-sea cold seeps in the South China Sea. The strain NS-1 T was reported to have a high similarity to H. cellulosilytica DSM 7362 T . Ultimately, this halophilic isolate NS-1 T showed phylogenetic, genomic, and physiological traits unique enough to establish a novel genus within the family Halanaerobiaceae. Thus, isolate NS-1 T became the archetype of I. fonsfrigidae. I. fonsfrigidae NS-1 T exhibited the ability to metabolize a diverse array of carbohydrates (Zhang et al., 2021). In this study, the archetype I. fonsfrigidae NS-1 T (Zhang et al., 2021) and related species were contrasted with the more halophilic I. fonsfrigidae strain SP3-1, which was initially designated Halocella sp. SP3-1 (Heng et al., 2019).

Morphological differences between I. fonsfrigidae NS-1 T and strain SP3-1 underscore divergent adaptations in locomotive strategies
Despite the fact that our data support a recent report identifying strain SP3-1 as the same species as I. fonsfrigidae NS-1 T based on accepted thresholds for genetic similarity (Zhang et al., 2021), we note remarkable differences between strain NS-1 T and strain SP3-1 in terms of morphology, physiology, and biochemistry. In terms of morphological differences, the most notable feature of strain SP3-1 (when compared to strain NS-1 T ) is the absence of developed and functional flagella. Although the genome of strain SP3-1 contains many genes related to formation of the basal body of flagella (i.e., fliE, fliF, fliG, fliH, fliI, fliJ, fliK, fliL, fliM, fliN, fliO, flip, fliQ, fliR, and flhB) and hook proteins (i.e., flgA, flgB, flgC, flgD, flgF, flgG, flgH, and flgI ), it is missing flhA and flgJ, which are key genes for basal body and hook formation, respectively (see Table 5 and Table S6). Importantly, the absence or interruption of the flhA gene in Gram-negative bacteria leads to nonmotile cells, which lack flagella and are incapable of exporting flagella-related proteins (Bange et al., 2010). The absence of the flgJ gene prevents proper assembly of the hook-filament junction in flagella. The flgJ gene product is also critical for stabilizing protein-protein interactions between basal structures (e.g., L-ring formation) in flagella (Cohen & Hughes, 2014).

Physiological enhancements demonstrate the ability of strain SP3-1 to survive and thrive in a broader range of environments than strain NS-1 T and other related species
Beyond the ability of strain SP3-1 to robustly grow at higher salt concentration and higher pH (when compared to NS-1 T and other members of the family Halanaerobiaceae), strain SP3-1 grows on PASC, which makes it a viable candidate for bioprospecting carbohydratedegrading enzymes. The ability of strain SP3-1 to grow in alkaline environments while utilizing an acid-treated substrate as a carbon source underscores the strain's tolerance to pH as well as its halophilic nature. This physiological profile is supported by a set of salt stress and high pH tolerance genes identified within the strain SP3-1 genome. For example, the strain SP3-1 genome includes a molybdate ABC transporter substrate-binding gene (i.e., modA; gene loci: AZO94203.1, AZO94204.1) ( Table 5). The marine bacterium Staphylococcus sp. strain P-TSB-70, which readily grows in saline media with up to 20% NaCl (Das et al., 2020), is also endowed with a similar molybdate ABC transporter. Strain   SP3-1 also contains three genes encoding a glycine/sarcosine/betaine reductase complex (gene loci: AZO94733.1, AZO94734.1, and AZO94735.1) that are absent in strain NS-1 T ( Table 5). The glycine/sarcosine/betaine reductase complex includes selenoprotein A. Selenoprotein A is involved in betaine utilization as reported by Manzoor et al. (2015), which demonstrates the growth of selenoprotein A-expressing Syntrophaceticus schinkii strain Sp3 growth on betaine. This is notable since accumulation and utilization of betaine from culture media is essential in high salinity conditions as reported by Nyyssölä (2001) in the study of the extreme halophile Actinopolyspora halophila. Na + /H + antiporters are also reported to play an essential role in allowing halophilic bacteria to thrive in high salinity environments (Das et al., 2020;Su et al., 2021). Strains SP3-1 and NS-1 T both harbor Na + /H + antiporter genes; however, strain SP3-1 possesses a larger suite of such genes (gene loci: AZO93678.1, AZO93681.1, AZO94987.1, AZO95163.1, AZO96039.1, AZO94275.1, and AZO96817.1) than NS-1 T (see Table 5).
In addition to facilitating salt tolerance, Na + /H + antiporters (as well as ABC transporters) are known to be expressed in alkaliphilic halophiles. This has been demonstrated in several species including B. halotolerans KKD1 (Cheng et al., 2016), Bacillus firmus OF4 (Ito et al., 1997), and Bacillus sp. G1 (Liew et al., 2007). The fact that strain SP3-1 is endowed with a more diverse repertoire of genes related to halophilic and alkaliphilic growth suggests a link between genetics and phenotype that permit strain SP3-1 to tolerate higher salt concentrations and higher pH when compared to strain NS-1 T .

Genomic analyses highlight genetic substrates that underlie differences carbohydrate metabolism between I. fonsfrigidae strains SP3-1 and NS-1 T
Results from carbon utilization studies provided an initial indication that carbohydrate metabolism, biochemistry, and perhaps genetics, are different between I. fonsfrigidae strain SP3-1 and NS-1 T . For example, strain NS-1 T grown on glucose yields detectable lactate in the culture supernatant. However, strain SP3-1 does not indicating a difference in sugar metabolism between the strains. Lactate dehydrogenase (LDH) is a key enzyme in the last step of glycolysis that plays a key role in pyruvate-to-lactate reactions (Hamadneh et al., 2021). Although both strains SP3-1 and NS-1 T contain LDH genes (gene loci at AZO94114.1 and QTL97032.1, respectively), strain SP3-1 has 15 genes encoding GntR family transcriptional regulators, including an HTH-type transcriptional regulator gene lutR. In contrast, strain NS-1 T only contains seven GntR genes (see Table 5). The more extensive repertoire of GntR genes may underlie the ability of strain SP3-1 to metabolize lactate. For example, it is known that operons associated with lactate metabolism are controlled by the GntR family (Augustiniene & Malys, 2022). Specifically, HTH-type transcriptional regulators (lutR genes) have been shown to regulate genes involved in lactate utilization (Wang et al., 2019). Thus, strain NS-1 T with fewer GntR genes may exhibit reduced efficiency in metabolizing lactate, which SP3-1 readily metabolizes this by-product of glucose utilization.
It is notable that SP3-1 can also uptake raffinose as a sole carbon source. This is not surprising since the SP3-1 genome encodes an α-galactosidase gene (locus: AZO94804.1), which is absent in the genome of strain NS-1 T (see Table 5). It is known that α-galactosidase degrades raffinose. This was demonstrated in both Pseudobalsamia microspore (Yang et al., 2015) and Saccharomyces cerevisiae (Álvarez Cao et al., 2019).
Conversely, strain NS-1 T encodes an ATP-binding cassette domain-containing protein rhamnose transport system gene (i.e., rhaT), which is lacking in SP3-1. This may underlie the inability of strain SP3-1 to use rhamnose as a carbon source (see Richardson, Hynes & Oresnik, 2004).

Biochemical (and metabolic) properties highlight the CHA enzymes potential of I. fonsfrigidae strain SP3-1
Differences in carbohydrate metabolism extend well beyond simple absence/presence of genes for utilization of simple sugars as carbon sources. I. fonsfrigidae strain SP3-1 also stands out for its suite of genes encoding CHA enzymes. Not only does the strain SP3-1 genome encode more CHA enzymes than NS-1 T , but the range of exo-acting, endo-acting, and side chain-acting enzymes are more expansive. The complement of CHA enzymes in strain SP3-1 suggests that its carbohydrate metabolism is more advanced than that of strain NS-1 T .
Since these classes of enzymes work in concert (and often synergistically) to degrade polysaccharides (Linares-Pasten, Andersson & Karlsson, 2014), the extensive repertoire of CHA enzymes found in the strain SP3-1 genome justifies exploration of this halophilic alkaliphile as a potential source for novel enzyme discovery. Our data on strain SP3-1 show a suite of endoglucanases, which promote cleavage at internal sites within cellulose molecular structures, as well as β-glucosidases, which act on short-chain oligosaccharides and cellobiose to produce glucose (Baramee et al., 2017). Genomic data for strain SP3-1 also show a gene encoding for CBM. CBM is reported to assist in the binding of enzymes to insoluble substrates to promote the efficient degradation of cellulosic substrates (Limsakul et al., 2021).
In addition to cellulolytic and hemicellulolytic enzymes, strain SP3-1 harbors a suite of amylolytic genes that include endo-acting, exo-acting and debranching amylases. Amylolytic enzymes catalyze the cleavage of α-D-1,4and α-D-1,6-glycosidic linkages of starch and related oligosaccharides producing short-chain oligosaccharides and glucose (Sidar et al., 2020). Oligosaccharides from starch are used as prebiotics to promote the growth of healthy gut microflora (Belorkar & Gupta, 2016). Furthermore, amylolytic enzymes can be used in starch liquefaction as well as in paper, food, pharmaceutical, and sugar production operations (Pervez et al., 2014).
Since CHA enzymes are the primary enzymes for the breakdown of polysaccharides in starch-based biomass, strain SP3-1 is an attractive and promising microorganism for the novel discovery of CHA enzymes and conversion of starch-based biomass into value-added products.

CONCLUSIONS
Halocella sp. SP3-1 was isolated from a high salt evaporation pond in Samut Sakhon, Thailand as described by Heng et al. (2019). The whole-genome sequence of strain SP3-1 was deposited at NCBI GenBank under the accession number CP032760. It was later found to be a species of a new taxon, Iocasia fonsfrigidae, which includes a characterized type strain: NS-1 T (Zhang et al., 2021). Strain SP3-1, which was isolated from a salt evaporation pond, readily grows in higher salt content of 30% NaCl and higher pH than the type strain NS-1 T . The halophilic and alkaliphilic nature of strain SP3-1 prompted a studied focused on its potential to metabolize simple and complex carbohydrates as well as the genetic/genomic substrates that underlie its phenotypic nature.
In this study, we have demonstrated that despite the same species determination for strain SP3-1 (Heng et al., 2019) and type strain NS-1 T (Zhang et al., 2021), I. fonsfrigidae strain SP3-1 is more halophilic and alkaliphilic that strain NS-1 T and that there are genetic differences that can account for phenotypic (e.g., morphological and physiological) differences between these two strains. Our analyses demonstrate that strain SP3-1 expresses and secretes a suite of CHA-enzymes which is distinct from that of strain NS-1 T . Given the adaptation of strain SP3-1, to higher salinity and higher pH environments, this strain serves as a suitable candidate for novel enzyme discovery. Although both strains of Iocasia fonsfrigidae are likely limited in their ability to degrade lignocellulosic substrates due to the absence of ligninolytic enzymes such as: lignin peroxidase, manganese peroxidase, versatile peroxidase, laccase, phenoloxidases, and auxiliary enzymes, which play a key role in the degradation of lignin (Biko, Bloom & Van Zyl, 2020), the prospect of discovering or engineering high salt-and high pH-tolerant enzymes from the strain SP3-1 proteome is promising and the subject of ongoing research.
• Sawannee Sutheeworapong conceived and designed the experiments, performed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.
• Verawat Champreda conceived and designed the experiments, performed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.
• Ayaka Uke conceived and designed the experiments, performed the experiments, authored or reviewed drafts of the article, and approved the final draft.
• Akihiko Kosugi conceived and designed the experiments, performed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.
• Patthra Pason conceived and designed the experiments, performed the experiments, authored or reviewed drafts of the article, and approved the final draft.
• Rattiya Waeonukul conceived and designed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.
• Ruben Michael Ceballos conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.
• Khanok Ratanakhanokchai conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.
• Chakrit Tachaapaikoon conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

DNA Deposition
The following information was supplied regarding the deposition of DNA sequences: The 16S rRNA gene sequence of Iocasia fonsfrigidae strain SP3-1 is available at GenBank: MW958225. The whole-genome sequence of Iocasia fonsfrigidae strain SP3-1 is also available at GenBank: CP032760.

Data Availability
The following information was supplied regarding data availability: The raw data are available in the Supplemental Files.

New Species Registration
The following information was supplied regarding the registration of a newly described species: The strain SP3-1 is deposited in Thailand and South Korea under deposit codes TISTR 2992 (Thailand Institute of Scientific and Technological Research) and KCTC 25333 (Korean Collection for Type Cultures), respectively.