Functional Analysis of H+-Pumping Membrane-Bound Pyrophosphatase, ADP-Glucose Synthase, and Pyruvate Phosphate Dikinase as Pyrophosphate Sources in Clostridium thermocellum

ABSTRACT The atypical glycolysis of Clostridium thermocellum is characterized by the use of pyrophosphate (PPi) as a phosphoryl donor for phosphofructokinase (Pfk) and pyruvate phosphate dikinase (Ppdk) reactions. Previously, biosynthetic PPi was calculated to be stoichiometrically insufficient to drive glycolysis. This study investigates the role of a H+-pumping membrane-bound pyrophosphatase, glycogen cycling, a predicted Ppdk–malate shunt cycle, and acetate cycling in generating PPi. Knockout studies and enzyme assays confirmed that clo1313_0823 encodes a membrane-bound pyrophosphatase. Additionally, clo1313_0717-0718 was confirmed to encode ADP-glucose synthase by knockouts, glycogen measurements in C. thermocellum, and heterologous expression in Escherichia coli. Unexpectedly, individually targeted gene deletions of the four putative PPi sources did not have a significant phenotypic effect. Although combinatorial deletion of all four putative PPi sources reduced the growth rate by 22% (0.30 ± 0.01 h−1) and the biomass yield by 38% (0.18 ± 0.00 gbiomass gsubstrate−1), this change was much smaller than what would be expected for stoichiometrically essential PPi-supplying mechanisms. Growth-arrested cells of the quadruple knockout readily fermented cellobiose, indicating that the unknown PPi-supplying mechanisms are independent of biosynthesis. An alternative hypothesis that ATP-dependent Pfk activity circumvents a need for PPi altogether was falsified by enzyme assays, heterologous expression of candidate genes, and whole-genome sequencing. As a secondary outcome, enzymatic assays confirmed functional annotation of clo1313_1832 as ATP- and GTP-dependent fructokinase. These results indicate that the four investigated PPi sources individually and combined play no significant PPi-supplying role, and the true source(s) of PPi, or alternative phosphorylating mechanisms, that drive(s) glycolysis in C. thermocellum remain(s) elusive. IMPORTANCE Increased understanding of the central metabolism of C. thermocellum is important from a fundamental as well as from a sustainability and industrial perspective. In addition to showing that H+-pumping membrane-bound PPase, glycogen cycling, a Ppdk–malate shunt cycle, and acetate cycling are not significant sources of PPi supply, this study adds functional annotation of four genes and availability of an updated PPi stoichiometry from biosynthesis to the scientific domain. Together, this aids future metabolic engineering attempts aimed to improve C. thermocellum as a cell factory for sustainable and efficient production of ethanol from lignocellulosic material through consolidated bioprocessing with minimal pretreatment. Getting closer to elucidating the elusive source of PPi, or alternative phosphorylating mechanisms, for the atypical glycolysis is itself of fundamental importance. Additionally, the findings of this study directly contribute to investigations into trade-offs between thermodynamic driving force versus energy yield of PPi- and ATP-dependent glycolysis.

they can conserve the energy stored in the phosphoanhydride bond by recycling the biosynthetically generated PP i in glycolysis, which hypothetically results in a net increase of the ATP yield compared to a conventional ATP-dependent glycolysis (14,16,17) (equation 2). However, subsequent model-based calculations estimated that the amount of PP i generated from biosynthetic reactions is far from enough to satisfy the PP i requirement in glycolysis (9). Hence, additional mechanisms to generate PP i must be operative to be consistent with the current understanding of glycolysis in C. thermocellum.
Two possible mechanisms that have been postulated include the use of a H 1 -pumping membrane-bound pyrophosphatase and the use of glycogen cycling in which glycogen is simultaneously formed and degraded (9,18) (Fig. 1). In Rhodospirillum rubrum and Arabidopsis thaliana, a H 1 -pumping membrane-bound PPase couples the energy stored in the proton gradient to formation of PP i from 2 P i (19)(20)(21). If, in C. thermocellum, this reaction indeed operates in the reverse PP i -generating direction, 1 mol ATP might drive synthesis of 2 mol PP i based on FIG 1 PP i -dependent glycolysis of Clostridium thermocellum. PP i is consumed by PP i -dependent phosphofructokinase (PP i -Pfk) and pyruvate phosphate dikinase (Ppdk) (indicated in red) and may be formed by the proton-pumping membrane-bound pyrophosphatase (PPase) and glycogen cycling (indicated in blue), with ADP-glucose synthase (Ags) as the key enzyme. The H 1 /PP i stoichiometry of the PPase is based on previous estimations (2). Solid arrows represent a single reaction; dashed arrows represent lumped reactions. Abbreviations: ADPG, ADP-glucose; CB, cellobiose; F6P, fructose-6-phosphate; FBP, fructose-1,6-bisphosphate; G1P, glucose-1-phosphate; G6P, glucose-6-phosphate; Glu, glucose; Glyc, glycogen; Mal, malate; OAA, oxaloacetate; PEP, phosphoenolpyruvate; PYR, pyruvate. previous estimations of the H 1 /PP i stoichiometry of the PPase and the ATP/H 1 stoichiometry of the ATPase (2). This would result in a net ATP yield of 3.5 ATP per mol glucose equivalent (equation 2). Although a previous study did not observe a phenotype upon knockout of a gene putatively encoding the membrane-bound PPase, it was not shown whether membrane-bound PPase activity was indeed eliminated (18). A second proposed mechanism, glycogen cycling, starts with the key enzyme ADP-glucose synthase catalyzing the reaction from glucose-1-P and ATP to ADP-glucose and PP i (Fig. 1). Glycogen is subsequently formed from ADP-glucose and degraded again to glucose-1-P. Overall, this cycle would form one PP i and ADP at the expense of one ATP and P i . If this mechanism would be solely responsible for PP i supply, the net ATP yield will be 2 mol ATP per mol glucose equivalent (equation 2). Glycogen cycling has been observed before in anaerobic cellulolytic bacteria harboring PP i -dependent glycolysis (22)(23)(24); however, the role of such cycling in supplying PP i has not yet been experimentally verified.
The aim of the present study is to investigate possible PP i -generating mechanisms in the central metabolic pathways of C. thermocellum. To this end, single knockout strains of two previously suggested and two newly identified candidate PP i -supplying routes were constructed, and the impact of the deletions was quantitatively analyzed in batch serum bottle cultures. Functional annotation of targeted genes and confirmation of their deletion was done through enzyme assays or glycogen measurements. To assess possible complementarity of the mechanisms, double, triple, and quadruple knockout strains were constructed and analyzed. Whole-genome sequencing was performed to identify possible secondary mutations. Finally, growth arrest studies of the constructed strains were performed to evaluate the role of biosynthetically generated PP i in these strains.

RESULTS
Theoretical analysis of the pyrophosphate stoichiometry of biosynthesis. Previous studies on the glycolytic PP i requirement of cellobiose-grown C. thermocellum cultures used an Escherichia coli metabolic network model (25) to estimate the amount of PP i generated during biosynthesis (9,18). This model predicted that approximately 11 mmol PP i is produced per g biomass during biosynthesis, an amount that is not corrected for the anabolic PP i requirement that occurs in C. thermocellum when the carbon flux required for assembling the building blocks for biosynthesis involves the PP i -consuming Pfk and Ppdk reactions. Zhou et al. (9) partially accounted for this by including the assumption that all carbon used for biosynthesis passes through the Pfk reaction, which still resulted in a predicted net biosynthetic PP i formation of 3.6 mmol PP i per g biomass. However, as Ppdk activity was not yet detected, this prediction did not include the PP i consumption in lower glycolysis and therefore still overestimates the net biosynthetic PP i stoichiometry. Considering that the Ppdk reaction likely carries a large part of the PEP-to-pyruvate flux (67% according to Olson et al. [8]), it is possible that in some cases biomass formation actually requires a net input of PP i per gram of biomass. This directly implies that in those cases, energetic benefits of PP i recycling are limited to the carbon ending up in the biomass without additional gains in catabolism. Therefore, to get a more accurate prediction of the amount of PP i required or produced during biosynthesis and the possible energetic benefits of this, a stoichiometric analysis was performed. As is common for existing stoichiometric C. thermocellum models (26)(27)(28), the cellular composition of the Gram-positive bacterium Bacillus subtilis was used (Table 1 and Fig. 2) (29). Lumped reactions from precursor metabolites to the cell building blocks (i.e., amino acids, nucleotides, lipids, lipoteichoic acids, and cell wall components) were used, and for each precursor metabolite and cell building block a PP i stoichiometry was determined (see Tables S1 to S8 in the supplemental material). Since the PEP-to-pyruvate conversion can occur through both the PP i -consuming Ppdk reaction and the PP i -neutral malate shunt, a degree of freedom, x, was implemented that can vary between 0 (only the malate shunt is used) and 1 (only Ppdk is used).
The stoichiometric analysis showed that PP i is formed in polymerization reactions (for protein, DNA, and RNA synthesis) and in numerous biosynthetic reactions where NTP is converted to NMP and PP i (i.e., in arginine, asparagine, cysteine, histidine, methionine, and tryptophan synthesis but also in nucleotide, lipid, lipoteichoic acid, and cell wall synthesis). However, synthesis of many of the precursor metabolites needed to form these cell building blocks will cost PP i (Table 1 and Fig. 2). The overall PP i stoichiometry of biosynthesis indicated that biosynthesis would only have a net formation of PP i if the Ppdk reaction accounts for #15% of the PEP-to-pyruvate flux. This implies that for the previously reported PEP-to-pyruvate flux distribution determined for wild-type C. thermocellum (i.e., 67% through Ppdk; 33% through the malate shunt [8]), biosynthesis would actually require a net input of PP i .
When the anabolic and catabolic PP i requirements are combined (Fig. 3), for an experimentally determined biomass yield of 0.15 g cells per g cellobiose (30), it can be seen that even if the whole PEP-to-pyruvate flux passes through the malate shunt (i.e., x = 0), biosynthesis can only account for maximally 4.7% of the PP i required in catabolism. Therefore, the updated PP i stoichiometry of biosynthesis also implies that C. thermocellum requires a currently unknown (net) nonbiosynthetic PP i source to supply the remaining catabolic PP i needed to drive its glycolysis. This main conclusion is robust and independent of small changes in cellular composition resulting from, for instance, deviations from the commonly used B. subtilis composition or changes due to differing growth conditions.
Prediction of putative PP i sources using the optStoic procedure. In addition to the previously suggested sources of nonbiosynthetic PP i , i.e., the H 1 -pumping membrane-bound PPase and glycogen cycling (9), which will be investigated in this work, we also searched for alternative PP i -supplying pathways. In selecting candidate pathways, the following considerations were taken into account. First, a possible PP i -supplying pathway must be able to carry enough flux to drive glycolysis. Second, PP i needs to be generated directly or indirectly from ATP or GTP (9) with an overall stoichiometry of at least ATP eq 1 P i ! ADP eq 1 PP i (or energetically more beneficial). This precludes, for instance, direct hydrolysis of an ATP to AMP and PP i (ATP eq ! AMP eq 1 PP i ), since, after balancing by adenylate kinase, this leads to the consumption of two ATP eq per mol PP i (13), thereby making this mechanism incompatible with a high ATP-yielding glycolysis (equation 2).
To identify additional theoretical PP i -supplying pathways, we used the optStoic procedure (31) to search for the minimal set of reactions within the iCBI655 genome-scale metabolic model of C. thermocellum (26) that have an overall stoichiometry where one ATP and P i is converted to one ADP and PP i . Of the top 100 proposed PP i -supplying mechanisms, 76 did not contain the membrane-bound PPase or glycogen cycling as key mechanisms (File S1). Of these 76 cycles, eight model solutions involved acetate cycling using acetyl-coenzyme A (CoA) synthetase as a key PP i -supplying enzyme (Fig. 4). Next, two malate shunt cycles were predicted, which convert pyruvate to PEP via Ppdk, PEP to oxaloacetate via PEP carboxykinase, and oxaloacetate to pyruvate via pyruvate carboxylase. Both cycles differ only in the enzymes used to balance the nucleotide cofactors. As pyruvate carboxylase activity has not been reported for C. thermocellum, we modified this cycle to include PEP carboxykinase, malate dehydrogenase, malic enzyme, and Ppdk (Fig. 4), which are known to be present and active in C. thermocellum (8). Two cycles required 3-isopropenyl-6-oxoheptanoate:CoA ligase, which is a fatty acyl-CoA ligase involved in limonene and pinene degradation. Given that C. thermocellum is not grown in the presence of these monoterpenes, it is highly unlikely that these cycles are active in C. thermocellum; therefore, they are not included for further study. Furthermore, 64 possible PP i -supplying cycles that require enzymes involved in nucleotide synthesis (i.e., NTP pyrophosphorylase and UTP FIG 2 Theoretical pyrophosphate and carbon amounts (in mmol per g cells) needed for biomass formation from precursor metabolites for wild-type C. thermocellum grown on cellobiose. The PP i and carbon fluxes are determined from the stoichiometric analysis of biosynthesis using the macromolecular composition of aerobically grown glucose-limited chemostat cultures of B. subtilis (Table 1). The parameter x (in the red boxes) describes the flux distribution between the Ppdk reaction and the malate shunt (x = 1, the PEP-to-pyruvate conversion solely goes through Ppdk; x = 0, the PEP-topyruvate conversion solely goes through the malate shunt). Numbers in gray boxes represent the amounts of cellobiose or precursor metabolites needed for biomass formation in mmol g x 21 ; numbers in red boxes represent the PP i amounts produced or consumed per precursor metabolite needed for biomass formation in mmol g x 21 . Gray circles represent precursor metabolites; red circles represent PP i ; white circles represent nonprecursor metabolites. Solid arrows represent a single reaction; dashed arrows represent lumped reactions. Cyan arrows represent the malate shunt; the orange arrow represents the Ppdk reaction. For the PP i stoichiometry of E4P and R5P, the nonoxidative pentose-phosphate pathway as proposed by Koendjbiharie et al. (60) is used. Abbreviations: 3PG, 3-phosphoglycerate; AcCoA, acetyl-coenzyme A; AKG, a-ketoglutarate; E4P, erythrose-4-phosphate; F6P, fructose-6-phosphate; G6P, glucose-6-phosphate; GAP, glyceraldehyde-3phosphate; OAA, oxaloacetate; PEP, phosphoenolpyruvate; PYR, pyruvate; R5P, ribose-5-phosphate; SucCoA, succinyl-coenzyme A. pyrophosphohydrolase) were not investigated in the context of this study for two reasons. First, each PP i -supplying enzyme in these cycles is encoded by multiple gene candidates without an experimentally confirmed annotation. Identifying the correct gene candidate would be a time-consuming process given the current gene-editing techniques available for FIG 4 Two putative pyrophosphate-supplying pathways as identified with the help of the optStoic algorithm (31). (1) Ppdk-malate shunt cycle, where the malate shunt operates in the forward direction (PEP-to-pyruvate) and Ppdk in the reverse direction (pyruvate-to-PEP). (2) Acetate cycling, which occurs via the two-step conversion of acetyl-CoA to acetate, catalyzed by phosphotransacetylase (Pta) and acetate kinase (AcK), and the subsequent conversion of acetate to acetyl-CoA, catalyzed by acetyl-CoA synthetase (Acs). Both pathways result in formation of one PP i from one ATP equivalent. For the Ppdk-malate shunt cycle, a simultaneous transhydrogenation of NADH and NADP 1 to NAD 1 and NADPH also occurs. ATP, ADP, and AMP were balanced in the overall stoichiometry with the adenylate kinase reaction (2 ADP $ ATP 1 AMP) and the nucleosidediphosphate kinase reaction (ATP 1 NDP $ ADP 1 NTP). Abbreviations: AcCoA, acetyl-coenzyme A; Acetyl-P, acetyl-phosphate; CoA, coenzyme-A; MAL, malate; Mdh, malate dehydrogenase; Me, malic enzyme; OAA, oxaloacetate; PEP, phosphoenolpyruvate; Pepck, phosphoenolpyruvate carboxykinase; Ppdk, pyruvate phosphate dikinase; PYR, pyruvate.

FIG 3
Schematic representation of the estimated pyrophosphate amounts (in mmol per g cells) needed in anabolism and catabolism for wild-type C. thermocellum grown in cellobiose-limited chemostat cultures (at a dilution rate of 0.1 h 21 ). The PP i amounts depend on the parameter x, which describes the flux distribution between the Ppdk reaction and the malate shunt (x = 1, the PEP-to-pyruvate conversion solely goes through Ppdk; x = 0, the PEP-to-pyruvate conversion solely goes through the malate shunt). The amount of PP i and cellobiose (in mmol) needed for anabolism is based on the stoichiometric analysis of biosynthesis ( Fig. 2) using the macromolecular composition of aerobically grown glucose-limited chemostat cultures of B. subtilis ( Table 1). The total amount of cellobiose (19.48 mmol) needed per gram of cells is based on an observed biomass yield of 0.15 g cells per g cellobiose obtained for cellobiose-limited chemostat cultures of wild-type C. thermocellum (30). The amount of cellobiose needed for catabolism (11.80 mmol) is calculated by subtracting the anabolic requirement (3.59 mmol) and unaccounted carbon (21% of the total, i.e., 4.09 mmol) from the total amount of cellobiose. Red arrows represent PP i amounts in mmol g x

21
; black arrows represent cellobiose amounts in mmol g x

21
. Analysis of Predicted PP i Sources in C. thermocellum Applied and Environmental Microbiology C. thermocellum (32). Second, their involvement in nucleotide biosynthesis might complicate interpretation of the physiological impact of these knockouts (if viable at all). Hence, four candidates were targeted for their role as a possible PP i supplier: H 1 -pumping membranebound PPase, glycogen cycling, acetate cycling, and the Ppdk-Malate shunt cycle. Functional analysis of four possible individual sources of PP i in C. thermocellum. To investigate the contributions of the four proposed pathways (as described in the previous paragraph) in generating PP i , all genes predicted to encode the PP i -supplying enzymes in these pathways were individually deleted in wild-type strain LL1004, yielding strains AVM008 (Dclo1313_0823; encoding putative membrane-bound pyrophosphatase), AVM051 (DP clo1313_0717-0718 -clo1313_0717-0718; encoding putative ADP-glucose synthase, which consists of two subunits), AVM003 (Dclo1313_0949; Dppdk), and AVM059 (Dclo1313_1686; encoding putative acetyl-CoA synthetase).
To verify that the introduced genetic modifications resulted in a complete loss of the predicted activity and simultaneously functionally annotate the targeted genes, enzyme activities were measured in cell extracts of batch serum bottle cultures ( Table 2). In line with previous reports (8), deletion of clo1313_0949, encoding Ppdk, completely removed the Ppdk activity in AVM003 compared to LL1004 (Table 2). Furthermore, removal of clo1313_0823 resulted in a complete elimination of pyrophosphatase activity in AVM008 compared to LL1004 (Table 2), providing in vitro evidence that clo1313_0823 encodes a functional pyrophosphatase. Despite numerous assay optimization attempts (see Materials and Methods), ADP-glucose synthase activity could not be measured in cell extracts of the wild-type strain LL1004 (data not shown).
As an alternative to enzymatic activity measurements of ADP-glucose synthase, glycogen formation was measured during exponential growth of LL1004 and AVM051 (DP clo1313_0717-0718 -clo1313_0717-0718). In contrast to the wild-type strain LL1004, which formed 20% to 25% (wt/wt) glycogen during the exponential growth phase, biomass of the ADP-glucose synthase deletion strain (AVM051) did not contain glycogen (Fig. 5). Since ADP-glucose synthase activity is essential for a functioning glycogen cycle, this is a strong, albeit indirect, indication that removal of P clo1313_0717-0718 -clo1313_0717-0718 eliminated ADP-glucose synthase activity. For functional analysis of clo1313_0717 and clo1313_0718, encoding the two subunits of the C. thermocellum ADP-glucose synthase, both genes were simultaneously expressed from the high-copy-number pTrc99a plasmid in E. coli BL21. ADP-glucose synthase activity was readily detected in E. coli cell extracts expressing clo1313_0717 and clo1313_0718, whereas this activity was not detected in the empty vector control when assayed at 55°C (Table S9). Furthermore, the activity was found to be ADP-glucose dependent, as GDP-and UDP-glucose could not be used as the substrate. This indicated that the C. thermocellum genes clo1313_0717 and clo1313_0718 indeed encode an ADP-glucose synthase.
For a quantitative analysis of the impact of these three individual gene deletions, the specific growth rate, biomass yield, and fermentation product yields were determined in batch serum bottle cultures ( Fig. 6 and Table S10). Interestingly, single knockouts of the membrane-bound PPase (AVM008), ADP-glucose synthase (AVM051), and Ppdk (AVM003) did not have a major effect on the observed growth rates compared to LL1004 (Fig. 6). If one of these sources was indeed solely responsible for generating PP i , one would expect a lower growth rate in the knockout strains resulting from a decreased PP i flux and a consequently lower glycolytic flux, which was not observed for these mutants. Consistent with previous reports (18), the biomass yield in AVM008 (Dppase) was similar to that of the wild-type Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686 ,0.005 ,0.05 a Averages and standard deviations were obtained from two independent biological duplicates. The detection limit was 0.05 mmol mg protein 21 min 21 for the Ppdk assay and 0.005 mmol mg protein 21 min 21 for the PPase assay. b ND, not determined. strain (LL1004) (Fig. 6). Given that PP i supply via the membrane-bound PPase was hypothesized to have a PP i -to-ATP ratio higher than unity (2), we expected a decreased biomass yield in AVM008. The absence of this effect suggested that there is no or very low contribution of this mechanism to PP i supply in C. thermocellum. Knockouts of ADP-glucose synthase (AVM051) and Ppdk (AVM003) resulted in a 24% (P , 0.01) and 20% (P , 0.01) decrease, respectively, in biomass yield compared to LL1004 (Fig. 6). Considering that glycogen as a biomass component is energetically inexpensive to make compared to other cell building blocks (e.g., protein, DNA, RNA, etc.), removal of glycogen in AVM051 (DP clo1313_0717-0718 -clo1313_0717-0718; DP ags1,2 -ags1 ags2) was expected to make formation of new cells without glycogen more energetically costly, leading to a lower biomass yield. The slightly reduced biomass yield in AVM003 (Dppdk) was consistent with earlier reports (8). Finally, no major shifts in the measured fermentation product yields were observed for the three single knockout strains compared to LL1004 ( Fig. 6 and Table S10). These results indicate that neither the H 1 -pumping membrane-bound PPase, glycogen cycling, nor the Ppdk-Malate shunt cycle is essential for PP i generation. The fourth possible mechanism, acetate cycling, with acetyl-CoA synthetase (Acs) as the key enzyme (Fig. 4), was also investigated. As commonly used Acs assays measure combined activity of acetate kinase (Ack) and phosphotransacetylase (Pta) (33), acetyl-CoA synthetase activity was first measured in a pta knockout background (strain LL1041 [34]) (Table S11). Interestingly, no measurable acetyl-CoA synthetase activity was detected in these cell extracts. In agreement with this result, no difference in Acs-like enzyme activity was observed between strains LL1004 (wild type) and AVM059 (Dclo1313_1686), indicating that the measured activity for these strains was due to Pta-Ack activity (Table S11). The absence of measurable Acs activity likely cannot be ascribed to low gene expression or translation, as clo1313_1686 is highly expressed at the transcriptional level (6,35) and moderately expressed at the translational level (36). For further functional analysis, clo1313_1686 was expressed in E. coli BL21 from the pTrc99a plasmid with parallel expression of the native E. coli acs gene (b21_03901) as a control. SDS-PAGE analysis showed that clo1313_1686 was highly expressed in the BL21 pTK54 cell extract (Fig. S1). However, at 55°C, no Acs activity was observed for any of the E. coli cell extracts (Table S12). In contrast to the 3-fold higher Acs activity at 37°C when overexpressing the native E. coli acs gene (BL21 pTK55), activity at 37°C in E. coli expressing clo1313_1686 (BL21 pTK54) was similar to that of the empty vector control (BL21 pTrc99a) (Table S12). Based on these results, clo1313_1686 likely does not encode an acetyl-CoA synthetase, and the lack of Acs activity makes acetate cycling as a PP isupplying mechanism unlikely. In line with this, deletion of clo1313_1686 (AVM059) in C. thermocellum did not have a significant effect on the growth rate, biomass yield, or fermentation product yields compared to LL1004 ( Fig. 6 and Table S10). However, to fully exclude any possibility that this gene contributes to a PP i -supplying mechanism, the clo1313_1686 gene knockout was still included in subsequent combinatorial deletion studies.
Physiological impact of the combinatorial deletion of four possible PP i sources in C. thermocellum. A possible explanation for the observed lack of a phenotypic effect upon disrupting the single putative PP i -supplying pathways is that they can functionally complement each other. To test this hypothesis, double, triple, and quadruple knockout strains were created, resulting in strains AVM053 (Dppase DP ags1,2 -ags1 ags2), AVM052 (Dppdk DP ags1,2 -ags1 ags2), AVM060 (DP ags1,2 -ags1 ags2 Dclo1313_1686), AVM056 (Dppase DP ags1,2 -ags1 ags2 Dppdk), and AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686). Double knockouts of possible PP i -supplying pathways had no large detrimental effects on the growth rates, biomass yields, or fermentation product FIG 6 Maximum specific growth rates, biomass yields, and fermentation product yields of C. thermocellum wild-type and mutant strains in batch serum bottle cultures. Cultures were grown on LC medium containing 5 g L 21 cellobiose. Averages and standard deviations were obtained from three independent biological replicates. Absence of error bars indicates standard deviations were equal to zero. yields ( Fig. 6 and Table S10). For the double knockouts AVM053 (Dppase DP ags1,2 -ags1 ags2) and AVM052 (Dppdk DP ags1,2 -ags1 ags2), removal of ADP-glucose synthase lowered the biomass yield, with 20% (P , 0.01) and 15% (P , 0.01) compared to the single-knockout strains AVM008 (Dppase) and AVM003 (Dppdk), respectively. This decrease was in line with the decreased biomass yield observed for the single deletion of ADP-glucose synthase (AVM051). Interestingly, deletion of ppdk in AVM053 (Dppase DP ags1,2 -ags1 ags2), creating the triple-knockout strain AVM056, also did not result in a major phenotypic change ( Fig. 6 and Table S10). Finally, deletion of clo1313_1686 in the triple-knockout strain AVM056 (Dppase DP ags1,2 -ags1 ags2 Dppdk), resulting in AVM061, did lower the growth rate to 0.30 h 21 and the biomass yield to 0.18 g x g 21 , which was ca. 22% (P , 0.01) and 15% (P , 0.01) lower than that of AVM056. Furthermore, fermentation product yields were comparable between these strains ( Fig. 6 and Table S10). The lower growth rate and biomass yield observed upon deletion of clo1313_1686 in AVM056 are in contrast to the effect of this knockout observed in the wild type (LL1004) and AVM051 (DP ags1,2 -ags1 ags2) (Fig. 6). In agreement with the single-knockout strains, no enzyme activity of Ppdk and PPase or formation of glycogen was observed in AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) Table 2). Additionally, the combined Acs/Pta-Ack activity in strains AVM056 (Dppase DP ags1,2 -ags1 ags2 Dppdk) and AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) was comparable to that of strains LL1004 and AVM059 (Dclo1313_1686) (Table S11), which further confirms that it is unlikely that, in a strain with the other three mechanisms deleted, acetate cycling contributes to PP i supply. Compared to the wild-type strain LL1004, the quadruple deletion strain AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) showed a 22% (P , 0.01) and 38% (P , 0.01) decreased growth rate and biomass yield. The ethanol yield only slightly increased from 0.81 to 0.90 mol mol 21 (P , 0.05), and the acetate yield decreased from 1.10 to 0.96 mol mol 21 (P , 0.05) relative to LL1004 (Fig. 6). The small, combined effects of the four deletions in AVM061 were much smaller than what would be expected if one or more of these genes play a role in supply of the PP i needed to drive a PP i -dependent glycolysis.
Continued cellobiose fermentation in the quadruple knockout strain during growth arrest. Although our metabolic network analysis showed that biosynthesis cannot supply enough PP i to drive glycolysis ( Fig. 2 and 3 and Table 1), it might be that current knowledge and/or the assumptions used for the biosynthetic pathways are not correct. If biosynthesis can in fact supply enough PP i , which would explain the lack of a severe phenotypic effect in the quadruple knockout strain AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) compared to LL1004, one would expect that fermentation of cellobiose cannot occur in growth-arrested cells of AVM061.
To test this hypothesis, first cells of the wild-type strain (LL1004) were harvested, washed, and transferred to (i) unmodified LC medium, (ii) LC medium without urea as the sole nitrogen source, and (iii) LC medium without urea and cellobiose (sole carbon source). Similar to previous studies (9), we observed nitrogen depletion to be an effective method to arrest growth with continued fermentation (Fig. S2). However, after 24 h, fermentation significantly slowed down and even stopped, while about half of the initial cellobiose remained. As key glycolytic enzymes are regulated by intracellular NH 4 1 levels (37, 38), we hypothesized that this could be a confounding effect of the use of nitrogen depletion to control cell growth. Therefore, we considered other nutrients such as sulfur to achieve growth arrest.
The main sulfur sources in LC medium are Na 2 SO 4 and cysteine (39). Since cysteine is also used as a reducing agent, it was not completely removed, but instead cysteine levels were decreased 10-fold and Na 2 SO 4 was completely removed. For this method, cells of LL1004 and AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) were harvested, washed, and transferred to three different LC media: (i) unmodified LC medium, (ii) LC medium without Na 2 SO 4 and with only 0.01 g L 21 cysteine, and (iii) LC medium without cellobiose and Na 2 SO 4 but with 0.01 g L 21 cysteine. When transferred to unmodified LC medium, strains LL1004 and AVM061 both grew exponentially from the start, increasing the OD 600 from 0.61 to 3.02 and 0.88 to 2.50, respectively, and rapidly consuming all cellobiose (Fig. 7). Both strains produced roughly equimolar amounts of acetate (16.7 mM and 13.7 mM for LL1004 and AVM061, respectively) and ethanol (15.8 mM and 15.2 mM for LL1004 and AVM061, respectively) as main fermentation products. Transfer to sulfur-limited LC medium initially resulted in an OD 600 increase from 0.60 to 1.32 and 0.74 to 1.54 for LL1004 and AVM061, respectively (Fig. 7). In this initial period, the remaining sulfur sources (mainly cysteine and trace amounts from metal sulfates) were completely consumed. After this initial period, growth was arrested for both strains and cells continued to ferment cellobiose to acetate, ethanol, and lactate as main fermentation products. Lastly, cells transferred to the control medium without cellobiose and with limited sulfur sources produced only minor amounts of acetate (0.24 mM) in LL1004 and did not produce any fermentation products in AVM061 (Fig. 7). The tiny amount of acetate in LL1004 was likely the result of conversion of intracellularly stored glycogen in the inoculum. Overall, these results show that for both the wild-type strain (LL1004) and the quadruple knockout strain (AVM061), cellobiose fermentation can still occur in growth-arrested cells, which excludes biosynthesis as a primary source of PP i for glycolysis.
Analysis of phosphofructokinase activity in C. thermocellum. Previous studies on organisms with PP i -dependent glycolysis have shown that in addition to a PP i -dependent phosphofructokinase (Pfk), some of these organisms also possess a functional ATP-or FIG 7 Growth and product profiles of LL1004 (wild type; left column) and AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686; right column) during growth arrest studies. Cultures were grown on LC medium (top; control experiment), LC medium without Na 2 SO 4 and with 0.01 g L 21 cysteine (middle; Slimited), and LC medium without cellobiose and Na 2 SO 4 but with 0.01 g L 21 cysteine (bottom; S-limited, no cellobiose). Data are shown for one representative experiment (n = 2). GTP-dependent 6-phosphofructokinase (40)(41)(42)(43). The presence of a functional ATP-or GTP-dependent Pfk could decrease the PP i demand in glycolysis and thereby provide an explanation for the observed phenotype of AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686). Although ATP-dependent Pfk activity has hitherto never been measured in C. thermocellum, one gene has been annotated as an ATP-dependent Pfk in the KEGG (44,45) and Pfam databases (46,47), clo1313_0997, and this gene is moderately expressed at the transcriptional and translational level (6,35,36). Furthermore, two other candidate genes, clo1313_1832 and clo1313_2627, are both annotated as members of the carbohydrate kinase PfkB family (48) in the Pfam database, whereas the KEGG database annotated these genes as a fructokinase and tagatose-6-phosphate kinase, respectively. Both genes are expressed at low to moderate levels (6,35,36).
To investigate if ATP-or GTP-dependent Pfk activity contributes to the observed phenotypes, cell extracts of strains LL1004 and AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) were assayed for PP i -, ATP-, and GTP-dependent Pfk activity. Pfk activity was found to be PP i dependent only for both strains (Table 3). In line with this observation, heterologous expression of clo1313_0997, clo1313_1832, and clo1313_2627 from the pTrc99a plasmid in E. coli BL21 showed similar ATP-and GTP-dependent Pfk activities as the empty vector control (Table S13). An additional protein denaturation heating step, aimed to remove the native E. coli background activity (49), also did not result in measurable Pfk activity in strains expressing the three C. thermocellum genes (Table S13). Heterologous expression in E. coli of the positive-control tsac_1362, encoding the T. saccharolyticum ATP-and GTPdependent Pfk, showed a 5-fold higher ATP-dependent activity and an 11-fold higher GTPdependent activity than the empty vector control (Table S13). This difference in activity became even more pronounced with the heat-treated cell extracts (Table S13). SDS-PAGE analysis showed the presence of all heterologous proteins in the E. coli cell extracts, albeit at varying protein levels ( Fig. S3 and S4). Together, these measurements imply that neither clo1313_0997, clo1313_1832, nor clo1313_2627 encodes an ATP-or GTP-dependent Pfk. Finally, ATP-as well as GTP-dependent fructokinase activity of 1.5 mmol mg protein 21 min 21 was measured in E. coli expressing clo1313_1832, whereas it was below the detection limit of activity in the wild-type strain (LL1004), which provides biochemical confirmation of the current annotation in the KEGG database of clo1313_1832 as a fructokinase (44,45) (Table S14).
Whole-genome sequencing of the engineered strains. The genome of all engineered strains used in this study was sequenced to verify the intended deletions and to identify secondary mutations acquired during strain construction (File S2). None of the observed secondary mutations were related to possible PP i -supplying pathways or to a lower PP i demand (e.g., ATP/GTP-Pfk candidate genes, clo1313_0997 and clo1313_2627, and their up-and downstream regions did not contain mutations). All engineered strains had acquired a nucleotide substitution in the clo1313_1795 gene, encoding the hydrogenase-Fe-S B (hfsB) protein, which resulted in an I516K point mutation. Interestingly, this mutation was already present in the parental strain LL1004, albeit at low frequency (12%). Previous studies reported that disruption of hfsB in C. thermocellum resulted in increased ethanol production (6,50). However, it is unlikely that the I516K point mutation had a similar effect, as the ethanol yields for the engineered strains are comparable to LL1004 (Fig. 6). Furthermore, AVM052 (Dppdk DP ags1,2 -ags1 ags2) and the parental strain AVM003 (Dppdk) contained over three times more secondary mutations than LL1004. The increased mutation frequency of both strains is likely related to a single-nucleotide deletion in clo1313_1445, a gene that encodes the DNA mismatch repair protein MutS. This single-nucleotide deletion resulted in a frameshift of the gene that could inactivate the protein. Mutations in the mutS gene have been shown to result in failure to repair replication errors and give rise to strong mutator or hypermutator phenotypes (51,52). Prediction of additional PP i -generating cycles using an adapted optStoic procedure. optStoic results are inherently dependent upon the quality of the genome annotation. Given that bioinformatically predicted annotations are not always correct, as seen for clo1313_1686, and that approximately 20% of the C. thermocellum genome encodes proteins of unknown function (also called hypothetical proteins) (53), it is possible that important PP i -supplying mechanisms are currently missed. In an attempt to address this possibility, the KEGG database was probed for PP i -generating cycles carrying out the net conversion, ATP eq 1 P i ! ADP eq 1 PP i , which would be feasible in C. thermocellum (26) with the addition of one reaction currently absent from iCBI655. Of the 78 proposed cycles (File S3), 66 can be eliminated, as they required the use of the membrane-bound PPase or ADP-glucose synthase, which have been deleted in this study. One cycle required sulfate adenylyltransferase, which, due to its essential function in sulfur metabolism, cannot be deleted. The 11 remaining PP i -supplying cycles are involved in nucleotide synthesis or mannose metabolism and have multiple gene candidates encoding key enzymes. Hence, candidates identified with the addition of one reaction currently absent in iCBI655 have the same experimental limitations as those discussed for other candidate cycles not investigated in the context of this study.

DISCUSSION
Four possible PP i -generating mechanisms were investigated: the H 1 -pumping membrane-bound PPase, glycogen cycling, a predicted Ppdk-malate shunt cycle, and acetate cycling with acetyl-CoA synthetase as a key enzyme. Several convergent lines of evidence indicate that none of these are the major source of PP i in C. thermocellum. Knockout of a significant PP i -supplying mechanism was expected to severely impair growth or lead to no growth. However, the observed 22% and 38% decreased growth rate and biomass yield in AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) compared to the wild type (LL1004) show that the targeted sources are not important for PP i supply and that there are still one or multiple PP i -supplying pathways functional in C. thermocellum. The lower biomass yield and growth rate in AVM061 are, in addition to the already-described decrease resulting from glycogen removal, likely linked to decreased metabolic flexibility or possible unknown side effects of the deletions.
The 20% decreased biomass yield observed upon deletion of ppdk in LL1004, which was also observed by Olson et al. (8), is one example of the impact of decreased metabolic flexibility but might simultaneously provide information about the stoichiometry of an unknown PP i -generating mechanism(s) (Fig. 6). The malate shunt, which, in a ppdk knockout strain, carries the entire PEP-to-pyruvate flux, yields one ATP equivalent and transhydrogenation of all glycolytically formed NADH to NADPH (Fig. 1). In contrast, the Ppdk reaction converts PEP and PP i to net formation of two ATP equivalents (with adenylate kinase balancing ATP, ADP, and AMP). The observed biomass yield decrease can be related to a catabolic oversupply of NADPH in Dppdk strains due to the different overall stoichiometry of the malate shunt. The excess NADPH can be reoxidized by producing amino acids (18) or by using NfnAB, thereby forming reduced ferredoxin and NADH, which likely leads to more hydrogen formation (54). Although neither product is measured in this study, increased amino acid and H 2 formation were indeed observed previously in a ppdk deletion strain (8). The 9% lower carbon recovery in strain AVM003 (Dppdk) compared to strain LL1004 (Table S10) would be in line with such an increased amino acid excretion. Interestingly, if a currently unknown source of PP i results in formation of more than 1 PP i per ATP (or does not consume ATP), one would also expect a decreasing biomass yield upon deletion of ppdk, thereby providing an alternative (or additional) hypothesis for this observation.
In addition to the functional annotation of the membrane-bound PPase, ADP-glucose synthase, and a fructokinase, it was shown that clo1313_1686 does not encode acetyl-CoA synthetase activity, which contradicts the current annotation in the KEGG database (45). A lack of measurable acetyl-CoA synthetase activity makes acetate cycling as a PP i -supplying mechanism unlikely. Interestingly, in contrast to the absence of an effect observed upon deleting clo1313_1686 in the wild type (LL1004) and strain AVM051 (DP ags1,2 -ags1 ags2), knockout of this gene in strain AVM056 (Dppase DP ags1,2 -ags1 ags2 Dppdk) lowered the growth rate and biomass yield. Since a functional annotation of this gene is currently missing, it is difficult to understand the observed effect of this knockout. Alternatively, whole-genome sequencing showed some secondary mutations that were only present in AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686), which might explain the lower growth rate and biomass yield; however, none of these secondary mutations could be directly linked to this phenotype.
A currently unknown alternative PP i -supplying mechanism will have to meet two requirements. The first is that the overall stoichiometry of this mechanism will likely be equivalent to ATP 1 P i ! ADP 1 PP i . Stoichiometric ATP-to-PP i ratios smaller than unity would contradict the observed reversibility and predicted high energetic efficiency of PP idependent glycolysis (11,55). Furthermore, no other known alternative mechanism apart from the studied H 1 -pumping membrane-bound PPase would allow a stoichiometric ATPto-PP i ratio higher than unity. The second requirement is that the PP i -supplying mechanism must be able to carry sufficient flux to be able to drive glycolysis. For AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686), one can estimate this flux to be approximately 9.7 mmol PP i g x 21 h 21 based on an observed growth rate of 0.30 h 21 , a biomass yield of 0.18 g x g s 21 , and the assumption that PP i -Pfk is the only PP i -consuming reaction in the absence of Ppdk. PP i -dependent acetate kinase (EC 2.7.2.12) and a PP i -dependent PEP carboxykinase (PEPCK) (EC 4.1.1.38) have previously been suggested to supply PP i in Entamoeba histolytica (56,57) and Propionibacterium shermanii (58). However, in C. thermocellum only one gene is annotated as acetate kinase and was found to be ATP dependent (10,59), and PEPCK was unable to use orthophosphate as the substrate (9), which likely eliminates these possibilities. The nonoxidative pentose phosphate pathway of C. thermocellum has previously been shown to use PP i -PFK instead of transaldolase (60), thereby contributing to the PP i originating from biosynthesis (61). However, the optStoic analysis did not identify a possible cyclic pathway involving these reactions as a potential additional PP i source. Since all remaining qualifying PP i -supplying options identified using optStoic with at most a single reaction absent from the iCBI655 model are either part of biosynthetic networks, have essential functions, or rely on unavailable precursor molecules, their future investigation will need further gene-editing developments or alternative scientific approaches.
In contrast to having an alternative PP i -generating mechanism, a smaller than foreseen glycolytic PP i demand is also possible by having an active ATP/GTP-dependent Pfk. Although this activity was not found in this study (Table 3; see also Table S13 in the supplemental material), it is not uncommon for bacteria and eukaryotes with a PP i -dependent glycolysis to have at least one gene encoding an ATP/GTP-dependent Pfk. When cell extracts of these organisms were assayed for ATP-and PP i -dependent Pfk activity, either no ATP-dependent activity was found (9,49,62) or this activity was 6-to 11-fold lower than that of the PP i -dependent activity (42,63). For Clostridium thermosuccinogenes, ATP-Pfk activity was only detected after protein purification; however, the catalytic efficiency (defined as k cat /K m ) of ATP-Pfk was much lower than that of purified PP i -Pfk (60). Additionally, ATP-Pfk activity of the purified E. histolytica ATP-Pfk was only detected after an initial preincubation step with ATP and at relatively high fructose-6-phosphate concentration (20 mM) (42). Hence, it could be that biochemical knowledge and methodologies are currently missing, preventing the measurement of ATP/GTP-Pfk activity in C. thermocellum.
Although the H 1 -pumping membrane-bound PPase and ADP-glucose synthase do not have a PP i -supplying role, the fact that the genes encoding these enzymes are highly expressed and translated (35,36) suggests other functions in C. thermocellum. Given that C. thermocellum has a PP i -dependent Pfk as the main PP i sink, it is unlikely that the membranebound PPase functions solely as a PP i disposal system. Baykov et al. (64) hypothesized that membrane-bound PPases play an important role in cell survival under stress conditions by utilizing the energy released upon PP i hydrolysis to maintain ion gradients. Hence, it could be that the membrane-bound PPase in C. thermocellum is important for its robustness and that phenotypic effects of a ppase deletion strain are only observed under stress conditions. ADP-glucose synthase is important for glycogen formation in wild-type C. thermocellum (Fig.  5). Even though high-flux glycogen cycling is not an essential PP i -generating cycle, 20% to 25% (wt/wt) glycogen is formed in C. thermocellum during exponential growth (Fig. 5) and can be used as carbon and energy storage (65). As part of glycogen formation, ADP-glucose synthase does contribute to the anabolically produced PP i . For an average chain length of 10 glucose units (66) and a PP i stoichiometry of 1 mol PP i per glucose unit added to the glycogen chain, formation of 25% (wt/wt) glycogen can generate 0.15 mmol PP i per g biomass, which would provide less than 1% of the total glycolytic PP i requirement (Fig. 3).
The observation that the membrane-bound PPase is not responsible for PP i supply in C. thermocellum brings into question the hypothesis that PP i -dependent glycolysis allows for a significantly higher energetic efficiency at the costs of thermodynamic driving force compared to an ATP-dependent glycolysis (11,55,67). Since membrane-bound PPase is currently the only predicted mechanism yielding a PP i -to-ATP ratio higher than unity, formation of PP i via an alternative mechanism will likely make use of PP i stoichiometrically equivalent to ATP. Hence, the only energetic benefit of using PP i will come from the relatively small amount of biosynthetic PP i that is recycled in glycolysis. However, C. thermocellum also has other mechanisms to conserve energy. One of these mechanisms is the phosphoroclastic cleavage of cellodextrins (as discussed in the introduction), which, depending on the oligomer size (n), could save 0.5 to 0.83 ATP per glucose equivalent (for n = 2 to 6) (68). Furthermore, coupling of the exergonic oxidation of reduced ferredoxin to endergonic reactions, e.g., ion translocation via membrane-bound hydrogenases (Ech) and oxidoreductases (Rnf) or transhydrogenation of NADH to NADPH (NfnAB) (54,69), also allows for energy conservation. Therefore, although the energetic benefit of using PP i might be small, it is likely one of many mechanisms that C. thermocellum has to optimize energy conservation from sugar dissimilation.
The present study has demonstrated that previously hypothesized PP i sources are not responsible for PP i supply in C. thermocellum. Together with the updated PP i stoichiometry for biosynthesis, these findings can help to improve current genome-scale metabolic models as well as provide fundamental knowledge of the PP i metabolism of C. thermocellum. optStoic-identified cycles with more than one reaction missing from the iCBI655 model can also serve as a starting point for performing sequence alignment to identify previously unknown reactions active in C. thermocellum and capable of cycling carbon to produce PP i . Furthermore, although our findings did not identify the source(s) of PP i , eliminating some potential sources provides new insights into the advantages of having a PP i -dependent glycolysis versus an ATP-dependent glycolysis and the trade-off between energy yield and thermodynamic driving force. Finally, our findings could help to predict the impact of changing from a PP i -dependent glycolysis to an ATP-dependent glycolysis and guide future metabolic engineering attempts aimed to increase the ethanol production capacity of C. thermocellum.

MATERIALS AND METHODS
Strains and maintenance. All C. thermocellum strains used in this study (Table 4) originate from DSM1313 (Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, Braunschweig, Germany; GenBank accession number CP002416). Stock cultures were grown anaerobically in CTFUD medium (32). Escherichia coli strains used in this study (Table 4) originate from a BL21 derivative (New England Biolabs catalog number C2566I; purchased from BioNordika AB, Solna, Sweden). Stock cultures were propagated in LB medium (10 g L 21 peptone, 5 g L 21 yeast extract, 10 g L 21 NaCl) supplemented with 100 mg ml 21 ampicillin. Frozen stocks were prepared by the addition of glycerol (25%, vol/vol) to overnight cultures and stored in 1-ml aliquots in cryogenic vials (VWR International AB, Stockholm, Sweden) at 280°C. For C. thermocellum, stocking was done in a vinyl anaerobic chamber from Coy Laboratory Products with 5% H 2 , 10% CO 2 , and 85% N 2 (Strandmöllen AB, Ljungby, Sweden).
Plasmid construction. All plasmids used in this study are listed in Table 5. Deletion plasmids were in vitro assembled with the Gibson assembly protocol (70, 71) using the pDGO145 backbone, P gapDH -cathpt cassette, and the regions homologous to the 59-upstream, 39-downstream, and internal region of the genes of interest as cloning fragments (32). For the Gibson assembly, 0.15 pmol DNA of the 59-flank,  39-flank, and internal region fragments was mixed with 0.03 pmol of pDGO145 backbone and P gapDH -cathpt cassette. For the E. coli expression plasmids, Gibson assembly was used with 0.03 pmol pTrc99a backbone and 0.15 pmol coding sequence of the gene(s) of interest. DNA fragments were PCR amplified from pDGO145 or pTrc99a or from genomic DNA of C. thermocellum LL1004 (wild-type), Thermoanaerobacterium saccharolyticum JW/SL-YS 485 (DSM8691; GenBank accession number CP003184), or an E. coli BL21 derivate using Phusion high-fidelity DNA polymerase (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer¨s instructions with primers ordered from Thermo Fisher Scientific or Integrated DNA Technologies (IDT; Skokie, IL, USA) ( Table 6). For the deletion plasmids, primers were designed that add at least 30-bp homologous overhangs to the 59 and 39 ends of the 59-flank, 39-flank, and internal region cloning fragments. For the expression plasmids, at least 30-bp overhangs were added to the 59 and 39 ends of the coding sequence of the gene(s) of interest. Genomic DNA, plasmid DNA, and PCR products were purified using commercially available kits from GeneJET (Thermo Fisher Scientific), with the exception of genomic DNA from T. saccharolyticum JW/SL-YS 485, which was purchased directly from DSMZ. After Gibson assembly, 20 ml of Gibson reaction mix was used to transform 50 ml of E. coli BL21 derivative cells (catalog number C2566I; New England Biolabs). E. coli BL21 cells were made chemically competent using rubidium chloride (72). Correct plasmid assembly was confirmed via diagnostic PCR and Sanger sequencing (Eurofins Genomics Sweden AB, Solna, Sweden) of the open reading frames, homologous flanks, and promoters. Diagnostic PCR was performed on plasmid DNA isolated from randomly picked E. coli colonies using DreamTaq DNA polymerase (Thermo Fisher Scientific).
Genetic modifications for C. thermocellum were confirmed via diagnostic PCR with DreamTaq DNA polymerase (Thermo Fisher Scientific), using primer combinations binding outside the targeted loci as well as inside the targeted loci (Table 6). Additionally, Sanger sequencing (Eurofins Genomics Sweden AB) of the modified loci and whole-genome sequencing were used to confirm gene deletions. Culture purity of constructed strains was routinely checked through Sanger sequencing (Eurofins Genomics Sweden AB) with 16S rRNA primers from IDT (Table 6).   Amplification of pDGO145 backbone CAGCTGCTAATAGTAGTGAAAAAATCAG 63 Amplification of P gapDH -cat-hpt selection cassette from pDGO145 GTGGGAATAGGCATGGAAAAGATTTTTTTGCC 64 Amplification of P gapDH -cat-hpt selection cassette from pDGO145 GGGGAGGGCGTGAATGTAAGCGTGA 222 Amplification of 59-flanking region of clo1313_0949 for pTK3 tattatcatgacattaacctataaaaataggcgtatcacgaggcgatatcGCATTTTGCCGTTATGTGCC 100 Amplification of 59-flanking region of clo1313_0949 for pTK3 ccttattatttctgtcccaaatcctttgtaccCCTTTTCCTCCAAGCAGGTC 101 Amplification of 39-flanking region of clo1313_0949 for pTK3 gcatcaatgagagacctgcttggaggaaaaggGGTACAAAGGATTTGGGACAG 102 Amplification of 39-flanking region of clo1313_0949 for pTK3 cgggcaaaaaaatcttttccatgcctattcccacCCCTCACCCTTGCTTCATATG 105 Amplification of internal region of clo1313_0949 for pTK3 gttatgtcacgcttacattcacgccctccccAGTTTGTGGAGATAGCCGAAAAAC 223 Amplification of internal region of clo1313_0949 for pTK3 ttcggttagagcggcattatccctgattttttcactactattagcagctgGGCAACGCAGCAAGTACCCA 419 Amplification of 59-flanking region of P clo1313_0717-0718 -clo1313_0717-0718 for pTK20 and pTK22 ataaaaataggcgtatcacgaggcgatatcGTCCATACCGGAGGAAAAGC 420 Amplification of 59-flanking region of P clo1313_0717-0718 -clo1313_0717-0718 for pTK20 and pTK22 cataatatcaaccttctttatctcttgcaaaaacCACCTGTTAATTTACATTTATCCGCC 418 Amplification of 39-flanking region of P clo1313_0717-0718 -clo1313_0717-0718 for pTK20 and pTK22 tataggcggataaatgtaaattaacaggtgGTTTTTGCAAGAGATAAAGAAGG 395 Amplification of 39-flanking region of P clo1313_0717-0718 -clo1313_0717-0718 for pTK20 and pTK22 gcaaaaaaatcttttccatgcctattcccacAGGTTCTTTTTAAGCTCGCC 390 Amplification of internal region of clo1313_0717 for pTK20 ttatgtcacgcttacattcacgccctccccCATGTTTACAAGATGAACTATTCCC 417 Amplification of internal region of clo1313_0717 for pTK20 ctgattttttcactactattagcagctgTTCCTGACACTGCCGTATATC 396 Amplification of internal region of clo1313_0718 for pTK22 ttatgtcacgcttacattcacgccctccccGTGCCGATGCCATGTACCATAAC 397 Amplification of internal region of clo1313_0718 for pTK22 ccctgattttttcactactattagcagctgGGGATTCAGCATCTCCATGTTAATC XSH0987 Amplification of 59-flanking region of clo1313_1686 for pSH226 taggcgtatcacgaggcgatGAACTGGGCATTGACAGC XSH0988 Amplification of 59-flanking region of clo1313_1686 for pSH226 tttgaagttgTATATAAAACCTCCATTATAAAATTATAGCC XSH0989 Amplification of 39-flanking region of clo1313_1686 for pSH226 ggaggttttatataCAACTTCAAAATAACAACTTGCAAAATAAATG XSH0990 Amplification of 39-flanking region of clo1313_1686 for pSH226 tccatgcctattcccacgatCTTTGAAATCTTCGGCATTGC XSH0991 Amplification of internal region of clo1313_1686 for pSH226 tacctggcccagtagttcagGTTATGGCAAAAGCTCTTATCG XSH0992 Amplification of internal region of clo1313_1686 for pSH226 tttttcactactattagcagTTCAGGATTCAGTGGTTCACC 482 Amplification of pTrc99a backbone for pTK30, pTK50-pTK55 GGCTGTTTTGGCGGATGAGA 483 Amplification of pTrc99a backbone for pTK30 attagtcctccttattcgggtacgtctgattaggCATGGTCTGTTTCCTGTGTG 617 Amplification of pTrc99a backbone for pTK50-pTK55 CATGGTCTGTTTCCTGTGTG 484 Amplification of clo1313_0717-0718 genes for pTK30 cctaatcagacgtacccgaataaggaggactaatATGCATAAAAAGGAGATTATTGCCTTGCTG 485 Amplification of clo1313_0717-0718 genes for pTK30 gaaaatcttctctcatccgccaaaacagccTTATATTACCGCATTTTTTCCTATTGTTATAGG 618 Amplification of clo1313_0997 for pTK50 gataacaatttcacacaggaaacagaccatgATGAGCAGTGTAAGAACGATTG 619 Amplification of clo1313_0997 for pTK50 gaaaatcttctctcatccgccaaaacagccTTATAATGCTAATATTTTGCTAAGCTGAATC 620 Amplification of T. saccharolyticum pfk (tsac_1362) for pTK51 gataacaatttcacacaggaaacagaccatgATGAGAACAATAGGAGTTTTAACAAGTGGTG 621 Amplification of T. saccharolyticum pfk (tsac_1362) for pTK51 gaaaatcttctctcatccgccaaaacagccTTAAATTGATAATATTTTGCTGAGTTCATAAAGC 622 Amplification of clo1313_1832 for pTK52 gataacaatttcacacaggaaacagaccatgATGTTTGATGTTGTTGCGGTTG 623 Amplification of clo1313_1832 for pTK52 gaaaatcttctctcatccgccaaaacagccTTACTTCTCTTCAAGAAACTGCCTC 624 Amplification of clo1313_2627 for pTK53 gataacaatttcacacaggaaacagaccatgATGATAACATCTGTGGCTCTC 625 Amplification of clo1313_2627 for pTK53 gaaaatcttctctcatccgccaaaacagccTCAAGATATTCTCTCAATTTCCACTCTG 632 Amplification of clo1313_1686 for pTK54 gataacaatttcacacaggaaacagaccatgATGTTAGAGAAATATTTGTCAAAAGTAAATTTTAATTC 633 Amplification of clo1313_1686 for pTK54 gaaaatcttctctcatccgccaaaacagccTTATTTTTTTGCCTGGTCCTTGTTTCTG 634 Amplification of E. coli acs (b21_03901) for pTK55 gataacaatttcacacaggaaacagaccatgATGAGCCAAATTCACAAACACACC 635 Amplification of E. coli acs (b21_03901) for pTK55 gaaaatcttctctcatccgccaaaacagccTTACGATGGCATCGCGATAG 282 Confirmation of correct deletion plasmid assembly GCCACCTGACGTCTAAGAAA 281 Confirmation of correct deletion plasmid assembly AAGAAAACAGACGCGCCC 280 Confirmation of correct deletion plasmid assembly GGAACCTTCCTTTTATAGGCG 284 Confirmation of correct deletion plasmid assembly GTTAGAGCGGCATTATCCCT 157 Confirmation of correct deletion plasmid assembly GGCAGCTAATAGAGGCATTA 156 Confirmation of correct deletion plasmid assembly CCTAACTCTCCGTCGCTATT 163 Confirmation of correct deletion plasmid assembly CCTGATGAATGAGTTGAGCTTC 248 Confirmation of correct plasmid assembly pLL1228 CTGAGGGCACGCAGTTTAGG 249 Confirmation of correct plasmid assembly pLL1228 GCTTCCGGTATTTAGTCAGGTGC 524 Confirmation of correct plasmid assembly pSH226 CTCGAACCGGATGACATGAC 525 Confirmation of correct plasmid assembly pSH226 CCATTCCTTGTCACCGTTGAAC 246 Confirmation of correct plasmid assembly pTK3 GCAATGCATCAATGAGAGACCTGC 247 Confirmation of correct plasmid assembly pTK3 GTTATTGCTTTTTCCTGCTACCAAACAC 406 Confirmation of correct plasmid assembly pTK20 and pTK22 CATAATCCTGTAAGGCTGAAGC 329 Confirmation of correct plasmid assembly pTK20 and pTK22 AGCGGAGTTTTGGAGTACGTC 486 Confirmation of correct expression plasmid assembly CTGTGCAGGTCGTAAATCACTG 628 Confirmation of correct expression plasmid assembly CGGCGTTTCACTTCTGAGTTC 325 Confirmation of correct plasmid assembly pTK30 CATAGCATACAGGGATATGAGC 401 Confirmation of correct plasmid assembly pTK30 TGAGCAAATGGTTGACTCAG 404 Confirmation of correct plasmid assembly pTK30 GAATCAGCCCATGACTTCGG 626 Confirmation of correct plasmid assembly pTK50 CTCGATGGCTTTGCCTTCTC 627 Confirmation of correct plasmid assembly pTK50 CAGAGGAGCAAGAGACATCAG 327 Confirmation of correct plasmid assembly pTK51 GCACTATCGACAATGACATACCG 330 Confirmation of correct plasmid assembly pTK51 CCAGTACAATTATGTGATGCAGC 139 Confirmation of correct plasmid assembly pTK52 GTGCGGATGTCAGTTTGGAC 629 Confirmation of correct plasmid assembly pTK52 GCCAAAGTTCTTCTTCACTGAC 630 Confirmation of correct plasmid assembly pTK53 GAGACAGTGAGACGGAAGTAC 631 Confirmation of correct plasmid assembly pTK53 GCGCCTCAACAATTTCGTCC 636 Confirmation of correct plasmid assembly pTK54 TGGCGGTTACAATATGGCCC 637 Confirmation of correct plasmid assembly pTK54 AGGATATGGTCAGACAGAGC 638 Confirmation of correct plasmid assembly pTK55 CCAGTAGATATCACCCGGATG 639 Confirmation of correct plasmid assembly pTK55 CGTACTGGCGGGAAAATTGAC 640 Confirmation of correct plasmid assembly pTK55 TAAGCGTGACGTAGGCGTAG (Continued on next page) modifications. Solution B, containing Na 2 SO 4 , KH 2 PO 4 , and K 2 HPO 4 , was concentrated 1.15-fold. Solution A (20-fold concentrated cellobiose), solution C (50-fold concentrated urea), solution D (50-fold concentrated MgCl 2 , CaCl 2 , FeSO 4 , FeCl 2 , and L-cysteine HCl), solution E (50-fold concentrated vitamins), and solution TE (1,000-fold concentrated trace elements) were added to solution B to reach the final concentrations. After adding all medium components, bottles were purged for 5 cycles (45 s per cycle) with a gas mix of 20% CO 2 and 80% N 2 prior to inoculation to initiate growth. Initial preculture serum bottles were inoculated from frozen glycerol stocks (280°C). After overnight incubation, these initial precultures were transferred in mid-exponential phase (optical density at 600 nm [OD 600 ], 0.5 to 1.0) to fresh precultures. Serum bottles were inoculated from these exponentially growing fresh preculture serum bottles (OD 600 , 0.5 to 1.5) to an initial OD 600 of 0.05. These serum bottles were sampled throughout the exponential growth phase (up to an OD 600 of 2.5) for OD 600 and extracellular metabolite analysis (as described below). All serum bottle cultures were incubated at 55°C in a Jeio Tech ISS-4075R incubator shaker (Milmedtek AB, Karlskrona, Sweden) set at 180 rpm.
Growth arrest studies. For growth arrest experiments, the washed-cell experimental method of Zhou et al. (9) was used, with some modifications. Growth arrest was achieved with two different media based on either nitrogen depletion (as used by Zhou et al. [9]) or sulfur depletion. For both methods, cells were harvested from exponentially growing cultures (OD 600 , 1.2 to 1.8) grown on 100 ml modified LC medium (30,39) in 200-ml Kimble serum bottles (DWK Life Sciences). Upon harvesting, cells were transferred in the anaerobic chamber to two 50-ml Falcon tubes sealed with anaerobic vinyl tape (TG Instrument AB, Helsingborg, Sweden) and centrifuged in an Avanti J-20 XP centrifuge (Beckman Coulter, Brea, CA, USA) at 6,500 Â g for 15 min at 4°C. Cell pellets were resuspended in 1.8 ml modified LC medium without cellobiose, cysteine, and Na 2 SO 4 (for S depletion) or in 1.8 ml modified LC medium without cellobiose and urea (for N depletion), transferred to a 2-ml microtube (Sarstedt AB, Helsingborg, Sweden), centrifuged (6,500 Â g, 12 min, 25°C) in a table-top centrifuge (Microstar 12; VWR), and washed twice with the same modified LC medium. Washed cells were used to inoculate 125-ml Wheaton serum bottles (DWK Life Sciences) containing 50 ml of one of five modified LC media, i.e., LC medium, LC medium without Na 2 SO 4 and with 10-fold lower cysteine levels (0.01 g L 21 ) (for S-depletion), LC medium without cellobiose and Na 2 SO 4 but with 0.01 g L 21 cysteine (S-depletion control), LC medium without urea (for N-depletion), and LC medium without cellobiose and urea (N-depletion control). All washes and inoculation were done under anaerobic conditions. Modified LC media were purged extensively for 20 cycles (45 s per cycle) with pure N 2 gas, directly followed by purging for 5 cycles (45 s per cycle) with a gas mix of 20% CO 2 and 80% N 2 , and immediately transferred to the anaerobic chamber to ensure complete anaerobic conditions. After inoculation, bottles were sampled regularly over the course of 24 to 72 h for both OD 600 and extracellular metabolite analysis as described below.
CDW and optical density determination. Cell dry weight (CDW) measurements were performed in technical triplicate by adding 10 ml of culture sample to predried and preweighed conical glass tubes and centrifuging (2,250 Â g, 20 min) the tubes in a table-top centrifuge (Z206 A; Hermle Labortechnik GmbH, Wehingen, Germany). Cell pellets were washed once with deionized water and dried overnight in a forced-convection drying oven (VENTI-Line; VWR International AB) at 105°C. CDW was determined by dividing the dry weight of the cells by the volume of the culture sample. Optical density was measured in technical triplicate at 600 nm in a V-1200 spectrophotometer (VWR International AB). For calculation of the biomass yield on cellobiose, CDW was estimated from OD 600 measurements using a conversion factor of 2.6. This factor was determined from the slope of OD 600 against CDW based on a calibration curve of five samples. The calibration curve was made by serial dilution of culture samples taken at an OD 600 of 2.0 (OD 600 values after dilution: 0.5, 0.8, 1.0, 1.5, and 2.0) for strain LL1004 (wild type), AVM003 (Dppdk), AVM051 (DP ags1,2 -ags1 ags2), AVM056 (Dppase DP ags1,2 -ags1 ags2 Dppdk), and AVM061 (Dppase DP ags1,2 -ags1 ags2 Dppdk Dclo1313_1686) and was consistent between all strains. Extracellular metabolite analysis. Culture supernatants were obtained by centrifuging 0.6 ml culture sample in a table-top centrifuge (Centrifuge 5424; Eppendorf, Hamburg, Germany) at 20,238 Â g for 2 min. After initial centrifugation, the supernatant was filtrated with Corning Costar Spin-X centrifuge tube filters (0.22-mm nylon membrane; Sigma-Aldrich) at 20,238 Â g for 2 min. Filtered supernatant was stored for up to 1 week at 4°C until further analysis. Extracellular concentrations of acetate, ethanol, formate, lactate, pyruvate, malate, glucose, and cellobiose were determined in culture supernatants on a Waters Alliance 2695 highperformance liquid chromatography (HPLC) (Waters, Milford, MA, USA) containing a Bio-Rad Aminex HPX-87H column (Bio-Rad, Hercules, CA, USA). The HPLC was operated with 5 mM H 2 SO 4 as the mobile phase at a flow rate of 0.6 ml min 21 at 60°C. Pyruvate, malate, and formate were detected with 75 mM H 2 SO 4 as the mobile phase. Detection was done with a Waters 2996 photodiode array detector at 210 nm and a Waters 2414 refractive-index detector. The column was heated with a Waters temperature control module.
Glycogen assay. Glycogen content was measured in technical triplicate by adding 1 ml of culture sample to 5 ml ice-cold methanol (280°C) and centrifuging (10,000 Â g, 10 min, 210°C) in an Avanti J-20 XP centrifuge (Beckman Coulter). The supernatant was decanted and the cell pellet was dissolved in 1.8 ml ice-cold methanol, transferred to a 2-ml microtube (Sarstedt AB), and centrifuged (10,000 Â g, 10 min, 4°C) in a table-top centrifuge. The cell pellet was stored at 280°C until further analysis. After thawing the cell pellets on ice, glycogen was analyzed as described previously (73). Glucose released from glycogen conversion was measured by HPLC as described above.
Calculation of yields and specific growth rate. Yields on cellobiose (in mol mol 21 or g x g 21 ) and maximum specific growth rates (m max ; per hour) of each fermentation were determined from at least five samples taken during the exponential growth phase (OD 600 , 0.4 to 2.2). Yields on cellobiose were calculated by plotting the product concentrations against the cellobiose concentration and using the absolute slopes of the resulting linear fit made by linear regression. The maximum specific growth rate during exponential growth was calculated from the slope of the semilogarithmic plot of OD 600 against time. All fermentation data can be found in File S4 in the supplemental material.
Protein expression in E. coli. For high-level expression of genes of interest from the trc promoter in E. coli, 500 ml of an overnight culture was inoculated into 500-ml baffled shake flasks containing 100 ml LB medium supplemented with 100 mg ml 21 ampicillin. Cultures were grown aerobically at 37°C in an orbital shaker (Infors, Basel, Switzerland) set at 180 rpm. At an OD 600 of 0.6, protein expression was induced by addition of isopropyl-b-D-1-thiogalactopyranoside at a final concentration of 200 mM. After 4 h of incubation, cells were harvested, centrifuged (6,500 Â g, 15 min, 4°C) in an Avanti J-20 XP centrifuge (Beckman Coulter), washed twice with 100 mM cold Tris-HCl buffer (pH 7.5 at 25°C), and stored at 220°C until further use. BL21 pTrc99a was used as an empty vector control strain.
SDS-PAGE. To verify expression of genes of interest in E. coli, SDS-PAGE gels were run. Three volumes of cell extract samples were mixed with one volume of 4Â NuPAGE LDS sample buffer (Thermo Fisher Scientific) and incubated at 105°C for 15 min. For each cell extract sample, approximately 6 mg of protein solution (in 10 to 15 ml) and 10 ml of SeeBlue Plus2 prestained protein standard (Thermo Fisher Scientific) were loaded into wells of a NuPAGE 10% Bis-Tris gel (Thermo Fisher Scientific). Gel electrophoresis was run at 180 V for 60 min in morpholinepropanesulfonic acid (MOPS) running buffer (10.46 g L 21 MOPS, 6.06 g L 21 Tris, 1 g L 21 SDS, 0.3 g L 21 EDTA) at room temperature. After electrophoresis, gels were stained with PageBlue protein staining solution (Thermo Fisher Scientific) according to the manufacturer's instructions.
Preparation of cell extracts for in vitro enzyme activity assays. C. thermocellum cell extracts for in vitro measurement of pyruvate phosphate dikinase, acetyl-CoA synthetase, lactate dehydrogenase, ADPglucose synthase, and phosphofructokinase activity were prepared as described earlier (30).
For in vitro enzyme activity assays of the membrane-bound pyrophosphatase, C. thermocellum cells were harvested as described above, with minor modifications. After harvesting the cells, the cell pellets were washed and resuspended in 10 mM MOPS-tetramethylammonium (TMA) hydroxide buffer (pH 7.2 at 25°C) prior to storage at 220°C. To prepare cell extracts, cell suspensions were thawed on ice, centrifuged (6,500 Â g, 15 min, 4°C) in an Avanti J-20 XP centrifuge (Beckman Coulter), and resuspended in 10 mM MOPS-TMAOH buffer (pH 7.2 at 25°C) with 5 mM MgCl 2 , 5 mM DTT, 50 mM EGTA, and one tablet of cOmplete protease inhibitor cocktail (Sigma-Aldrich). Cells were disrupted by triple passage through a prewashed and prechilled FA-078 SLM Aminco French press (SLM Instruments Inc., Urbana, IL, USA) at 19,200 lb/in 2 . Between each passage, cell suspensions were cooled on ice for 5 min. Whole cells and cell debris were removed by centrifugation (20,000 Â g, 30 min, 4°C).
E. coli cell extracts were prepared by thawing the cell suspension on ice, centrifuging (6,500 Â g, 15 min, 4°C), and resuspending the cell pellet in 100 mM Tris-HCl buffer (pH 7.5 at 25°C) with 2 mM DTT and 10 mM MgCl 2 . Cells were lysed by single passage through a prewashed and prechilled FA-078 SLM Aminco French press (SLM Instruments Inc.) at 12,800 lb/in 2 . The cell extract was obtained by centrifuging the cell lysate at 15,000 Â g for 20 min at 4°C.
For phosphofructokinase enzyme activity assays with E. coli strains BL21 pTK50, BL21 pTK51, BL21 pTK52, and BL21 pTK53, the cell extract was split in two fractions. One fraction was used directly in the assays, whereas the other fraction was heat treated for 30 min at 60°C to remove native E. coli background activity (as described previously [49]). The heat-treated sample was subsequently centrifuged at 15,000 Â g for 20 min at 4°C to remove precipitated proteins.
All cell extracts were stored on ice and used on the same day.
In vitro enzyme activity assays. Enzyme activity assays were performed aerobically with fresh cell extracts at 37°C or 55°C with a Cary 50 UV-visible spectrophotometer equipped with a single-cell Peltier element (Varian AB, Solna, Sweden). All assays were done in quartz cuvettes (Sigma-Aldrich) with 1-cm path length and 1-or 3-ml reaction mixtures. Enzyme activities are reported in mmol Á min 21 Á (mg protein) 21 and are averages from biological duplicate measurements. For each biological replicate, two concentrations of cell extract were assayed in technical duplicate to confirm proportionality between the enzyme activity and the added amount of cell extract.
To ensure that all enzyme activity assays were performed at exactly 37°C or 55°C, all reaction components except for coupling enzymes (if needed), cell extract, and the substrate used to start the reaction were added as concentrated stock solutions to preheated Milli-Q (at 37°C or 55°C) and incubated for 2 or 5 min at 37°C or 55°C, respectively. After this interval, coupling enzymes and the cell extract were added and incubated for 2 or 4 min at 37°C or 55°C, respectively. The reaction was subsequently started by adding the starting substrate as described below.
ATP-and PP i -dependent phosphofructokinase (EC 2.7.1.11 or EC 2.7.1.90) was assayed according to Zhou et al. (9), with some modifications. The reaction mixture (1 ml) contained 50 mM Tris-HCl (pH 7.0), 5 mM MgCl 2 , 0.15 mM NADH, 1 mM fructose-6-phosphate, 4 U of aldolase (from rabbit muscle, A8811; Sigma-Aldrich), 4 U of triosephosphate isomerase (from S. cerevisiae, T2507; Sigma-Aldrich), 4 U of a-glycerophosphate dehydrogenase (from rabbit muscle, G6751; Sigma-Aldrich), and 50 or 100 ml cell extract. The reaction was started by addition of 2 mM ATP or PP i . For GTP-dependent phosphofructokinase assays, 2 mM GTP was added at the start and incubated for 2 min with 2 U of inorganic pyrophosphatase (from yeast, 10108987001; Roche) to remove PP i impurities in the GTP stock. The reaction was started by addition of 1 mM fructose-6-phosphate. In addition to these standard conditions, the following variations were attempted for the ATP-and GTP-dependent Pfk assays: (i) use of 50 mM imidazole-HCl (pH 7.0 at 55°C) or MOPS (pH 7.0 at 55°C) as assay buffer, (ii) dialysis of the cell extract as described above, and (iii) addition of various concentrations of ATP or GTP (0.05 to 8 mM) and fructose-6-phosphate (0.25 to 20 mM).
Membrane-bound pyrophosphatase (EC 7.1.3.1) was discontinuously assayed based on hydrolysis of pyrophosphate (PP i ) to inorganic phosphate (P i ) (76). The assay mixture (3 ml) contained 100 mM MOPS-TMAOH (pH 7.2), 5.3 mM MgCl 2 , 40 mM EGTA, 10 mM NaCl, 50 mM KCl, and 20, 40, 50, 100, or 200 ml cell extract. The reaction was started by addition of 160 mM K 4 PP i . After 10 s and 5 min, a 550-ml sample was taken and stored immediately on ice. Liberated inorganic phosphate was detected using a malachite green detection assay (77). A volume of 40 ml of malachite green color reagent was mixed with 160 ml of sample in triplicate in a flat-bottom 96-well plate (Eppendorf Cell Culture Plates; Sigma-Aldrich) and incubated at room temperature for 25 min. After exactly 25 min, the absorbance was measured at 630 nm in a plate reader (AH Diagnostics AB, Solna, Sweden). Phosphate concentrations were calculated from A 630 values using a phosphate calibration curve (range, 0 to 30 mM P i ). Enzyme activities were corrected for acid-catalyzed hydrolysis of PP i , which was determined by omitting cell extract from the assay mixture.
Protein concentrations in cell extracts were quantified according to Bradford (78) with bovine serum albumin (Sigma-Aldrich) as the standard.
Optimizations of ADP-glucose synthase assay. Since ADP-glucose synthase activity assays with C. thermocellum cell extracts did not show any activity with the above-described protocol, several modifications to this protocol were attempted both in the preparation of the cell extract as well as in the assay conditions used. For the preparation of the cell extract, we (i) harvested and prepared the cell(-free) extracts in 100 mM MOPS (pH 7.5 at 25°C), HEPES (pH 7.5 at 25°C), or potassium-phosphate buffer (pH 7.5 at 25°C) instead of Tris-HCl buffer (pH 7.5 at 25°C) as described above, (ii) added one tablet of cOmplete protease inhibitor cocktail (Sigma-Aldrich) to the lysis buffer, (iii) performed the harvest and preparation of the cell(-free) extracts under anaerobic conditions, and (iv) dialyzed the cell extract prior to the in vitro enzyme activity measurement according to the protocol described above. The assay modifications included (i) the use of 50 mM MOPS, HEPES, or potassium-phosphate buffer at a (ii) pH of 7.0, 7.5, or 8.0, (iii) varying the assay temperature to 37, 45, and 55°C, and (iv) assaying the activity under anaerobic conditions.
Whole-genome sequencing. Whole-genome resequencing for all strains constructed in this study was used to verify strain construction and check for secondary mutations, as described before (79).
(i) Library preparation. Genomic DNA was prepared using the Omega E.Z.N.A. kit by following the manufacturer's protocol (Omega Bio-Tek, GA, USA). A volume of 500 ng of DNA was used for next-generation sequencing (NGS) library preparation using the NEBNext Ultra II FS DNA library prep kit for Illumina (New England Biolabs, MA, USA). Fractionated, adapter-ligated DNA fragments went through 5 rounds of PCR amplification and purification. The resulting NGS library was sequenced at the Genomics and Molecular Biology Shared Resource (GMBSR) at Dartmouth.
(iii) Data analysis. Read data were analyzed with the CLC Genomic Workbench, version 12 (Qiagen Inc., Hilden, Germany). Reads were mapped to the reference genome (GenBank accession no. NC_017992) with an average read coverage of at least 45-fold. Mapping was improved by two rounds of local realignment. The CLC Basic Variant Detection algorithm was used to determine small mutations (single-and multiple-nucleotide polymorphisms, short insertions, and short deletions). Variants occurring in less than 35% of the reads or fewer than 4 reads were filtered out. The fraction of the reads containing the mutation is presented in File S2. To determine larger mutations, the CLC InDel and Structural Variant algorithm was run. This tool analyzes unaligned ends of reads and annotates regions where a structural variation may have occurred, which are called breakpoints. Since the read length averaged 150 bp and the minimum mapping fraction was 0.5, a breakpoint can have up to 75 bp of sequence data. The resulting breakpoints were filtered to eliminate those with fewer than 10 reads or less than 20% not perfectly matched. The breakpoint sequence was searched with the Basic Local Alignment Search Tool (BLAST) algorithm (80) for similarity to known sequences. Pairs of matching left and right breakpoints were considered evidence for structural variations, such as transposon insertions and gene deletions. The fraction of the reads supporting the mutation (left and right breakpoints averaged) is presented in File S2. Mutation data from CLC were further processed using custom Python scripts (https:// github.com/danolson1/cth-mutation).
optStoic procedure. The optStoic algorithm (31) is used to probe for pathways capable of generating pyrophosphate via the net stoichiometric conversion ATP eq 1 P i ! ADP eq 1 PP i using the reaction content of the most recent C. thermocellum genome-scale metabolic model (i.e., iCBI655 [26]) as well as the complete list of reactions present in the KEGG database (44) (7,164 reactions). The minFlux optimization formulation within optStoic minimizes the total reaction flux required to achieve the desired net stoichiometric conversion. An in-depth description of the optStoic algorithm and the minFlux optimization formulation can be found elsewhere (31). By successively excluding the previously obtained solutions and rerunning minFlux, a total of 100 distinct pathway designs were identified using reactions from only model iCBI655 and 500 with the full complement of KEGG reactions. From the set of 500 solutions generated using KEGG reactions, only solutions that contained at most one new reaction that was not present in the iCBI655 genome-scale metabolic model were retained.
Data analysis. Unpaired Student's t test was used for comparison between values in this study. Data availability. Whole-genome sequencing data of the engineered strains was deposited into the NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/sra) with accession numbers listed in Table 4. Plasmid sequence accession numbers are shown in