Abstract
Angustmycin A has anti-mycobacterial and cytokinin activities, and contains an intriguing structure in which an unusual sugar with C5′-C6′ dehydration is linked to adenine via an N-glycosidic bond. However, the logic underlying the biosynthesis of this molecule has long remained obscure. Here, we address angustmycin A biosynthesis by the full deciphering of its pathway. We demonstrate that AgmD, C, A, E, and B function as d-allulose 6-phosphate 3-epimerase, d-allulose 6-phosphate pyrophosphokinase, adenine phosphoallulosyltransferase, phosphoribohydrolase, and phosphatase, respectively, and that these collaboratively catalyze the relay reactions to biosynthesize angustmycin C. Additionally, we provide evidence that AgmF is a noncanonical dehydratase for the final step to angustmycin A via a self-sufficient strategy for cofactor recycling. Finally, we have reconstituted the entire six-enzyme pathway in vitro and in E. coli leading to angustmycin A production. These results expand the enzymatic repertoire regarding natural product biosynthesis, and also open the way for rational and rapid discovery of other angustmycin related antibiotics.
Similar content being viewed by others
Introduction
The nucleoside antibiotics are a large and structurally diverse group of compounds that often exert biological effects by mimicking naturally occurring nucleosides or nucleotides1,2. They are typically comprised of a modified nucleoside base, N-glycosidically linked to a ribosyl sugar residue, analogous to the structural characteristics of biological purine or pyrimidine ribonucleosides. The angustmycins, and related angustmycin A (1, also called decoyinine) and angustmycin C (2, also called psicofuranine), are adenosine analogs, but are unusual in having a six-carbon ketose sugar, β-d-psicofuranosyl, in place of the more usual ribofuranose motif (Fig. 1)3,4. The N-glycosidic linkage of the ketose sugar to adenine is via an unusual β-2-N-ketoglycosidic bond that is rarely found elsewhere in nature5. 1 differs from 2 in that it has an exo-5,6-ene bond similar to the exo-glycal intermediates found in the biosynthesis of tunicamycins6. The 5′,6′-ene group of 1 precludes any of the phosphorylation of the 6′-OH, which is possible with 2, and potentially also effects the local planarity of the furanosyl ring.
Several chemical syntheses have been reported for the angustmycin family7,8,9,10,11, including 4-thio angustmycin C12, although considerably less is known regarding the biosynthesis. Early radiolabeling studies by Suhadolnik and co-workers have shown that the 6′-deoxy-d-erythro-2′,5′-hexodiulose sugar motif of 1 arises directly from d-[1-14C] glucose or uniformly 14C-labeled d-fructose13,14. Also, 2 labeled with 14C in the adenine group and at C-6 of the d-allulose is directly converted to 1 by growing cells of Streptomyces hygroscopicus14.
The biological effects induced by angustmycin-related antibiotics have been studied for over 50 years15,16. 1 is a potent inhibitor of GMP (guanosine monophosphate) synthesis in Gram-positive bacteria and is commonly used as a biochemical tool to induce bacterial sporulation17,18. More recently, the inhibition of the murine GMP synthase by 1 has been shown to suppress melanoma cell invasion and tumorigenicity in immunocompromised mice, suggesting the potential of angustmycins as an anti-melanoma agent19,20.
Angustmycin A (1) was also recently found to be a promising cytokinin (called Linfusu) that induces adventitious root or bud differentiation21. Cytokinins, as plant-specific chemical messengers (hormones), usually participate in the regulation of the cell cycle and several development processes during plant growth22. Adenine-derived cytokinins are abundant in nature, including zeatin with an adenine base and a five-carbon isopentenyl side chain23. The naturally occurring trans-zeatin was initially isolated from maize endosperm in the early 1960s24 and several other nucleoside cytokinins were identified later from plants or microbial metabolites25,26. Like cytokinin, 1 has been previously demonstrated to promote the growth of several plants, including Panax notoginseng, Siraitia grosvenorii, and Triticum aestivum27.
In the present paper, we address the mechanism for the biosynthesis of the angustmycin family of nucleotide antibiotics. We demonstrate a six-enzyme pathway for the biosynthesis of 1, which highlights an unusual dehydration step via a self-sufficient economical strategy for cofactor recycling. We have reconstituted the entire six-enzyme pathway both in vitro and in Escherichia coli, leading to 1. The study provides an understanding of the logic underlying 1 biosynthesis, expanding our knowledge of natural product biosynthesis, and also paves the way for further engineering of the pathway for rational enhancement of 1 production.
Results and discussion
Identification of the angustmycin gene cluster from S. angustmyceticus JCM 4053
To identify the gene cluster for angustmycin biosynthesis, we investigated the genomes of two independent angustmycin producer strains, Streptomyces angustmyceticus JCM 4053 and Streptomyces decoyicus NRRL 2666, whose capabilities of angustmycins production were confirmed by liquid chromatography-mass spectrometry (LC-MS) (Supplementary Fig. 1a–d). Since adenine is the direct precursor for the corresponding adenosyl motif of angustmycins13, we initially used Ari9 (the adenosine phosphorylase in the aristeromycin pathway) as a query sequence to perform BlastP analysis against S. angustmyceticus JCM 4053 genome (accession no. CP082945, BLAG01000004). This identified a homolog with 45% identity to Ari9, but the surrounding genes (lotus tag, K7396_21205-K7396_21225; accession no. CP082945) are apparently unrelated to angustmycin biosynthesis. We deduced that angustmycin biosynthesis may use a biosynthetic strategy similar to that of trans-zeatin to provide the purine-related moiety28, and we subsequently used the enzyme LOG (Lonely guy; accession no. AK071695) as the query sequence to conduct BlastP analysis (Supplementary Fig. 1e), leading to the discovery of five candidate enzymes. The surrounding regions of these candidates revealed a gene cluster (agm, accession no. MZ151497) that encodes enzymes related to allulose sugar biosynthesis and assembly. Furthermore, using AgmA as the query sequence led to the identification of a homologous gene cluster (accession no. MZ151498) in S. decoyicus NRRL 2666 genome (accession no. CP082301) with the identical organization and significant homology. Hence, these initial findings suggested that the target gene cluster (agm) may be directly responsible for angustmycin biosynthesis (Fig. 2a).
To correlate the agm gene cluster to angustmycin biosynthesis, the gene cluster from S. angustmyceticus JCM 4053 was directly cloned by two-fragment PCR coupled with Gibson assembly. The subsequent gene cluster and its variant (pCHW501 and pCHW501ΔagmF) were individually introduced into Streptomyces coelicolor M1154 (Supplementary Fig. 2a, b). After confirmation, the resultant conjugants were fermented for metabolite analysis. A bioassay indicated that the metabolite samples of M1154::pCHW501 and JCM 4053 show apparent inhibition against the indicator strain Mycobacterium smegmatis, while the samples of M1154::pCHW501ΔagmF and M1154::pSET152 lacked the relevant bioactivity (Fig. 2b). High-performance liquid chromatography (HPLC) analysis indicated that the sample from S. coelicolor M1154::pCHW501 generated target peaks corresponding to those of 2 and 1 of JCM 4053 metabolites, while these peaks were absent from the metabolites of the negative control (S. coelicolor M1154::pSET152). LC-MS analysis revealed that the target peaks of M1154::pCHW501 gave rise to distinctive [M + H]+ ions at m/z 298.1144 (with major fragment ions at m/z 280.0587 and 136.0617) and m/z 280.1040 (with fragment ions at m/z 262.0929 and 136.0618), which are fully consistent with the expected fragmentation patterns of 2 and 1 (Supplementary Fig. 3b, c). More interestingly, the metabolites recovered from M1154::pCHW501ΔagmF selectively produced the 2 peak, but not 1, suggesting that the agmF gene is likely responsible for the conversion of 2 to 1 (Fig. 2c and Supplementary Fig. 3a).
To establish the identities of the target metabolites accumulated by M1154::pCHW501 and M1154::pCHW501ΔagmF, they were HPLC purified for 1D and 2D nuclear magnetic resonance (NMR) analysis. The proton-NMR (1H-NMR) and heteronuclear multiple quantum coherence NMR spectra of the metabolite from M1154::pCHW501ΔagmF displayed two methylenes, five methines (including two sp2 carbons), and four quaternary carbons (including three sp3). Detailed analysis of the 1D and 2D NMR data led to the identification of this metabolite as 2, which was further confirmed by 1H-1H COrrelated SpectroscopY correlations of H-3′‒H-4′‒H-5′‒H-6′, and by heteronuclear multiple bond correlations (HMBC) of H-1′ with C-2′ and C-3′; H-8 with C-4 and C-5; and H-2 with C-4, C-5, and C-6 (Supplementary Fig. 4, 5 and Supplementary Table 1). Comparison of the 1D and 2D NMR data of the M1154::pCHW501 metabolites with those of 2 suggested structural similarities, except that the hydroxymethyl group at C-5′ in 2 was replaced by an exocyclic double bond. This was further confirmed by HMBC correlations from H-6′ to C-4′ and C-5′ (Supplementary Figs. 6 and 7 and Supplementary Table 2). Accordingly, the structures of the target metabolites were identical to those of 2 and 1 as shown in Fig. 1. Taken together, these data also demonstrated that the target gene cluster is responsible for the biosynthesis of angustmycin.
The nine-gene cluster agm is required for angustmycin biosynthesis
In silico analysis revealed a target 9.8 kb region (pCHW501) containing nine genes that are deduced to be involved in angustmycin biosynthesis. Of these, agmA-F, constituting the structural genes, are important for angustmycin biosynthesis on the basis of genetic and in silico analysis, and their functions are described in Supplementary Table 3. Moreover, AgmT1 and AgmT2 are annotated as MFS (major facilitator superfamily) transporters. In general, MFS proteins facilitate the transport of a variety of substrates across cytoplasmic or internal membranes29, and therefore, AgmT1 and AgmT2 are proposed to play the functional role of exporting angustmycins out of cells. AgmR shows 50% identity to A6A06_38595 (LacI family regulator) of Streptomyces sp. CB02923, and may have a regulatory role by binding to the promoter region during angustmycin biosynthesis. To establish the role of agmR, the pCHW501ΔagmR variant was conjugated into S. coelicolor M1154, and the resultant recombinant was then fermented for metabolite analysis (Supplementary Fig. 2c and Supplementary Table 4). HPLC analysis indicated that the sample from M1154::pCHW501ΔagmR was capable of producing two peaks corresponding to 2 (major component) and 1, which are quantitatively different from that of M1154::pCHW501, implying that there may be an endogenous regulator in S. coelicolor M1154, which also participates in upregulating the transcriptional level of the biosynthetic genes to control the ratio of 2 and 1 (Fig. 2c and Supplementary Fig. 3a, d). Related studies to investigate the regulatory mechanism during angustmycin biosynthesis are now underway in our laboratory.
AgmA functions as an AMP phosphoribohydrolase
As indicated by in silico analysis, AgmA shows considerable homology (54% identity) to the LOG gene of Oryza sativa Japonica Group (japonicum cultivar rice), which is a well-characterized phosphoribohydrolase involved in trans-zeatin biosynthesis, so we proposed that AgmA performs a similar function in angustmycin biosynthesis (Supplementary Fig. 8a). To determine if AgmA indeed fulfills this role, it was overexpressed and purified to near homogeneity from E. coli BL21(DE3). The in vitro activity was tested using AMP/dAMP as substrate. Consistent with our expectation, the HPLC results indicated that AgmA is capable of consuming AMP/dAMP as a substrate to produce a characteristic peak for adenine, which was absent from the negative control (Fig. 3a and Supplementary Fig. 8b–f). Further LC-MS analysis showed that the target peak gave rise to a distinctive [M + H]+ ion for adenine at m/z 136.0613 (with major fragment ions at m/z 118.8611 and 93.8582) and m/z 136.0613 (with major fragment ions at m/z 118.8811 and 93.9553), fully corresponding with those of the adenine authentic standard (Supplementary Fig. 9a–c). We then determined the specificity of AgmA against various other substrates related to AMP, including ATP, ADP, dATP, dADP, adenosine, and deoxyadenosine. The HPLC results from these experiments indicated that the AgmA enzyme could also recognize ATP, ADP, and dADP as substrates, but with lower relative activity (Supplementary Fig. 9d). These data establish that AgmA functions as an AMP phosphoribohydrolase for the supply of adenine.
The LOG family of cytokinin-activating enzymes (InterPro: IPR005269), as revealed by BlastP analysis, is widespread among the three kingdoms of life (with ca. 9000 sequences retrieved from the UniProt database). These sequences were incorporated using an enzyme similarity tool (EFI-EST) to generate a sequence similarity network (SSN)30 with the UniRef90 database based on an alignment score of 80 (Fig. 3b). These results indicated that AgmA homologs are more widely distributed in bacterial genomes, and also in plant genomes. The related enzymes of actinobacterial origin, including the AgmA described in this study, cluster in the network (highlighted in red). These cytokinin-activating family proteins commonly contain a conserved PGGxGTxxE motif, which differs from the canonical lysine decarboxylases31. Their proposed catalytic activity in plants is to convert inactive cytokinin nucleotides to active forms. Foreseeably, the AgmA-related enzyme may be of considerable value for plant cytokinin studies and applications.
Angustmycin A biosynthesis features an unusual final dehydration step
The C-5′–C-6′ double bond present in the sugar moiety of 1 is chemically intriguing. In our initial studies, the truncated agm gene cluster (pCHW501ΔagmF) was revealed to confer the host cell S. coelicolor M1154 with the capability of selectively producing 2, implying that the candidate enzyme AgmF is likely to govern the final dehydration step of 2 to 1. Surprisingly, the further bioinformatic analysis showed that AgmF was annotated as an S-adenosylhomocysteine (SAH) hydrolase, which usually catalyzes the reversible hydrolysis of SAH to adenosine and homocysteine32,33. SAH hydrolase is a ubiquitous enzyme that plays a central role in cellular methylation processes by maintaining the intracellular balance between SAH and S-adenosylmethionine34. To determine if AgmF also fulfills the role of a SAH hydrolase, it was overexpressed and purified from E. coli BL21(DE3), and we subsequently evaluated the enzymatic assay of AgmF using SAH as substrate. More surprisingly, the results showed that the AgmF enzyme could not recognize it as the substrate. We, therefore, reinvestigated the enzymatic role of AgmF, and inferred that 2 is more likely to be its real substrate. Accordingly, we tested its activity with 2 in the presence or absence of the exogenous NAD+ (nicotinamide adenine dinucleotide) cofactor, and the HPLC results indicated that 2 is indeed dehydrated to form 1 (Supplementary Fig. 10a). The identity was further confirmed by LC-MS, as it could give rise to [M + H]+ ions at m/z 280.1032 (with main fragment ions at m/z 262.0833 and 136.0000) and m/z 280.1031 (with main fragment ions at m/z 262.0000 and 135.9167), which fully matched to those of a 1 authentic standard. More interestingly, we also report that AgmF is able to maintain the activity without the addition of exogenously NAD+ cofactor, implying that AgmF is a self-sufficient enzyme in which the NAD+ cofactor is tightly bound within the activity pocket (Fig. 4a and Supplementary Fig. 12a). To further test this hypothesis, the AgmF enzyme was heat-treated to release the potential cofactor, which was then identified by HPLC as a distinctive NAD+ peak, whose identity was further determined by LC-MS (Supplementary Fig. 10b–f). To compare the difference in the enzymatic activity with or without NAD+, we conducted a time-course experiment, illustrating that the AgmF reaction with NAD+ occurs at a higher rate in the early reaction stage (Supplementary Fig. 10g). Furthermore, we noticed that substrate 2 was not be used up in the AgmF reaction, implying that this enzyme may catalyze a reversible reaction. Hence, AgmF may catalyze a reversible reaction from 1 to 2 with the chemical equilibrium shifted to 1 formation (Fig. 4b and Supplementary Fig. 12b).
We next determined the kinetic parameters of AgmF for the substrate 2, with KM = 1.382 ± 0.121 mM, and kcat = 1.902 ± 0.061 s−1 (Supplementary Fig. 13a). The KM value is larger than those of the reported SAH hydrolases with SAH as substrate35. We also evaluated the substrate specificity of AgmF against several 2 analogs, including adenosine, 2′-deoxyadenosine, 2′-amino-2′-deoxyadenosine, aristeromycin, formycin, and tubercidin, with results showing that the enzyme specifically recognizes 2 as the primary substrate (Supplementary Fig. 12d, e). To further investigate the catalytic mechanism of AgmF, a homologous structural model for AgmF was constructed based on a SAH hydrolase (containing NAD+ and adenosine binding pockets) of Mycobacterium tuberculosis H37Rv (PDB: 3CE6, 34% identity to AgmF)34. Accordingly, the side chains of Thr155, Thr156, Asn189, Thr240, Glu241, and Thr274 are identified as the candidate NAD+ binding sites that are generally required for this family of SAH hydrolase-related enzymes to execute the catalytic function. Specific residues from the C terminus of the adjacent subunit also interact with the NAD+, forming a covering over the binding site. This structure might partly explain the difficulty in the release of NAD+ (Fig. 4c). The 2 binding to the active sites also seems to be similar to those of the model structure complexed with adenosine. His56 and His299 may form hydrogen bonds with the 6′-hydroxyl of 2, and residues Asp156, Asp188, and Lys184 probably interacted with the hydroxyls of the furanosyl sugar (Fig. 4c and Supplementary Fig. 11). To verify these deductions, we performed selective mutational assays and determined that the AgmF variants H56A, D129A, K184A, D188A, and H299A completely abolished the activity for catalyzing 2 to 1, thereby demonstrating that these residues are functionally essential for AgmF activity (Fig. 4d and Supplementary Fig. 12c). Hence, these combined data demonstrate that AgmF functions as an unusual NAD+-dependent dehydratase. On account of the structural similarity between 2 and adenosine, we also assessed if adenosine could act as a competitive inhibitor, and the results indicated that the AgmF was found to be almost inactivated in the presence of a 3-fold excess of exogenously added adenosine (Supplementary Fig. 14).
AgmF differs from the classical family of SAH hydrolases in catalytic function, particularly in the recognition of a distinct substrate, while the key catalytic residues are conserved, suggesting that AgmF may adopt a similar catalytic mechanism to that of the typical SAH hydrolase. Concerning the dehydration reaction, several enzymatic strategies are derived from the common dehydratase domain in polyketide synthase36, the heme iron-dependent aldoxime dehydratase37, the dihydroxyacid dehydratase (containing a 2Fe–2S cluster)38, and the glutamyl-tRNA-dependent dehydratase in RiPPs (ribosomally synthesized and post-translationally modified peptides) biosynthesis39. The generic mycobacterial SAH hydrolase has been shown to catalyze a reversible reaction via an oxidation–reduction mechanism, which is associated with a cryptic dehydration step. However, the self-sufficient strategy exploited for the NAD+ cofactor recycling in the AgmF reaction is interesting. We, therefore, propose that Lys185 is likely to act as the basic residue extracting the proton from the 4′-OH group during oxidation to 4′-keto angustmycin C (2a) and that Lys185 is more basic because of its proximity to Asp188 and Ser190 (Fig. 4e). The carboxyl group of Asp129 can make an interaction with H-4′ and catalyze a proton abstraction at C-4′ to form 2b (Fig. 4e). Moreover, the two histidine residues His56 and His299 are speculated to form hydrogen bonds with O6′ of 2 and may be involved in the removal of the 6′-OH of 2b to form 2c. Finally, the end product 1 is produced with recycling of the NAD+ cofactor (Fig. 4e and Supplementary Fig. 13b, c). The leaving group proposed for the AgmF-catalyzed reaction is water, which is relatively poor when compared to the 5′-homocysteine motif for the SAH hydrolase family enzymes32. However, this may be promoted by more favorable conditions within the active site of AgmF or possibly involve a transient leaving group at the 5′ position (Fig. 4e and Supplementary Fig. 13c).
We also note that AgmF homologs are actually far more widely distributed in the microbial genomes than was previously supposed, which could be shown in a colored SSN analysis (Supplementary Fig. 15). A phylogenetic analysis of AgmF against other SAH hydrolases showed that the AgmF-related dehydratases formed a separate cluster from other enzymes (Supplementary Fig. 16), suggesting that AgmF represents a family of dehydratases using an unusual self-sufficient strategy for the cofactor NAD+ recycling.
AgmE, C, and B harbor individual activities of APRTase, ribose 5-P pyrophosphokinase, and AMP phosphatase
An initial in silico analysis showed that AgmE contains a conserved phosphoribosyltransferase domain. To evaluate if the protein carries out the expected adenine phosphoribosyltransferase (APRTase) function, it was overexpressed and purified from E. coli BL21(DE3), and assayed in vitro with the substrates adenine and phosphoribosyl pyrophosphate (PRPP). HPLC analysis of the reaction products showed that the AgmE-catalyzed reaction is capable of generating a characteristic AMP peak, whose identity was further verified by LC-MS, and indicating that AgmE plays a similar role as APRTase during 1 biosynthesis (Supplementary Fig. 17).
As for the catalytic function of AgmC, in silico analysis indicated that it harbors a conserved ribose-phosphate pyrophosphokinase domain. To determine if this enzyme executes a related activity, we performed an in vitro enzyme assay using ribose 5-phosphate (ribose 5-P) as substrate. To facilitate the detection of the enzymatic product (PRPP), we conducted an assay to indirectly monitor the conversion of ATP to AMP, using a coupled AgmC-AgmE reaction to trap PRPP to form the product AMP. As anticipated, the coupled reaction synthesized an enhanced amount of AMP in comparison with that of the negative control (without adenine added), showing that AgmC could utilize ATP to give AMP (Supplementary Fig. 18d). The AgmC-AgmE-coupled reactions lacking ribose 5-P (or lacking both ribose 5-P and ATP) also abolished the production of the target product AMP. These data demonstrate that AgmC performs a catalytic pyrophosphokinase-like function in the process of 1 biosynthesis (Supplementary Fig. 18).
Similar bioinformatics analysis for the function of AgmB revealed a conserved HAD signature motif with 51% identity to C5D20_10175 (HAD-like hydrolase) of Rathayibacter toxicus. This family of enzymes unusually catalyzes nucleophilic substitution reactions at phosphorus or carbon centers, using a conserved Asp carboxylate anion in the active site40. All members of this family possess a conserved alpha/beta core domain. To assess the function of the purified AgmB enzyme, we tested its activity in vitro and, as expected, found that the AgmB enzyme is indeed able to convert AMP to adenosine, the identity of which was verified by LC-MS (Supplementary Fig. 19). Collectively, this provides biochemical evidence that AgmE, AgmB, and AgmC have the activities of APRTase, AMP phosphatase, and ribose 5-P pyrophosphokinase, and they are most likely to play related functional roles during 1 biosynthesis.
Reconstitution of the complete six-enzyme pathway in vitro to 1 and 2
The functional role of AgmD in the 1 biosynthetic pathway also remained undetermined, and by in silico analysis showed 31% identity to AlsE of E. coli. The AlsE enzyme was previously characterized as a d-allulose 6-phosphate 3-epimerase that utilizes d-fructose 6-phosphate as substrate. In addition, earlier metabolic labeling experiments indicated that 1 arises directly from either d-glucose or d-fructose. This suggested that AgmD should play a 3-epimerase role similar to AlsE and provides the basis for further understanding of 1 biosynthesis (Supplementary Fig. 20a–c).
To further decipher the biosynthetic logic underlying 1 biosynthesis, we reconstituted the complete biosynthetic pathway to 1 in vitro. The six enzymes covering AgmA-F were individually overexpressed and purified from E. coli BL21(DE3), allowing for the construction of the biosynthetic pathway to 1 de novo (Fig. 5a, b). HPLC analysis of the products showed that the complete AgmA-F reactions are capable of producing the distinctive 2, adenine, and 1 peaks, whose corresponding [M + H]+ ions m/z 298.1143, 136.0617, and 280.1039 (and related fragment ions) are fully consistent to those of authentic standards. It was also found that the enzymatic reaction including AlsE, AgmA-C, and AgmE-F is also capable of producing the peaks of 2, adenine, and 1 peaks, but with a higher overall efficiency, showing that AlsE and AgmD share the same functional role. These data demonstrated that the six enzymes (AgmD, C, A, E, B, and F) catalyze the relay reactions with fructose 6-phosphate as the starter substrate in the biosynthesis of angustmycins (Fig. 5c and Supplementary Table 5).
Based on these results, we then attempted to dissect the individual stepwise reactions occurring during 1 biosynthesis. HPLC analysis of the products indicated that the reaction (AlsE + AgmC) was able to generate a characteristic AMP peak, which was absent from the AlsE reaction. LC-MS analysis of this peak gave a [M + H]+ molecular ion at m/z 348.0703 and major fragment ions at 136.0617 and 250.0937, fully matched to those of the AMP authentic standard. These data established that AgmC is a pyrophosphokinase that uses allulose 6-phosphate as the acceptor substrate. The coupled reactions containing AlsE, AgmC, and AgmA also gave the adenine peak, but without the coproduction of AMP. Hence, the substrate of AgmA is probably also the product of AgmC. This intermediate was further consumed by the addition of the AgmE enzyme in the related coupled reaction to form 5. The coupled LC-MS analysis of 5 gave a [M + H]+ ion at m/z 378.0806, and main fragment ions of m/z 360.0691, 262.0933, and 136.0619, demonstrating that AgmE is an adenine phosphoallulosyltransferase. 5 was revealed to be specifically dephosphorylated to render 2, whose identity was confirmed by LC-MS (Fig. 5c, Supplementary Fig. 21, and Supplementary Table 5). To analyze their relative catalytic activities, the generation of 2 by the five-enzyme coupled reaction was measured with either AlsE or AgmD. The relative activity of AgmD is 22% when compared with that of AlsE, consistent with the result of the one-pot in vitro reaction (Supplementary Fig. 20d). The stepwise reactions were also individually carried out to illustrate that AgmD has a function identical to AlsE (Supplementary Fig. 22 and Supplementary Table 5), and the accumulation of adenine and consumption of 5 would affect the one-pot reaction efficiency (Supplementary Figs. 23 and 24).
In the present paper, we have demonstrated that the complete 1 biosynthetic pathway consisting of six enzymes is sequentially arranged to synthesize 1. The biosynthetic pathway starts with a glycosyl epimerization step from d-fructose 6-phosphate to d-allulose 6-phosphate as catalyzed by AgmD, and then successively undergoes pyrophosphorylation, adenine phosphoallulosyltransfer, and dephosphorylation to produce 2. After the final dehydration step, 2 is converted to 1. Notably, the cofactors involving NAD+ and NADP+ are recycled during 1 biosynthesis, and AMP plays dual roles as both the substrate and the product. These latter results suggest that 1 biosynthesis exploits a self-sufficient and efficient strategy for cofactor recycling and substrate utilization, which is unusual in natural product biosynthesis (Fig. 5d).
Utilization of E. coli as a cell factory for engineered production of 1 and 2
Finally, we have also explored the potential of utilizing E. coli as a cell factory for the production of 1. The six biosynthetic genes (agmDCAEBF, listed according to the reaction sequence) were introduced into E. coli GYJ23, a strain engineered for nucleoside production, via a three-plasmid system (Fig. 6a). After confirmation, the transformant GYJ23/agmDCAEBF was shown by HPLC to be capable of generating the target peaks of 1 and 2, whose identities were further confirmed by LC-MS analysis (Fig. 6b and Supplementary Table 5). The yields of 2 and 1 could independently reach 110 and 370 μg/mL after 96 h fermentation. Interestingly, we found the strain (GYJ23/agmDCAEB, lacking agmF) is capable of targeted accumulating 2 with a yield of 420 μg/mL after 96 h fermentation (Fig. 6c, d).
We also evaluated the production of 1 and 2 conferred by the set of six genes with agmD replaced by alsE. HPLC analysis showed that the strain GYJ23/(alsE + agmCAEBF) also produces 1 and 2, but with higher titers (Fig. 6b and Supplementary Table 5). Hence, the 2 titer of the strain GYJ23/(alsE + agmCAEB) was 780 μg/mL, which is considerably higher than that of the GYJ23/agmDCAEB strain. All of these data demonstrate that suitably engineered E. coli strains could be used as a robust cell factory for the efficient production of the nucleoside antibiotic 1 (Fig. 6c, d). The successful production of 1 and 2 in a heterologous E. coli cell factory raised the question of why the strain GYJ23/(alsE + agmCAE) is not capable of accumulating 5 and adenine. This may in part be because phosphate-containing compounds, such as nucleotides, are less easily transported out of cells. Concerning adenine, as a primary metabolite, it is also perhaps prone to being recycled during the cell cycle of E. coli.
In summary, we report the discovery and characterization of the gene cluster responsible for angustmycin A (1) biosynthesis and we provide evidence that AgmA (phosphoribohydrolase), AgmB (phosphatase), AgmC (pyrophosphokinase), AgmD (epimerase), and AgmE (phosphoallulosyltransferase) collaborate together to biosynthesize the angustmycin C (2) molecule. We have also unraveled that AgmF functions as an unusual dehydratase for the final tailoring step to 1 via a self-sufficient strategy for NAD+ cofactor recycling. Moreover, we have reconstituted the complete biosynthetic pathway to 1 in vitro and successfully engineered the production of 1 in a robust cell factory of E. coli. We anticipate that deciphering the angustmycin pathway will expand the biochemical repertoire for the intriguing enzymatic reactions leading to nucleoside natural product biosynthesis (while this paper was under review, a part result of the angustmycin biosynthesis was reported by Shiraishi et al.41; they characterized the gene cluster and identified part of the AgmF function (dehydratase)), and open the way to the rapid and rational discovery of more purine nucleoside antibiotics related to 1 (Supplementary Fig. 25).
Methods
General methods
Strains and plasmids used in this study are described in Supplementary Table 6 and primers are listed in Supplementary Table 7. General methods employed in this work follow the standard protocols42,43.
Enzymes, chemicals, and reagents
All of the enzymes used in this study were purchased from New England Biolabs. The chemicals and reagents were the products of Sigma-Aldrich, Thermo Scientific, OMEGA, or J&K Scientific.
Sequencing analysis of the genome of actinobacteria strain
Sequencing of the genome of the actinobacteria strain was performed on Nanopore and Illumina Hiseq2500 machine. The sequencing reads for S. angustmyceticus JCM 4053 and S. decoyicus NRRL 2666 are 7.41 and 7.97 million, respectively. The raw data were assembled by Flye (Version: 2.8.3) and Unicycler (Version: 0.4.9) to obtain the scaffold. Hmmer (Version: 3.3.1) software was used for the annotation of the genomes. Bioinformatic analysis of the target DNA region was performed on the basis of the online programs FramePlot 4.0beta (http://nocardia.nih.go.jp/fp4/) and 2ndFind (http://biosyn.nih.go.jp/2ndfind/).
Direct cloning of the whole agm gene cluster and its variants
The whole agm gene cluster was directly cloned by a two-step PCR strategy and the detailed procedure is described as follows: the two pieces (left, 4603 bp; right, 4636 bp) housing the whole agm gene cluster were individually amplified by PrimeSTAR Max DNA Polymerase (Takara) with primers (piece1-F/R and piece2-F/R) (Supplementary Table 7) and template (genomic DNA of S. angustmyceticus JCM 4053) (Supplementary Fig. 2a). The two pieces were cloned into the site between EcoRI and XbaI of the vector pSET152 by Gibson Assembly method to obtain the resulting plasmid pCHW501 (Supplementary Fig. 2a). The agmF-deleted gene cluster cloning was similar to that described above. The two pieces (left, 4603 bp; right, 3148 bp) were individually amplified by PrimeSTAR Max DNA Polymerase (Takara) with primers (piece1-F/R and piece3-F/R) (Supplementary Table 7) and template (genomic DNA of S. angustmyceticus JCM 4053) (Supplementary Fig. 2b). The two pieces were cloned into the sites between EcoRI and XbaI of vector pSET152 by Gibson Assembly method to obtain the plasmid pCHW501ΔagmF (Supplementary Fig. 2b). The two recombinant plasmids were confirmed by PCR with primers agm id-F/R (Supplementary Table 7). The agmR-deleted gene cluster cloning was based on pCHW501. Two pieces (left, 3042 bp; right, 417 bp) were individually amplified by PrimeSTAR Max DNA Polymerase (Takara) with primers (DagmR-First-F/R and DagmR-Second-F/R) (Supplementary Table 7) and template (genomic DNA of S. angustmyceticus JCM 4053). The two pieces were cloned into the site between EcoRI and BglII of vector pCHW501 by Gibson Assembly method to obtain the agmR in-frame deletion plasmid pCHW501ΔagmR, which was confirmed by PCR with primers agmR id-F/R (Supplementary Fig. 2c and Supplementary Table 7).
Fermentation and detection of angustmycins
For fermentation, S. angustmyceticus JCM 4053 and S. decoyicus NRRL 2666 were independently cultivated on YS agar (including 2 g yeast extract, 10 g soluble starch, and 15 g agar per liter, pH 7.3) and MS plate43, and spores of related strains were inoculated in ISP2 medium44 and cultivated for 2 days; after that, the cultures (2%, V/V) were transferred to fermentation medium45 and fermented (180 r/min, 30 °C) for 5 days. For HPLC and LC-high resolution mass spectrometry (HRMS) analysis, the fermentation broth was processed by adding oxalic acid till pH 5.0. HPLC analysis was performed using Shimadzu LC-20AT equipped with C18 column (Diamonsil, 5 μm, 4.6 × 250 mm) under 0.15% aqueous formic acid (95%): methanol (5%) at a flow rate of 0.5 mL/min over 30 min and the detection wavelength was 254 nm. LC-HRMS analysis was carried out on a Thermo Fisher Scientific ESI-LTQ Orbitrap (Scientific Inc.) instrument controlled by Xcalibur in the positive mode. The parameters for HRMS analysis are as follows: a capillary temperature of 275 °C and a capillary voltage of 35 V, and the error in this work is 10 p.p.m. The bioassay method was described in Supplementary Method 1, and the extraction, purification, and chemical structure identification of angustmycins were described in Supplementary Methods 2 and 3.
Model building and analysis of AgmF structure
The homology structural model of AgmF was constructed according to the X-ray structure of NP_217765 from Mycobacterium tuberculosis H37Rv (PDB 3CE6, https://www.rcsb.org/structure/3CE6) and further refined using Discovery Studio 3.0 (Accelrys).
In vitro assay of AgmA
Reaction mixtures (50 μL each) consisted of a solution containing 50 mM Tris-Cl buffer (pH 7.5), 1 mM AMP (dAMP), and 10 μg AgmA at 30 °C. The reaction was terminated after 2 h by the addition of an equivalent volume (50 μL) of methanol. Following centrifugation to remove the protein, the reaction was analyzed by HPLC and LC-HRMS equipped with a reverse-phase C18 column with 0.15% aqueous formic acid (95%): methanol (5%) at a flow rate of 0.5 mL/min over 30 min, and the detection wavelength was 254 nm.
In vitro assay of AgmF
Reaction mixtures (50 μL each) consisted of a solution containing 50 mM PBS buffer (pH 7.5), 1 mM NAD+, 1 mM 2, and 10 μg AgmF at 30 °C. The reaction was terminated after 2 h by the addition of an equivalent volume (50 μL) of methanol. Following centrifugation to remove the protein, the reaction was analyzed by HPLC and LC-HRMS equipped with a reverse-phase C18 column with 0.15% aqueous formic acid (95%): methanol (5%) at a flow rate of 0.5 mL/min over 30 min and the detection wavelength was 254 nm.
In vitro assay of AgmB, AgmC, and AgmE
AgmB reaction mixtures (50 μL each) consisted of a solution containing 50 mM Tris-Cl buffer (pH 7.0), 1 mM AMP, 1 mM Mg2+, and 10 μg AgmB at 30 °C for 2 h; AgmE reaction mixtures (50 μL each) were composed of a solution containing 50 mM PBS buffer (pH 7.5), 1 mM adenosine, 1 mM PRPP, 1 mM Mg2+, and 10 μg AgmE at 30 °C for 2 h; AgmC and AgmE combined reaction mixtures (50 μL each) consisted of a solution containing 50 mM Tris-Cl buffer (pH 7.5), 1 mM ribose 5-P, 1 mM ATP, 1 mM Mg2+, 10 μg AgmC, and 10 μg AgmE at 30 °C for 2 h. The post-processing and analysis methods of these reactions were identical to that of the AgmF reaction. The protocol of overexpression and purification of the target proteins was described in Supplementary Method 4.
One-pot reaction for 1 and 2 biosynthesis
Six-protein reaction mixtures (50 μL each) consisted of a solution containing 50 mM Tris-Cl buffer (pH 8.0), 10 mM fructose 6-phosphate, 1 mM ATP, 0.5 mM NADP+, 4 mM Mg2+, and 10 μg of individual proteins (AgmD/AlsE, AgmC, AgmA, AgmE, AgmB, and AgmF) at 30 °C for 2 h. The reaction was terminated after 2 h by the addition of an equivalent volume (50 μL) of methanol. Following centrifugation to remove the protein, the reaction was analyzed by HPLC and LC-HRMS equipped with a reverse-phase C18 column with 0.15% aqueous trifluoroacetic acid (95%): methanol (5%) at a flow rate of 0.5 mL/min over 30 min. The process of heterologous expression in E. coli was as described in Supplementary Method 5.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Genomes of S. angustmyceticus JCM 4053 and S. decoyicus NRRL 2666 have been deposited in GenBank under accession nos. CP082945 and CP082301, respectively. The two gene cluster sequences from S. angustmyceticus JCM 4053 and S. decoyicus NRRL 2666 have been deposited in GenBank under accession nos. MZ151497 and MZ151498, respectively. All other data generated and analyzed in this study are available within the article and the Supplementary information. Source data are provided with this paper.
References
Chen, W. et al. Natural and engineered biosynthesis of nucleoside antibiotics in Actinomycetes. J. Ind. Microbiol. Biotechnol. 43, 401–417 (2016).
Gong, R. et al. Harnessing synthetic biology-based strategies for engineered biosynthesis of nucleoside natural products in actinobacteria. Biotechnol. Adv. 46, 107673 (2021).
Yuntsen, H., Yonehara, H. & Ui, H. Studies on a new antibiotic, angustmycin. I. J. Antibiot. 7, 113–115 (1954).
Yuntsen, H., Ohkuma, K., Ishii, Y. & Yonehara, H. Studies on angustmycin. III. J. Antibiot. 9, 195–201 (1956).
Eble, T. E., Hoeksema, H., Boyack, G. A. & Savage, G. M. Psicofuranine. I. Discovery, isolation, and properties. Antibiot. Chemother. 9, 419–420 (1959).
Wyszynski, F. J. et al. Biosynthesis of the tunicamycin antibiotics proceeds via unique exo-glycal intermediates. Nat. Chem. 4, 539–546 (2012).
Lamberth, C. The chemistry of 4′,5′-unsaturated nucleosides. A review. Org. Prep. Proced. Int. 31, 379–397 (1999).
Richardson, S. K., Howell, A. R. & Taboada, R. Synthesis and properties of psico-nucleosides. Org. Prep. Proced. Int. 38, 101–176 (2006).
McCarthy, J. R. Jr, Robins, R. K. & Robins, M. J. Purine nucleosides. XXII. The synthesis of angustmycin A (decoyinine) and related unsaturated nucleosides. J. Am. Chem. Soc. 90, 4993–4999 (1968).
Farkaš, J. & Šorm, F. Synthesis of psicofuranine. Tetrahedron Lett. 3, 813–814 (1962).
Bodenteich, M. & Marquez, V. E. Synthesis of (±)-psicoplanocin A.: a carbocyclic nucleoside combining the structural features of psicofuranine and neplanocln A. Tetrahedron Lett. 31, 5977–5980 (1990).
Haraguchi, K., Takahashi, H. & Tanaka, H. Stereoselective entry to 1′-C-branched 4′-thionucleosides from 4-thiofuranoid glycal: synthesis of 4′-thioangustmycin C. Tetrahedron Lett. 43, 5657–5660 (2002).
Sugimori, T. & Suhadolnik, R. J. The biosynthesis of psicofuranine. J. Am. Chem. Soc. 87, 1136–1137 (1965).
Chassy, B. M., Sugimori, T. & Suhadolnik, R. J. The biosynthesis of the 6-deoxy-D-erythro-2,5-hexodiulose sugar of decoyinine. BBA Gen. Subj. 130, 12–18 (1966).
Hanka, L. J. Mechanism of action of psicofuranine. J. Bacteriol. 80, 30–36 (1960).
Slechta, L. Studies on the mode of action of psicofuranine. Biochem. Pharmacol. 5, 96–107 (1960).
Mitani, T., Heinze, J. E. & Freese, E. Induction of sporulation in Bacillus subtilis by decoyinine or hadacidin. Biochem. Biophys. Res. Commun. 77, 1118–1125 (1977).
Zain-ul-abedin, Lopez, J. M. & Freese, E. Induction of bacterial differentiation by adenine- and adenosine-analogs and inhibitors of nucleic acid synthesis. Nucleos. Nucleot. Nucl. 2, 257–274 (1983)..
Bianchi-Smiraglia, A. et al. Pharmacological targeting of guanosine monophosphate synthase suppresses melanoma cell invasion and tumorigenicity. Cell Death Differ. 22, 1858–1864 (2015).
Khan, N. et al. Solution structure and functional investigation of human guanylate kinase reveals allosteric networking and a crucial role for the enzyme in cancer. J. Biol. Chem. 294, 11920–11933 (2019).
Hong-Yuan, X., Hong-Zhang, X., Bing, H. & Feng-Jue, Z. in Biotechnology and Sustainable Agriculture 2006 and Beyond (eds. Xu, Z., Li, J., Xue, Y. & Yang, W.) 97–101 (Springer Netherlands, 2010).
Davies, P. J. Plant Hormones, Vol. 2 (British Medical Journal, 2005).
Kieber, J. J. & Schaller, G. E. Cytokinins, Vol. 12 (American Society of Plant Biologists, 2014).
Letham, D. S. Regulators of cell division in plant tissues: V. A comparison of the activities of zeatin and other cytokinins in five bioassays. Planta 74, 228–242 (1967).
Sakakibara, H. Cytokinin biosynthesis and transport for systemic nitrogen signaling. Plant J. 105, 421–430 (2021).
Sakakibara, H. in Plant Hormones: Biosynthesis, Signal Transduction, Action! (ed. Davies, P. J.) 45–114 (Springer Netherlands, 2010).
Han, Z., Chen, R. X. & Xu, H. Z. Advances in angustmycins and their applications. Zhongguo Kang Sheng Su Za Zhi 32, 329–333 (2007).
Kurakawa, T. et al. Direct control of shoot meristem activity by a cytokinin-activating enzyme. Nature 445, 652–655 (2007).
Paulsen, I. T., Sliwinski, M. K. & Saier, M. H. Jr Microbial genome analyses: global comparisons of transport capabilities based on phylogenies, bioenergetics and substrate specificities. J. Mol. Biol. 277, 573–592 (1998).
Zallot, R., Oberg, N. & Gerlt, J. A. The EFI web resource for genomic enzymology tools: Leveraging protein, genome, and metagenome databases to discover novel enzymes and metabolic pathways. Biochemistry 58, 4169–4182 (2019).
Naseem, M., Bencurova, E. & Dandekar, T. The cytokinin-activating LOG-family proteins are not lysine decarboxylases. Trends Biochem. Sci. 43, 232–236 (2018).
Takata, Y. et al. Catalytic mechanism of S-adenosylhomocysteine hydrolase. Site-directed mutagenesis of Asp-130, Lys-185, Asp-189, and Asn-190. J. Biol. Chem. 277, 22670–22676 (2002).
Yamada, T. et al. Catalytic mechanism of S-adenosylhomocysteine hydrolase: roles of His 54, Asp130, Glu155, Lys185, and Aspl89. Int. J. Biochem. Cell Biol. 37, 2417–2435 (2005).
Reddy, M. C. et al. Crystal structures of Mycobacterium tuberculosis S-adenosyl-L-homocysteine hydrolase in ternary complex with substrate and inhibitors. Protein Sci. 17, 2134–2144 (2008).
Yuan, C. S., Ault-Riché, D. B. & Borchardt, R. T. Chemical modification and site-directed mutagenesis of cysteine residues in human placental S-adenosylhomocysteine hydrolase. J. Biol. Chem. 271, 28009–28016 (1996).
Li, Y. et al. Functional characterization of a dehydratase domain from the pikromycin polyketide synthase. J. Am. Chem. Soc. 137, 7003–7006 (2015).
Nomura, J. et al. Crystal structure of aldoxime dehydratase and its catalytic mechanism involved in carbon-nitrogen triple-bond synthesis. Proc. Natl Acad. Sci. USA 110, 2810–2815 (2013).
Zhang, P. et al. Cyanobacterial dihydroxyacid dehydratases are a promising growth inhibition target. ACS Chem. Biol. 15, 2281–2288 (2020).
Bothwell, I. R. et al. Characterization of glutamyl-tRNA-dependent dehydratases using nonreactive substrate mimics. Proc. Natl Acad. Sci. USA 116, 17245–17250 (2019).
Lu, Z., Dunaway-Mariano, D. & Allen, K. N. HAD superfamily phosphotransferase substrate diversification: structure and function analysis of HAD subclass IIB sugar phosphatase BT4131. Biochemistry 44, 8684–8696 (2005).
Shiraishi, T., Xia, J., Kato, T. & Kuzuyama, T. Biosynthesis of the nucleoside antibiotic angustmycins: identification and characterization of the biosynthetic gene cluster reveal unprecedented dehydratase required for exo-glycal formation. J. Antibiot. 74, 830–833 (2021) .
Sambrock, J. & Russel, D. Molecular Cloning: A Laboratory Manual 3rd edn (Cold Spring Harbor Laboratory Press, 2001).
Kieser, T. et al. Practical Streptomyces Genetics: A Laboratory Manual (John Innes Foundation, 2000).
Shirling, E. & Gottlieb, D. Methods for characterizing Streptomyces species. Int. J. Syst. Bacteriol. 16, https://doi.org/10.1099/00207713-16-3-313 (1966).
Ochi, K., Yashima, S., Eguchi, Y. & Matsushita, K. Biosynthesis of formycin. Incorporation and distribution of 13C-, 14C-, and 15N-labeled compounds into formycin. J. Biol. Chem. 254, 8819–8824 (1979).
Acknowledgements
This work was supported by grants National Key R & D Program of China (2021YFC2100600 to W.C.), the National Natural Science Foundation of China (31770041, 31970052, and 32170026 to W.C.), the Hubei Province’s Outstanding Medical Academic Leader program to W.C., and the Open Funding Project of the State Key Laboratory of Microbial Metabolism (MMLKF19- 03 to W.C.). Mention of any trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture, who is an equal opportunity provider and employer.
Author information
Authors and Affiliations
Contributions
L.Y., W.Z., and Y.S. conducted the genetic and biochemical experiments. Y.-S.C. analyzed the NMR data. H.M. helped with the biochemical experiments. M.J. assisted with the proposing of the related enzymatic mechanism. W.C. and N.P.J.P conceived and directed the research. W.C. and N.P.J.P. wrote the manuscript. Z.D. revised the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks Satish Nair and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yu, L., Zhou, W., She, Y. et al. Efficient biosynthesis of nucleoside cytokinin angustmycin A containing an unusual sugar system. Nat Commun 12, 6633 (2021). https://doi.org/10.1038/s41467-021-26928-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-021-26928-y
This article is cited by
-
A guide for the synthesis of key nucleoside scaffolds in drug discovery
Medicinal Chemistry Research (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.