Structural basis for the biosynthesis of lovastatin

Wang, Jialiang; Liang, Jingdan; Chen, Lu; Zhang, Wei; Kong, Liangliang; Peng, Chao; Su, Chen; Tang, Yi; Deng, Zixin; Wang, Zhijun

doi:10.1038/s41467-021-21174-8

Download PDF

Article
Open access
Published: 08 February 2021

Structural basis for the biosynthesis of lovastatin

Nature Communications volume 12, Article number: 867 (2021) Cite this article

9058 Accesses
38 Citations
14 Altmetric
Metrics details

Subjects

Abstract

Statins are effective cholesterol-lowering drugs. Lovastatin, one of the precursors of statins, is formed from dihydromonacolin L (DML), which is synthesized by lovastatin nonaketide synthase (LovB), with the assistance of a separate trans-acting enoyl reductase (LovC). A full DML synthesis comprises 8 polyketide synthetic cycles with about 35 steps. The assembling of the LovB–LovC complex, and the structural basis for the iterative and yet permutative functions of the megasynthase have remained a mystery. Here, we present the cryo-EM structures of the LovB–LovC complex at 3.60 Å and the core LovB at 2.91 Å resolution. The domain organization of LovB is an X-shaped face-to-face dimer containing eight connected domains. The binding of LovC laterally to the malonyl-acetyl transferase domain allows the completion of a L-shaped catalytic chamber consisting of six active domains. This architecture and the structural details of the megasynthase provide the basis for the processing of the intermediates by the individual catalytic domains. The detailed architectural model provides structural insights that may enable the re-engineering of the megasynthase for the generation of new statins.

Interim analyses of a first-in-human phase 1/2 mRNA trial for propionic acidaemia

Article 03 April 2024

Dwight Koeberl, Andreas Schulze, … Stephanie Grunewald

RNAi-based drug design: considerations and future directions

Article 03 April 2024

Qi Tang & Anastasia Khvorova

The landscape of small-molecule prodrugs

Article 02 April 2024

Zachary Fralish, Ashley Chen, … Daniel Reker

Introduction

Statins are inhibitors of hydroxymethylglutaryl-coenzyme reductase (HMG-CoA), which converts HMG-CoA to mevalonate, the rate-limiting step in cholesterol biosynthesis. This activity enables the medicinal use of statins to treat hypercholesterolemia and potentially to reduce mortality in multiple cancer types¹. Lovastatin is a precursor of the multibillion sold semisynthetic statins. It is biosynthesized in the filamentous fungus Aspergillus terreus^2,3,4,5. The first isolable intermediate of the lovastatin biosynthetic pathway, dihydromonacolin L (DML), is constructed by the highly reducing iterative polyketide synthase (HR-iPKS) LovB, which partners with LovC, an enoyl reductase that acts in trans^6,7.

LovB is a representative polyketide synthase that shares a common architecture and domain structures with animal and bacterial PKSs, which synthesize chemically diverse drugs and bioactive compounds⁸ (Supplementary Fig. 1a). In LovB, the β-ketoacyl synthase (KS), malonyl-acetyl transferase (MAT), dehydratase (DH), methyltransferase (CMeT), acyl carrier protein (ACP), and ketoreductase (KR) domains are active, except for the enoyl reductase (ER) domain. LovB terminates with a condensation (CON) domain commonly found in nonribosomal peptide synthetases (Supplementary Fig. 1b).

In DML biosynthesis by LovB, the initiation and elongation of the intermediate chain are carried out by the KS, MAT, and ACP domains. ß-Ketoacyl modification cycles are also repeated, but the tailoring domain usage during each iteration is highly programmed and permutative (Supplementary Fig. 1c). The combination of the KR–DH domains functions in iterations 1, 2, and 5, while the combination of the KR–DH–ER domains functions in iterations 4 and 6. The full domain usage of CMeT–KR–DH–ER occurs only once at iteration 3. The DH domain is then omitted, with only the KR domain used in the final two iterations 7 and 8. A Diels-Alder reaction is proposed to take place on the triene chain after iteration 5⁷. The structural basis for programming a single set of catalytic domains in the megasynthase differently during each iteration has remained a mystery for LovB.

Excellent domain swapping experimental results and biochemical assays of an isolated enoyl reductase from the Cox and Townsend labs have demonstrated, that the individual catalytic modifying domains themselves possess selectivity for specific substrates^9,10. The CMeT domain, which catalyzes methylation in polyketide formation for citrinin, functions as a gatekeeper¹¹. The starter unit selection carried out by the SAT domain for PksA also contribute to the selectivity¹². These observations led to the proposal that in the programming of iterative PKSs, ultimate control resides in the structure of the protein and the recognition of structurally ever-changing substrates⁹.

Current structural knowledge of HR-iPKS is derived from a bacterial type I PKS module^13,14, a hybrid MAS-like PKS¹⁵, the mammalian FAS (mFAS)^16,17 and several individual PKS domains¹⁸, which is insufficient for the understanding of lovastatin biosynthesis and the programming mechanism for HR-iPKSs. Therefore, key structural questions regarding topics such as the assembly of the whole enzyme, the architecture of the catalytic chamber, and the detailed constitution of individual catalytic tunnels need to be solved. Particularly in the biosynthesis of DML, the ER domain plays a key role in ensuring proper programming of the PKS, and such trans-acting ERs have been reported across various fungal PKS enzymes as a tactic in nature to diversify polyketides¹⁹. The site of interaction between the ER domain and the LovB PKS is unknown. These questions represent a barrier to full understanding of the catalytic cycles.

In this work, we determine the structures of LovB at 2.91 Å and the LovBC complex at 3.60 Å resolution using cryo-electron microscopy. The structure of LovB adopts an X-shaped dimer architecture, and the LovBC complex reveals the position of LovC which binds to the MAT domain of LovB, forming two complete L-shaped catalytic chambers. Mutational analysis of the LovB–C interface confirms the essential role of the catalytic chamber integrity for the production of DML. Together, our observation provides the structural basis for the iterative yet programmed biosynthesis of lovastatin.

Results

Overall architecture of the LovBC complex

We purified the full-length His-tagged LovB and LovC separately using a nickel column (Supplementary Fig. 2a, b). We then mixed LovB and LovC in a stoichiometric ratio of 1:1.2. The complex was then further purified using a final size exclusion column to remove the extra LovC (Supplementary Fig. 2c). Treatment of the protein sample using 1 mM DSS crosslinker before the size exclusion step helped reduce the monomeric LovB contaminant (Supplementary Fig. 2d). After the SEC step, three cofactors were re-added into the LovBC solution for cryo-EM sample preparation, ensuring that the particles were homogeneous enough for structure determination (Supplementary Fig. 3). None of the cofactors except NADP⁺ was observed in the final models. High-quality LovBC cryo-EM particles were obtained only by adding 2 mM of each of three cofactors (Mal-CoA, NADPH, and SAM) to the sample with the incubation for 1 h but in the absence of LovG (the product releasing thioesterase). The gain of high-quality particles could be due to that the majority of the protein complex was pushed to a conformation representing the final stage of the DML synthesis just before release. We collected cryo-EM images with a K2 Summit direct electron detector equipped on a Titan Krios electron microscope, and RELION was used for image processing^{20,21,22,23,24,25}. Rounds of 2D and 3D classification for particle selection and refinements were performed. Finally, the maps of overall 2.91 Å for LovB and 3.60 Å resolution for the LovBC complex were reconstructed (Supplementary Figs. 4 and 5). The resolution of the maps enabled us to reliably assign the individual domains, dissect their linker junctions and finally build the atomic models of LovB (Supplementary Fig. 6).

The LovBC complex adopts an X shape in the front view with two wings. Starting from the lower region, the KS and MAT domains are connected by the linker domain (LD), with the separate ER (LovC) domain interacting with the MAT domain (Fig. 1a, b). The MAT domain is linked to the upper region, which begins with the DH domain, followed by the intact CMeT domain, protruding from the relatively planar body of the whole complex. Then, the truncated ψKR domain is linked to the nonfunctional ψER domain, and the connected KR domain finally completes the upper tailoring region. The ACP and tethered CON domains were not solved, possibly due to the inherent flexibility of the ACP domain. The height of the complex is approximately 152 Å. LovB alone is ~176 Å wide, while the LovBC complex is ~294 Å wide. The thickness of the complex is ~95 Å. The two monomers contact each other with an approximately a 6197 Å² interface, which was mainly contributed by the KS, ψER, and DH domains, with buried surface areas of 2963 Å², 1545 Å², and 1250 Å², respectively. The post-MAT wing-junction linkers (Supplementary Fig. 7) mediate the contact of the upper wing with the lower wing. Contact between the KS and DH domains with a surface area of 304.78 Å² was also observed. Rounds of 3D classification generated two structures with slight differences in domain angles (≈0.4°), suggesting that the dynamic mobility of the whole structure is minimal (Supplementary Fig. 8).

**Fig. 1: Overall architecture of the LovBC complex.**

The adoption of a pseudo-twofold face-to-face symmetry of LovB, in connection with LovC, creates two L-shaped catalytic chambers. The chamber is chimeric, with the KS, and MAT domains coming from one chain, and the DH, CMeT, ψKR, ψER, and KR domains from another (Fig. 1c–e).

Structural analysis of LovB enzymatic domains

The dimeric KS–LD–MAT domains adopt a linear extended conformation (Fig. 2a), similar to the homologous mFAS and DEBS M3. Briefly, the MAT domain is slightly rotated relative to KS. They are connected by the 3α2β-fold linker domain (LD). The post-MAT linker can be divided into two parts. The lower part, together with the LD, play the structural roles in fixing position between KS and MAT. The upper part, defines the relative organization of the DH domains, and hence arranges the dimers into the face-to-face X conformation. The linker mediates the contact between the upper and lower wings and contributes to the fixative assembly of the complex as well (Supplementary Fig. 7), also no other significant conformation of LovB was detected through cryo-EM data processing, in contrast to mFAS, which has its lower wing rotated relative to the upper²⁶. The α, β-hydrolase core domain and ferredoxin-like subdomain contained MAT domain have conserved S656, R681, and H763 active site residues, compared with DEBS and mFAS (r.m.s deviation of 1.89 Å and 1.67 Å, respectively). The KS domain adopts an αβαβα structure and contains the conserved C181, H320, and H367 active site residues with r.m.s deviation of 1.57 Å and 1.81 Å, respectively, to DEBS and mFAS (Supplementary Fig. 13).

**Fig. 2: The KS–LD–MAT domains of LovB.**

Despite the common fold shared by the KS domains, the substrate tunnels in the KS domains of LovB show differences from those of mFAS and DEBS, highlighted by the disconnection of the PPant pocket from the acyl pocket (Fig. 2b), possibly due to the hydrophobic interaction of the residue F436 with M132. Although the acyl pocket tunnel is disconnected from the PP pocket tunnel observed here, it might represent a conformational state in the absence of an intermediate in the tunnel. The hydrophobic interacting residue pair (F436 and M132) could potentially function as part of a gate^27,28, that dynamically controls substrate access to the active site, prevents solvent access to specific regions of the protein, or synchronizes processes occurring in distant parts of the megasynthase. This truncated tunnel and its detailed constitutional surface may underpin the specific recognition of the relatively short acyl intermediates during the polyketide elongation cycles in DML synthesis.

The NADPH-dependent KR domain adopts the typical Rossmann fold, which belongs to the SDRs (short-chain dehydrogenases/reductases) family (Fig. 3a). The NADPH binding pocket with bound NADP⁺ nicotinamide ring can be clearly identified. The active site residues S2294 and Y2307 are conserved and the substrate tunnel travels along them. The structure is closely related to those of mFAS and Amp module 11 of modPKS (r.m.s deviation of 1.57 Å and 1.27 Å, respectively). While the substrate entry groove was narrower compared with that of mFAS or DEBS KR (Fig. 3b), which might due to that the particles were in the stage that ketoreduction has been completed. A 27 residues length of the post-KR loop was observed to interact with the KR domain with a buried surface of 563 Å². ψKR mainly serve as a structural role for the tailoring region completion, and is unable to bind NADPH due to the truncation of nearly half of the Rossmann fold compared with the fully active KR domain.

The organization of the two DH domains is V-shaped which is similar to mFAS, in contrast to the relatively linear DH domain organization in MAS and modPKSs¹⁵ (Supplementary Fig. 9). Each DH monomer of LovB adopts a pseudodimeric hot-dog fold and harbors the conserved active site residues H985, D1174, and Q1178, which are contributed by both hot-dog folds. The substrate tunnel begins at the α helix near the active site, and has a closed end inside the fold rather than traveling through the entire C-terminal hot dog fold as in mFAS. It is relatively shorter (~11 Å) than the tunnel of mFAS (~18 Å). Six tyrosine and phenylalanine residues surround the tunnel, ready for interaction with the hydrophobic elongated polyketide intermediates in DML synthesis.

The S-adenosyl-methionine (SAM)-dependent CMeT domain comprises two subdomains (Fig. 3d) and resembles the homologous modPKS CurJ (r.m.s deviation of 2.39 Å). The active site residues are located at the two-subdomain interface, and the conserved F1400 represents the substrate entrance region. The binding pocket for SAM is clear, and the location of the hydrophobic substrate cavity between two subdomains is facing towards the catalytic chamber of LovB for the access by ACP, which is necessary for methylation activity during DML synthesis.

The ψER adopts the medium-chain dehydrogenases/reductases MDR fold dimer. It lacks the active site residues, and the substrate tunnel is disrupted, leaving no space for substrate and NADP⁺ cofactor binding (Fig. 3e). This inactive version of the ER dimer mainly contributes to the architecture-fixing role, due to the extensive contacting interface they provide.

LovB–LovC interaction is essential for DML synthesis

We observe that 8% of the particles show density for both sides of LovC. Three-dimensional refinements resulted in an overall 4.21 Å resolution map (Supplementary Fig. 4). The EM density allows unambiguous fitting of the LovC crystal structure. However, the resolution at the LovB–LovC interface is too low for the modeling of the protein–protein interaction. 3D classification with a global search followed by local finer angular sampling resulted in 40.7% of particles showing a single chain of LovC. The reason that we observed particles with a single LovC chain could be that some of the LovC protein was disordered, or that some LovC fell off during the cryo-EM sample preparation plunge-frozen step. More likely, the particles were not evenly distributed in the solution and had some extent of directional preference (Supplementary Fig. 5d), or LovC was vibrating relatively to the LovB core part. Another possible physiologically relevant cause is that the binding of LovB with LovC has a moderate overall affinity, or there is a negative allosteric crosstalk between them to regulate the synthesis of the polyketide.

To obtain better density integrity for LovC part, the class of map with one LovC chain was further auto-refined and postprocessed. The density of LovC was resolved to the local resolution in the range of 6.1–9.9 Å, which is still not sufficient for precise model placement, but allows the low-resolution guided in silico docking of the LovC crystal structure. To further identify the critical interacting residues and more precisely describe the LovB–C interface, computational docking of the MAT–LovC interacting region (residues 695–757 of MAT) with LovC was performed using RosettaDock²⁹ (Fig. 4a, b). The 19 lowest-energy models of MAT generated by RosettaCM were used to dock with LovC, and all structures were kept rigid during the simulation process. The best performance of the first step was used as input for the second round of simulation. Successful docking was based on the formation of an energy funnel with rmsd <2.2 Å from the ten lowest-energy decoys. Figure 4a, b shows that one of the 19 models docks successfully with LovC.

**Fig. 4: Interaction between LovB and LovC is essential for the synthesis of DML.**

The LovB–C interface is approximately 522 Å², which is mainly contributed by several residues in the loop region of LovC and α helices of the MAT domain (Fig. 4c). To further verify the binding site, the interaction loops in LovC were mutated (T271L, R272I, K273G, and M274A). In parallel, three mutants of the MAT domain were also designed (MAT Mut1–3). Mut1 had E747A, D748A, E749A, and S750A mutations; Mut2 had H741A and G744A mutations; and Mut3 had D713A and A714S mutations. The mutant of LovC, the MAT domain and its mutants of LovB were cloned, expressed, and purified. Then the purified mutant proteins of the LovC and MAT domain were mixed and incubated for protein complex formation. Figure 4d shows that in contrast to the control incubation that contains the WT LovC and MAT domain protein, which coelute during size exclusion chromatography at 15 ml, mutation of the interacting loop on LovC disrupts the interaction. The LovC mutant and MAT domain protein elute separately. At the same time, the MAT domain Mut1 abolished interaction with LovC (Supplementary Fig. 10). These observations suggest that the loop within LovC and the helix of MAT domain of LovB are essential for the formation of the LovBC complex.

LovC functions as a gatekeeper for the normal lovastatin synthesis in A. terreus⁷. The gate-keeping function is specified by the specific recognition of the intermediates by the active site residues of LovC¹⁹. The residues interacting with LovB are not involved in the recognition of the substrates. This observation allowed us to test whether the binding of LovC to LovB plays a role, that is, whether the LovBC complex forms an integral catalytic chamber in the catalysis of DML synthesis. The significance of LovB–LovC binding for the integrity of the catalytic chamber was analyzed by an in vitro reaction assay catalyzed by LovB–LovC and LovG. Figure 4e shows that the interface mutation abolished the synthesis of DML acid, in spite of the full catalytic competence of the mutant (Supplementary Fig. 11). The interface mutant synthesized only pyrone shunt products as previously observed for LovB in the absence of LovC (Supplementary Fig. 12). We conclude that the formation of the LovBC complex is essential for the integrity of the catalytic chamber to the complete total synthesis of DML acid.

Discussion

HR-iPKSs exist widely in nature, including in animals³⁰. The structural basis of the LovB–LovC complex sheds light on the understanding of this family of megasynthases. For the total eight synthetic cycles LovBC uses to produce DML, the polyketide intermediate-tethered ACP domain needs to shuttle back and forth within the catalytic chamber to individual domains (Fig. 5a). The assembly of two LovB monomers between the KS–DH domains shows the interaction between the upper and lower wings (Supplementary Fig. 7), which could sterically hinder the large-scale domain rotation of LovB, which is in contrast with the observations in mFAS²⁶. Indeed, dramatic domain rotations in mFAS were not detected in our LovBC samples. Nevertheless, we envisage that the iterative domain interactions between the ACP domain and the catalytic domains are specified by the molecular surface observed here. Further structural study on the interaction of the ever-changing polyketide intermediates with the catalytic tunnels of each domain should reveal, how the HR-iPKS programs the specific permutative functions at each synthetic cycle (Fig. 5b).

**Fig. 5: Substrate shuttling within the catalytic chamber by the ACP domain.**

Methods

Strains, plasmids, and culture conditions

The strains, plasmids, and primers used in this work are listed in Supplementary Tables 1 and 2. The Saccharomyces cerevisiae and E. coli strains have been described in the previous publications⁷. YPD medium contains 20 g/l peptone, 10 g/l yeast extract, and 20 g/l dextrose. SC-Uracil dropout medium contains 5 g/l Bacto casamino acids, 6.7 g/l yeast nitrogen base with ammonium sulfate, 20 g/l dextrose, 0.2 g/l adenine hemisulfate, 0.2 g/l tryptophan, and 20 g/l Bacto agar (for solid medium). The Frozen-EZ Yeast Transformation IITMT2001 Kit was purchased from ZYMO RESEARCH CORP. Saccharomyces cerevisiae strains were routinely cultured in YPD medium. The yeast plasmid pXW_LovBcH was transformed into S. cerevisiae BJ5464-NpgA according to the previously published protocol⁷ using the Frozen-EZ Yeast Transformation IITMT2001 Kit.

For the overexpression of LovB, a single colony of S. cerevisiae BJ5464-NpgA/pXW_LovBcH was inoculated into 90 ml of SC-Uracil dropout medium in a 250 ml flask and cultured for 48 h at 28 °C with shaking at 220 rpm. Four milliliter aliquots of the culture were then separately inoculated into 1 l of YPD medium in a 3 l flask and cultured for 72 h at 28 °C with shaking at 220 rpm. Cells were harvested by centrifugation at 4500×g for 6 min. Approximately 20 g of cell paste was routinely obtained per 1 l culture. The cell pastes were flash-frozen and stored at −80 °C.

The DNA fragment containing the MAT domain of LovB (55 kDa) was amplified using primers Mat28_18_S and Mat28_18cH_A with plasmid pXW_LovBcH as the template. The vector pET28a was amplified by using primers V28_Mt55_S and V28_Mt55_A. The resulting DNA fragments were fused together using a Trelief™ SoSoo Cloning Kit (Tsingke Biological Technology). Transformation of the fusion product into E. coli DH10B generated the expression plasmid pLovB_MATcH. A polyhistidine tag was fused at the C-terminus of the protein.

For the construction of LovB_MAT mutants, pLovB_MATcH was used as a template. The primers were listed in Supplementary Table 2. These resulting PCR products were digested using DpnI to remove the template, and transformed into E. coli DH10B. The resulting plasmids were extracted from E. coli DH10B. After confirmation by sequencing, the mutational plasmids were transformed into E. coli BL21(DE3) for protein expression.

For the construction of the LovC mutant, the plasmid pET28_LovCcH was used as a template, and TRKM to LIGA-S and TRKM to LIGA-A were used as primers. The PCR product was digested using DpnI to remove the template and then transformed into E. coli DH10B. The resulting plasmids were isolated from E. coli DH10B, confirmed by sequencing and then transformed into E. coli BL21(DE3) for protein expression.

Bacterial cells were routinely cultured in Luria broth medium (10 g/l tryptone, 5 g/l yeast extract, and 10 g/l sodium chloride) supplemented with 50 µg/ml kanamycin when needed for strain selection. Specifically, the E. coli strains used in protein expression experiments were grown in 1 l of LB medium containing 50 µg/ml kanamycin at 37 °C with shaking at 220 rpm until the culture optical density at 600 nm (OD₆₀₀) reached 0.6. At this point, gene expression was induced by the addition of isopropyl-d-thiogalactopyranoside (IPTG) to a final concentration of 0.1 mM, and the culture was allowed to incubate for an additional 24 h at 16 °C. Cells were then harvested by centrifugation at 6000×g for 20 min, flash-frozen and stored at −80 °C.

Purification and sample preparations of His-tagged LovB, LovC and LovG

Polyethylene glycol-8000 (PEG8000), S-adenosyl-l-methionine (SAM), disuccinimidyl suberate (DSS), dimethylformamide (DMSO), and gravity columns were purchased from Sangon Biotech (Shanghai) Co., Ltd. TWEEN 20 and Millipore’s Amicon® Ultra-0.5 10k centrifugal filter devices were purchased from Sigma-Aldrich (Merck KGaA, Darmstadt, Germany). Ni-NTA resin and Superose 6 Increase 10/300 GL columns were purchased from GE Healthcare (GE Healthcare Life Sciences, Little Chalfont, UK). All experiments were performed at 4 °C unless indicated.

For the purification of LovC, 5 g of frozen cells were thawed, resuspended in 50 ml of buffer A (50 mM Tris·HCl pH 8.0, 150 mM NaCl, 5% glycerol, 40 mM imidazole) and lysed using a French press (Union-Biotech, Shanghai, China) operated at 4 °C. Cell debris was removed by centrifugation at 18,000×g for 30 min, and the resulting supernatant was loaded onto a 12 ml gravity-flow column packed with 2 ml of Ni-NTA resin pre-equilibrated with 20 ml of buffer A. The resin was then washed with 40 ml of buffer A. LovC was eluted using 5 ml of elution buffer (300 mM imidazole pH 8.0, 50 mM NaCl, 5% glycerol, 4 mM SAM). LovC was incubated on ice until its usage in the LovBC complex formation. The concentration of purified LovC was measured using the Bradford assay. Approximately 3 mg of LovC can routinely be obtained from 5 g of cell paste. The fluorometric activity assays of LovC and the mutants were carried out according to Ames et al.¹⁹. Purification of LovG was according to Xu et al³. Briefly, 3 g of frozen cells were thawed, resuspended in 30 ml of buffer A and lysed using the French press operated at 4 °C. Cell debris was removed by centrifugation at 18,000×g for 30 min, and the resulting supernatant was loaded onto a 12 ml gravity-flow column packed with 2 ml of Ni-NTA resin pre-equilibrated with 20 ml of buffer A. The resin was then washed with 40 mL of buffer A. LovG was eluted using 5 ml of elution buffer. Approximately 3 mg of LovG can routinely be obtained from 3 g of cell paste.

For the purification of LovB, 50 g of frozen cells were thawed, resuspended in 100 ml of buffer A and lysed by a French press operated at 4 °C (1100 bar). Cell debris was removed by centrifugation at 18,000×g for 60 min, and the resulting supernatant was precipitated using PEG8000. PEG8000 stock solution (50% w/v dissolved in 100 mM Tris pH 8.0) was added to the supernatant drop by drop slowly with stirring until the final concentration reached 8%. The solution was stirred for an additional 30 min and then centrifuged at 16,000×g for 10 min. The supernatant was discarded, and the resulting pellet was dissolved in 50 ml of buffer A. After centrifugation at 16,000×g and 4 °C for 10 min, the supernatant was aliquoted into two sterile 50 ml conical tubes, each containing 3 ml of Ni-NTA resin pre-equilibrated with buffer A and then gently rotated for 2 h using a QB-206 multipurpose shaker (Kylin-Bell, Haimen, China) for sufficient protein–resin interaction. After spinning at 800 g for 3 min, the supernatant was discarded, and a total of 6 ml of resin was transferred to a 20 ml gravity column using buffer A. The column was then washed with 60 ml of buffer A and eluted with 20 ml of elution buffer. The eluted liquid was collected in 5 ml aliquots. The two aliquots with the highest concentration were combined. Approximately 6 mg of LovB protein in 8 ml can be obtained from 50 g of frozen cells.

Six milligrams of LovB was mixed with 1 mg of LovC and 2 mM of each of the three cofactors (Mal-CoA, NADPH, and SAM) and incubated for at least 1 h. Then, 300 µl of 100 mM DSS (dissolved in DMSO) crosslinker was added to the mixture. The mixture was incubated on ice for 2 h for efficient crosslinking. The crosslinking reaction was quenched by adding 1 M Tris (pH 8.0) stock solution to a final concentration of 50 mM and incubating for an additional 30 min. The crosslinked LovBC solution was concentrated to 1 ml using a Millipore’s Amicon® Ultra 10k centrifugal filter device according to the protocol provided by the company. The solution was centrifuged at 16,000×g for 10 min to remove precipitates and then subjected to size exclusion chromatography using a pre-equilibrated Superose 6 Increase 10/300 GL column in sizing buffer (50 mM Tricine pH 8.0, 4 mM SAM) on an ÄKTA fast protein liquid chromatography system (GE Healthcare Life Sciences). The peak fractions were pooled and concentrated with the centrifugal filter device to a concentration of approximately 8 mg/ml (determined by Bradford assay using BSA as a standard). The sample was then added to a final concentration of 2 mM each of the three cofactors (Mal-CoA, NADPH, and SAM) again for cryo-EM specimen preparation.

Detection of DML acid from in vitro reconstitution experiments

Twenty-five micromolar of LovB was incubated with 25 μM WT or mutant of LovC, 25 μM LovG, 2 mM Malonyl-CoA, 2 mM NADPH, and 2 mM SAM in buffer (100 mM NaH₂PO₄, pH 7.4, 10% glycerol, 2 mM DTT, 2 mM EDTA) in a 250 μl solution at 25 °C for 24 h. Reactions were quenched and extracted twice with an equal volume of 99% ethyl acetate (EA)/1% acetic acid (AcOH). The organic phase was evaporated to dryness, and redissolved in 0.05 M NaOH in 15 μl of methanol and analyzed by LC-MS. LC-MS was conducted with an Agilent 1290 Infinity Liquid chromatography and 6545 Quadrupole Time-of-Flight Mass Spectrometer by using negative electrospray ionization and an Agilent 5μ 4.6 × 150 mm C18 reverse-phase column. Samples were separated at room temperature on a linear gradient of 5–95% CH₃CN (v/v) in H₂O supplemented with 0.05% (v/v) formic acid over 30 min, and held at 95% CH₃CN/ 0.05% formic acid for 30 min at a flow rate of 0.4 ml/min.

Fluorometric assay

The fluorometric activity assay was carried out using a BioTek Synergy 2 Multi-mode Microplate Reader with EX set to 340 nm and the EM set to 455 nm to follow the disappearance of EM₄₅₅, as NADPH was oxidized in the presence of substrate over time. Twenty-five micromolar of LovC (WT or Interface mutant or Active site residues mutant) was preincubated with 100 μM NADPH and added to the reaction solution (100 mM KH₂PO₄, pH 7.0, 2% DMSO, 200 μM Crotonoyl-CoA) in a total of 100 μl volumn in a Greiner 96 flat bottom plate. After quickly mixing the solution by pipetting, a total of 10 min scan with 20 s intervals was performed monitoring EM₄₅₅ at 25 °C. The mean value of relative fluorescence units (RFU) difference from 0 (Max) to 10 (Min) minutes was calculated. The relative activity of LovC Interface mutant was determined by the RFU difference value ratio to WT, which serve as positive control, and LovC active site residues mutant as negative control.

Cryo-EM specimen preparation and data acquisition

Immediately prior to specimen preparation, Tween 20 (10%) was added to the freshly purified LovBC complex to a final concentration of 0.1%, which improved the quality of vitreous ice in the specimen. Four microliter aliquots of specimen at ~8 mg/ml were applied to glow-discharged holey carbon grids (Quantifoil Cu, R1.2/1.3, 200 mesh) for 60 s of incubation and then blotted for 2.5 s and plunge-frozen into liquid ethane precooled by liquid nitrogen using a Vitrobot Mark IV (FEI) operated at approximately 100% humidity and 22 °C. Cryo-EM images were collected with a Titan Krios electron microscope (FEI) operated at 300 kV and equipped with a K2 Summit direct electron detector (Gatan). Thirty-eight frames were recorded for each movie stack at a nominal magnification of 22,500-fold in super resolution mode with a pixel size of 1.0 Å in the defocus range of 1.5–2.5 µm. A total of 8136 movie stacks of the LovBC complex were automatically collected using SerialEM²⁰ with an exposure time of 7.6 s (0.2 s per frame) and a total dose of 60.8 e−/Å².

Image processing

All movies of datasets were aligned and dose-weighted using MotionCor2²¹. Contrast transfer function (CTF) and defocus parameters were determined by Gctf²². Micrograph checking, particle autopicking, 2D, 3D classification, autorefinement, postprocessing, and resolution estimation of each density map were performed using RELION 3.0^23,24,25. Approximately 2000 particles of each dataset were manually picked and subjected to reference-free 2D classification. The best representative 2D classes were selected as templates for autopicking.

For the reconstruction of the LovBC complex, datasets were cleaned by removing ice contaminants and junk particles after two rounds of 2D classification, and good classes were kept to generate the 30 Å 3D initial model, which was low-pass filtered to 70 Å as the reference for subsequent 3D classification (Supplementary Fig. 4). One class of 16,444 particles was auto-refined and postprocessed, yielding the reconstruction of a 4.21 Å LovBC complex. The best classes of 205,047 particles were selected with a soft mask on the core (LovB) for focused 3D auto-refinement, postprocessing, CTF refinement and Bayesian polishing, yielding the reconstruction of a 3.09 Å LovB density map. 3D classification with finer, local angular searches was further performed for conformational difference detection. For better integrity of the LovC part of the LovBC complex, a total of 83,573 particles were auto-refined and postprocessed, yielding the reconstruction of a 3.60 Å LovB_C (with one side of LovC only) density map. For better resolution of the lovB, one class of 75,036 particles was imposed in parallel with C1 and C2 symmetry and auto-refined, yielding the density maps of 3.53 Å and 3.27 Å, respectively. No differences between the C1 and C2 maps was detected when inspecting in Coot. Finally, classes were combined and imposed with C2 symmetry. A total of 205,047 particles were further processed by focused 3D auto-refinement using a soft mask around the LovB with signal subtraction, yielding the reconstruction of the 2.91 Å LovB density map (Supplementary Fig. 5). The resolution of all density maps was estimated based on the corrected gold standard Fourier shell correlation (FSC) at the 0.143 criterion.

Model building and refinement

First, the HHpred server was used for protein homology analysis using the HMM–HMM comparison method^31,32. Multiple homologous crystal structures for each domain of LovB (KS, MAT, DH, CMeT, ψKR, ψER, and KR) were rigid-body fitted into the density using UCSF-Chimera³³ for comparative model rebuilding using RosettaCM^34,35,36. The resulting atomic coordinates were further manually adjusted and built using Coot³⁷ and ISOLDE³⁸. Structure refinement was carried out by Phenix in real space with secondary structure and geometry restraints to prevent over-fitting³⁹. Finally, MolProbity⁴⁰ was used for model validation. The statistics are summarized in Supplementary Table 3. All cryo-EM densities and atomic models were visualized, and the figures depicting them were prepared in PyMOL, UCSF-Chimera, and ChimeraX⁴¹.

Substrate tunnel generation and protein docking

Substrate tunnels within each domain of lovB were calculated with the program Hollow⁴² and adjusted in PyMOL (The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC.).

Docking between MAT/LovB (residues 695–757) and LovC was performed using RosettaDock in Rosetta v3.2²⁹ using the online Rosie server^43,44. Nineteen out of the 1000 lowest-energy MAT/LovB models generated by the first round of RosettaCM were selected for docking simulation with LovC (both fitted to the experimental density). The lowest-energy docking run decoy of one best simulation (out of 19) was used as the input for the second round of docking. The ten lowest-energy decoys with rmsd < 2.2 Å indicate docking success.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The 3D cryo-EM density maps in this study have been deposited in the Electron Microscopy Data Bank (https://www.emdataresource.org/) with accession numbers EMD-30434 and EMD-30435 (Supplementary Table 3). The atomic coordinates have been deposited in the Protein Data Bank as 7CPX and 7CPY. The atomic coordinate of LovB–C interface underlying Fig. 4c is provided as a Supplementary Data 1. Other data are available from the corresponding authors upon request. Source data are provided with this paper.

References

Tobert, J. A. Lovastatin and beyond: the history of the HMG-CoA reductase inhibitors. Nat. Rev. Drug Discov. 2, 517–526 (2003).
Article CAS PubMed Google Scholar
Barriuso, J. et al. Double oxidation of the cyclic nonaketide dihydromonacolin L to monacolin J by a single cytochrome P450 monooxygenase, LovA. J. Am. Chem. Soc. 133, 8078–8081 (2011).
Article CAS PubMed Google Scholar
Xu, W. et al. LovG: the thioesterase required for dihydromonacolin L release and lovastatin nonaketide synthase turnover in lovastatin biosynthesis. Angew. Chem. Int. Ed. Engl. 52, 6472–6475 (2013).
Article CAS PubMed Google Scholar
Meehan, M. J. et al. FT-ICR-MS characterization of intermediates in the biosynthesis of the α-methylbutyrate side chain of lovastatin by the 277 kDa polyketide synthase LovF. Biochemistry 50, 287–299 (2011).
Article ADS CAS PubMed Google Scholar
Campbell, C. D. & Vederas, J. C. Biosynthesis of lovastatin and related metabolites formed by fungal iterative PKS enzymes. Biopolymers 93, 755–763 (2010).
Article CAS PubMed Google Scholar
Kennedy, J. et al. Modulation of polyketide synthase activity by accessory proteins during lovastatin biosynthesis. Science 284, 1368–1372 (1999).
Article ADS CAS PubMed Google Scholar
Ma, S. M. et al. Complete reconstitution of a highly reducing iterative polyketide synthase. Science 326, 589–592 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Herbst, D. A., Townsend, C. A. & Maier, T. The architectures of iterative type I PKS and FAS. Nat. Prod. Rep. 35, 1046–1069 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fisch, K. M. et al. Rational domain swaps decipher programming in fungal highly reducing polyketide synthases and resurrect an extinct metabolite. J. Am. Chem. Soc. 133, 16635–16641 (2011).
Article CAS PubMed Google Scholar
Roberts, D. M. et al. Substrate selectivity of an isolated enoyl reductase catalytic domain from an iterative highly reducing fungal polyketide synthase reveals key components of programming. Chem. Sci. 8, 1116–1126 (2017).
Article CAS PubMed Google Scholar
Storm, P. A., Herbst, D. A., Maier, T. & Townsend, C. A. Functional and structural analysis of programmed C-methylation in the biosynthesis of the fungal polyketide citrinin. Cell Chem. Biol. 24, 316–325 (2017).
Article CAS PubMed PubMed Central Google Scholar
Foulke-Abel, J. & Townsend, C. A. Demonstration of starter unit interprotein transfer from a fatty acid synthase to a multidomain, nonreducing polyketide synthase. ChemBioChem 13, 1880–1884 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dutta, S. et al. Structure of a modular polyketide synthase. Nature 510, 512–517 (2015).
Article ADS CAS Google Scholar
Whicher, J. R. et al. Structural rearrangements of a polyketide synthase module during its catalytic cycle. Nature 510, 560–564 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Herbst, D. A., Jakob, R. P., Zähringer, F. & Maier, T. Mycocerosic acid synthase exemplifies the architecture of reducing polyketide synthases. Nature 531, 533–537 (2016).
Article ADS CAS PubMed Google Scholar
Maier, T., Jenni, S. & Ban, N. Architecture of mammalian fatty acid synthase at 4.5 A resolution. Science 311, 1258–1262 (2006).
Article ADS CAS PubMed Google Scholar
Maier, T., Leibundgut, M. & Ban, N. The crystal structure of a mammalian fatty acid synthase. Science 321, 1315–1322 (2008).
Article ADS CAS PubMed Google Scholar
Xu, W., Qiao, K. & Tang, Y. Structural analysis of protein–protein interactions in type I polyketide synthases. Crit. Rev. Biochem. Mol. Biol. 48, 98–122 (2013).
Article CAS PubMed Google Scholar
Ames, B. D. et al. Crystal structure and biochemical studies of the trans-acting polyketide enoyl reductase LovC from lovastatin biosynthesis. Proc. Natl Acad. Sci. USA 109, 11144–11149 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Mastronarde, D. N. Automated electron microscope tomography using robust prediction of specimen movements. J. Struct. Biol. 152, 36–51 (2005).
Article PubMed Google Scholar
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zhang, K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 193, 1–12 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Scheres, S. H. W. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife 7, 163 (2018).
Article Google Scholar
Nakane, T., Kimanius, D., Lindahl, E. & Scheres, S. H. Characterisation of molecular motions in cryo-EM single-particle data by multi-body refinement in RELION. Elife 7, 1485 (2018).
Article Google Scholar
Brignole, E. J., Smith, S. & Asturias, F. J. Conformational flexibility of metazoan fatty acid synthase enables catalysis. Nat. Struct. Mol. Biol. 16, 190–197 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kingsley, L. J. & Lill, M. A. Substrate tunnels in enzymes: structure-function relationships and computational methodology. Proteins 83, 599–611 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gora, A., Brezovsky, J. & Damborsky, J. Gates of enzymes. Chem. Rev. 113, 5871–5923 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chaudhury, S. et al. Benchmarking and analysis of protein docking performance in Rosetta v3.2. PLoS ONE 6, e22477–13 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Cooke, T. F. et al. Genetic mapping and biochemical basis of yellow feather pigmentation in budgerigars. Cell 171, 427–432.e21 (2017).
Article CAS PubMed PubMed Central Google Scholar
Soding, J. Protein homology detection by HMM-HMM comparison. Bioinformatics 21, 951–960 (2005).
Article PubMed Google Scholar
Zimmermann, L. et al. A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core. J. Mol. Biol. 430, 2237–2243 (2018).
Article CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF Chimera-a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
DiMaio, F. et al. Atomic-accuracy models from 4.5-Å cryo-electron microscopy data with density-guided iterative local refinement. Nat. Methods 12, 361–365 (2015).
DiMaio, F., Tyka, M. D., Baker, M. L., Chiu, W. & Baker, D. Refinement of protein structures into low-resolution density maps using Rosetta. J. Mol. Biol. 392, 181–190 (2009).
Article CAS PubMed PubMed Central Google Scholar
Song, Y. et al. High-resolution comparative modeling with RosettaCM. Structure 21, 1735–1742 (2013).
Article CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of coot. Acta Crystallogr. Sect. D 66, 486–501 (2010).
Article CAS Google Scholar
Croll, T. I. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr. D Struct. Biol. 74, 519–530 (2018).
Article CAS PubMed PubMed Central Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. Sect. D 66, 213–221 (2010).
Article CAS Google Scholar
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. Sect. D 66, 1–10 (2009).
CAS Google Scholar
Goddard, T. D. et al. UCSF ChimeraX: meeting modern challenges in visualization and analysis. Protein Sci. 27, 14–25 (2018).
Article CAS PubMed Google Scholar
Ho, B. K. & Gruswitz, F. HOLLOW: generating accurate representations of channel and interior surfaces in molecular structures. BMC Struct. Biol. 8, 49–6 (2008).
Article PubMed PubMed Central Google Scholar
Lyskov, S. et al. Serverification of molecular modeling applications: the Rosetta online server that includes everyone (ROSIE). PLoS ONE 8, e63906 (2013).
Article ADS PubMed PubMed Central Google Scholar
Lyskov, S. & Gray, J. J. The RosettaDock server for local protein-protein docking. Nucleic Acids Res. 36, W233–W238 (2008).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Z. Liu, F. F. Wang, G. Y. Li, L. H. Xin, J. L. Duan, and N. Liu for their help in the sample preparation and data collection. Cryo-EM images were collected at the National Facility for Protein Science in Shanghai (NFPS), Zhangjiang Lab. The computations in this paper were run on the π 2.0 cluster supported by the Center for High Performance Computing at Shanghai Jiao Tong University. This work was financially supported by National Key R&D Program of China (2018YFA0900700, 2019YFA0905400), the Ministry of Science and Technology (2015CB554203), the National Science Foundation of China (91753123, 31470830, 21661140002).

Author information

These authors contributed equally: Jialiang Wang, Jingdan Liang.

Authors and Affiliations

State Key Laboratory of Microbial Metabolism and School of Life Science & Biotechnology, Shanghai Jiao Tong University, Shanghai, China
Jialiang Wang, Jingdan Liang, Lu Chen, Wei Zhang, Zixin Deng & Zhijun Wang
National Facility for Protein Science in Shanghai, Shanghai, China
Liangliang Kong, Chao Peng & Chen Su
Department of Chemical and Biomolecular Engineering and Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA
Yi Tang

Authors

Jialiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jingdan Liang
View author publications
You can also search for this author in PubMed Google Scholar
Lu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Liangliang Kong
View author publications
You can also search for this author in PubMed Google Scholar
Chao Peng
View author publications
You can also search for this author in PubMed Google Scholar
Chen Su
View author publications
You can also search for this author in PubMed Google Scholar
Yi Tang
View author publications
You can also search for this author in PubMed Google Scholar
Zixin Deng
View author publications
You can also search for this author in PubMed Google Scholar
Zhijun Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.W., Y.T., and J.W. conceived the study. J.W., J.L., C.L. L.K., C.P., and C.S. performed the experiments. Z.W., J.W., Y.T. J.L. W.Z., and L.K. analyzed the data. All authors wrote the paper. Z.W., Z.D., and J.L. supervised this project.

Corresponding authors

Correspondence to Zixin Deng or Zhijun Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, J., Liang, J., Chen, L. et al. Structural basis for the biosynthesis of lovastatin. Nat Commun 12, 867 (2021). https://doi.org/10.1038/s41467-021-21174-8

Download citation

Received: 21 September 2020
Accepted: 13 January 2021
Published: 08 February 2021
DOI: https://doi.org/10.1038/s41467-021-21174-8

This article is cited by

Engineered and total biosynthesis of fungal specialized metabolites
- Russell J. Cox
Nature Reviews Chemistry (2024)
Insights into azalomycin F assembly-line contribute to evolution-guided polyketide synthase engineering and identification of intermodular recognition
- Guifa Zhai
- Yan Zhu
- Yuhui Sun
Nature Communications (2023)
Enzymology of assembly line synthesis by modular polyketide synthases
- Martin Grininger
Nature Chemical Biology (2023)
C–N bond formation by a polyketide synthase
- Jialiang Wang
- Xiaojie Wang
- Jingdan Liang
Nature Communications (2023)
Catalytic trajectory of a dimeric nonribosomal peptide synthetase subunit with an inserted epimerase domain
- Jialiang Wang
- Dandan Li
- Zhijun Wang
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.