Structural aspects of non-ribosomal peptide biosynthesis

https://doi.org/10.1016/j.sbi.2004.10.005Get rights and content

Small peptides have powerful biological activities ranging from antibiotic to immune suppression. These peptides are synthesized by non-ribosomal peptide synthetases (NRPS). Structural understanding of NRPS took a huge leap forward in 2002; this information has led to several detailed biochemical studies and further structural studies. NRPS are complex molecular machines composed of multiple modules and each module contains several autonomously folded catalytic domains. Structural studies have largely focused on individual domains, isolated from the context of the multienzyme. Biochemical studies have looked at individual domains, isolated whole modules and intact NRPS, and the combined data begin to allow us to visualize the process of peptide assembly by NRPS.

Introduction

Amino acids are structurally diverse and reactive molecules. The proteinogenic α-amino acids are limited to only 22 different structures. To increase the diversity and utility of these molecules, Nature has polymerised them via a peptide bond. The number of possible molecules for a peptide of length n amino acids is 22n. Large polypeptides are made using the ribosome and the post-translation machinery that are familiar to anyone with a rudimentary knowledge of biology. The vast majority of peptide bond formation is catalysed by ribosomes and, until recently, the catalysis of peptide bond formation by non-ribosomal peptide synthetases (NRPS) has been largely overlooked. A list of molecules made by NRPS is beyond the scope of this article and the reader is referred to a recent review for a comprehensive account of these molecules [1]. However, it is instructive to highlight some of the best-known examples to illustrate the importance of NRPS systems. The last-resort antibiotic vancomycin and its analogues (Figure 1) have exquisitely complex structures made by NRPS and associated enzymes [2]. Indeed, almost all peptide-based antibiotics are made by NRPS. Chelation of iron by bacteria is vital for their survival and is often a virulence determinant in pathogens. NRPS synthesize macrocycles such as enterobactin (Figure 1), which have an extraordinarily high iron affinity [3]. Cyclosporin (Figure 1), an immune suppressor that made organ transplants possible, and the potent anti-tumour compound bleomycin (Figure 1) are both made by NRPS 4., 5..

A cursory examination of the structures in Figure 1 highlights the contrast between familiar ribosomal peptides and non-ribosomal peptides. The molecules made by NRPS are often cyclic, have a high density of non-proteinogenic amino acids and often contain amino acids connected by bonds other than peptide or disulfide bonds. Given the complexity of the molecules, one might assume that each was made by a specific pathway and that very little in the way of general rules could be established. NRPS are now known to be very large proteins and, despite the obvious complexity of the products, consist of a series of repeating enzymes fused together. Such fusion of repeating enzymes in a single polypeptide closely parallels the protein machines responsible for polyketide biosynthesis [6]. For NRPS, one amino acid building block is normally incorporated into the peptide product by one module, thus products with ten amino acids would be expected to be constructed by an NRPS with ten modules stitched together. This is often termed the colinearity rule. Each module is normally specific to a particular amino acid substrate. It is important to note that there are several exceptions to the colinearity rule, particularly for NRPS that assemble siderophores 3., 7., 8.. The normally colinear sequence of modules in both NRPS and the peptide product offers the potential for great versatility in the bioengineering of these systems.

This review discusses the structures and mechanisms of action of the isolated components of the NRPS. We highlight new biochemical data that improve our understanding of these molecular machines.

Section snippets

Domains of non-ribosomal peptide synthetases

A minimal chain elongation module contains three core domains: the condensation (C) domain, the adenylation (A) domain and the peptidyl carrier (PCP) domain (also known as the thiolation [T] domain) [9]. The A and PCP domains contain the machinery necessary to sequester an activated, covalently linked amino acid ready for peptide bond formation. Amino (and sometimes other carboxylic) acids are first activated by adenylation (by the A domain). This process consumes ATP and generates a highly

Protein engineering prospects

A domains are usually highly selective and tolerate very little structural variation from their natural substrate (which is not necessarily a proteinogenic amino acid). For this reason, the A domain has been the subject of intense study to attempt to understand the structural basis of its selectivity 9., 11., 12., 13., 14.. The C domain has two substrate-binding sites, one for the peptide attached to the PCP domain of the preceding module and one for the amino acid attached to the PCP domain of

Adenylation domain

The A domain belongs to the adenylate-forming enzyme superfamily. There are several solved structures of members of this superfamily, including the A domain responsible for phenylalanine activation (PheA) from the NRPS responsible for gramicidin S biosynthesis [12]. More recently, a comprehensive study of DhbE (Figure 3) has been reported [22]. DhbE is a stand-alone adenylating enzyme that activates 2,3-dihydroxybenzoate (DHB) by adenylation of the carboxyl group and then catalyses the

Condensation, cyclisation and epimerisation

To date, the only atomic model of a C domain derives from the structure of VibH (Figure 4) [26••]. VibH is an isolated enzyme that catalyses peptide bond formation between one of the primary amino groups of norspermidine (NS) and a PCP-bound DHB thioester during the biosynthesis of the peptide siderophore vibriobactin. The PCP domain is part of the didomain protein VibB [23]. The degree of sequence conservation between VibH and conventional C domains argues that all C domains will have

Thioesterase

There are two types of TE associated with NRPS. Type I TEs are usually found as the last domain in NRPS multienzymes and catalyse cleavage of the assembled peptide product from the PCP domain in the last module. The structure of the TE domain from the NRPS responsible for surfactin A biosynthesis has been determined by two groups 19.••, 21.• (Figure 5). Unsurprisingly, the fold of the TE domain belongs to the α,β-hydrolase family, with the conventional catalytic triad comprising serine,

Future prospects

In the past three years, structural work has begun to map out NRPS. An atomic model of each of the component core modules now exists. In particular, our understanding of A domains is now detailed enough for theoretical predictions to be made about their substrate specificities. Similar understanding is still somewhat lacking for the related C, Cy and E domains. Although a model exists for their fold, there are no co-complexes and thus no detailed information regarding substrate recognition.

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

JHN is a Biotechnology and Biological Sciences Research Council (BBSRC) career development fellow. Both laboratories are supported under the BBSRC SPORT initiative. JHN acknowledges Wellcome Trust support and GLC is supported by the BBSRC.

References (39)

  • M.T. Mazur et al.

    Site-specific observation of acyl intermediate processing in thiotemplate biosynthesis by Fourier transform mass spectrometry: the polyketide module of yersiniabactin synthetase

    Biochemistry

    (2003)
  • A.M. Gehring et al.

    Reconstitution and characterization of the Escherichia coli enterobactin synthetase from EntB, EntE, and EntF

    Biochemistry

    (1998)
  • G. Weber et al.

    The peptide synthetase catalyzing cyclosporine production in Tolypocladium niveum is encoded by a giant 45.8-kilobase open reading frame

    Curr Genet

    (1994)
  • S.W. Yu et al.

    Analysis of the exochelin locus in Mycobacterium smegmatis: biosynthesis genes have homology with genes of the peptide synthetase family

    J Bacteriol

    (1998)
  • S. Lautru et al.

    Substrate recognition by nonribosomal peptide synthetase multi-enzymes

    Microbiology

    (2004)
  • N.J. Hillson et al.

    Dimeric structure of the six-domain VibF subunit of vibriobactin synthetase: mutant domain activity regain and ultracentrifugation studies

    Biochemistry

    (2003)
  • G.L. Challis et al.

    Predictive, structure-based model of amino acid recognition by nonribosomal peptide synthetase adenylation domains

    Chem Biol

    (2000)
  • T. Stachelhaus et al.

    The specificity-conferring code of adenylation domains in nonribosomal peptide synthetases

    Chem Biol

    (1999)
  • S.L. Clugston et al.

    Chirality of peptide bond-forming condensation domains in nonribosomal peptide synthetases: the C-5 domain of tyrocidine synthetase is a C-D(L) catalyst

    Biochemistry

    (2003)
  • Cited by (0)

    View full text