Pyrrolysyl-tRNA Synthetase, an Aminoacyl-tRNA Synthetase for Genetic Code Expansion

Genetic code expansion (GCE) has become a central topic of synthetic biology. GCE relies on engineered aminoacyl-tRNA synthetases (aaRSs) and a cognate tRNA species to allow codon reassignment by co-translational insertion of non-canonical amino acids (ncAAs) into proteins. Introduction of such amino acids increases the chemical diversity of recombinant proteins, endowing them with novel properties. Such proteins serve in sophisticated biochemical and biophysical studies both in vitro and in vivo, they may become unique biomaterials or therapeutic agents, and they afford metabolic dependence of genetically modified organisms for biocontainment purposes. In the Methanosarcinaceae the incorporation of the 22nd genetically encoded amino acid, pyrrolysine (Pyl), is facilitated by pyrrolysyl-tRNA synthetase (PylRS) and the cognate UAG-recognizing tRNA Pyl. This unique aaRS·tRNA pair functions as an orthogonal translation system (OTS) in most model organisms. The facile directed evolution of the large PylRS active site to accommodate many ncAAs, and the enzyme's anticodon-blind yet specific recognition of the cognate tRNA Pyl, make this system highly amenable for GCE purposes. The remarkable polyspecificity of PylRS has been exploited to incorporate >100 different ncAAs into proteins. Here we review the Pyl-OT system and selected GCE applications to examine the properties of an effective OTS.


INTRODUCTION
New macromolecular functions often come from the expansion of chemical diversity. In the field of synthetic biology, the increase in chemical complexity originates from a functional demand. Using the genetic code expansion approach, non-canonical amino acids can be incorporated into proteins to study various biological phenomena and allow creation of new biomaterials and entire organisms. [1] Non-canonical amino acids include all amino acids that arose from genetic code expansion either naturally (such as selenocysteine or pyrrolysine) [2] or were added to the genetic code via genetic code expansion (GCE) experiments.

Translation With Standard and Non-canonical Amino Acids
Twenty natural amino acids ubiquitously serve as building blocks for protein synthesis. Each amino acid is first covalently attached to its cognate tRNA by a specific aaRS. An elongation factor then binds the aminoacyl-tRNA and delivers it to the ribosome, where the amino acid becomes incorporated into a nascent polypeptide chain in response to an assigned codon (Figure 1). Of the 64 triplet codons, 61 sense codons are assigned to a specific standard AA, while 3 stop codons signal polypeptide chain termination mediated by release factors.
A genetic code expansion approach to introduce ncAAs in vivo involves site-specific co-translational incorporation through stop codon suppression (SCS, Figure 1). The targeted protein is expressed from an mRNA template containing an internal stop codon reassigned to the ncAA. An ncAA-specific aaRS aminoacylates its cognate tRNA, which is devoted to decoding a stop codon (e.g. tRNA Pyl CUA for recognizing the UAG codon). Such a suppressor tRNA, aminoacylated with the ncAA, competes with release factors for binding to the stop codon at the ribosome. If the suppressor tRNA outcompetes the release factors, the ncAA is incorporated in response to the internal stop codon.
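The logic of stop codon suppression can be illustrated with a short Python sketch. It is a toy model only: the codon table is deliberately truncated to the codons used in the example, the function name `translate` and the placeholder symbol `X` for the ncAA are illustrative assumptions, and the competition between suppressor tRNA and release factors is reduced to an all-or-nothing switch.

```python
# Toy model of amber (UAG) stop codon suppression.
# Assumptions: truncated codon table; 'X' stands for the incorporated ncAA;
# suppression is treated as all-or-nothing (no release factor competition).
STOP_CODONS = {"UAA", "UAG", "UGA"}
CODON_TABLE = {"AUG": "M", "GCU": "A", "AAA": "K", "GGC": "G", "UUC": "F"}


def translate(mrna, suppressor_codon=None, ncaa_symbol="X"):
    """Translate mrna from its first codon until an unsuppressed stop codon."""
    peptide = []
    usable_length = len(mrna) - len(mrna) % 3  # ignore a trailing partial codon
    for i in range(0, usable_length, 3):
        codon = mrna[i:i + 3]
        if codon in STOP_CODONS:
            if codon == suppressor_codon:
                # The suppressor tRNA decodes the stop codon with the ncAA.
                peptide.append(ncaa_symbol)
                continue
            break  # release factors terminate translation
        peptide.append(CODON_TABLE[codon])
    return "".join(peptide)


mrna = "AUGGCUUAGAAAGGCUAA"  # internal UAG, natural UAA terminator
print(translate(mrna))                          # truncated product: MA
print(translate(mrna, suppressor_codon="UAG"))  # full-length product: MAXKG
```

Without the suppressor tRNA the internal UAG terminates synthesis and a truncated product results; with it, the full-length protein carries the ncAA at the targeted position while the natural UAA terminator is still respected.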

The suppressor tRNA and aaRS dedicated to ncAA incorporation should not cross-react with host aaRSs or tRNAs, and the aaRS should not activate canonical amino acids. This lack of cross-reactivity defines the ncAA-specific aaRS·tRNA pair as orthogonal (o-pair or orthogonal translation system, OTS).

The Best Orthogonal Translation Systems
The success of ncAA incorporation strongly depends on the efficiency of the orthogonal aaRS·tRNA pair. The efficiency of each o-pair is determined by its inherent or engineered orthogonality, the ability of the aaRS to recognize the cognate suppressor tRNA, and the malleability of the tRNA synthetase active site. In addition to the orthogonal aaRS·tRNA pair, factors such as intracellular availability of the ncAA, the sequence context of the targeted codon in the mRNA and any effect the incorporated ncAA may have on the recombinant protein's folding also influence the ncAA incorporation efficiency.
[1a] The latter is also amenable for use in Saccharomyces cerevisiae, while the E. coli Tyr-OTS provides a better choice than the M. jannaschii Tyr-OTS [3] in yeast due to the lack of orthogonality of the archaeal o-pair in S. cerevisiae. Other notable examples for genetic code expansion experiments in E. coli include the S. cerevisiae AspRS·tRNA Asp, GlnRS·tRNA Gln, TyrRS·tRNA Tyr and PheRS·tRNA Phe o-pairs. Archaeal O-phosphoseryl-tRNA synthetase (SepRS), a rare synthetase devoted to attaching phosphoserine (Sep) onto tRNA Cys as a precursor in Cys-tRNA Cys synthesis, has also been exploited for GCE purposes. [4] The small number of OTSs developed that are not based on the Tyr- and Pyl-OTS, and the even lower number of ncAAs that these OTSs can incorporate, reflect the rigidity of aaRSs with respect to recognizing cognate AAs. [1a]

The PylRS·tRNA Pyl System

Pyrrolysyl-tRNA Synthetase
PylRS is a homodimeric enzyme that catalyzes the formation of Pyl-tRNA Pyl CUA. Formation of Pyl-tRNA Pyl follows the two-step mechanism common to all aaRSs: Pyl is first activated with ATP to form a pyrrolysyl-adenylate (Pyl-AMP) and then transferred to the 3′-hydroxyl group of the tRNA Pyl terminal adenosine. [5] Each monomer of PylRS can be roughly divided into its N-terminal and C-terminal portions, which are encoded by a single gene in the Methanosarcinaceae species or by 2 separate genes in Desulfitobacterium hafniense (pylSn and pylSc, respectively). [6] Biochemical analysis of the N-terminal D. hafniense PylRS fragment demonstrates its role in tRNA binding; [6] this is further supported by the lack of in vivo enzyme activity after truncation of the N-terminal PylRS domain. [7] A structural understanding of the role of the N-terminal domain of PylRS is lacking, as it does not have sequence homology with any known protein domain and its low solubility impedes production for crystallization purposes.
While the structure of the N-terminal domain is still unknown, the PylRS C-terminal domain structures have been determined for the archaeal [8] and bacterial [9] enzymes. [10] All structures confirm that this soluble domain forms the catalytic core of the enzyme. The catalytic core is composed of an antiparallel β-sheet surrounded by α-helices, which is characteristic of class II aaRSs. [11] Class-specific motifs are typically located with sequence motif 1 building part of the dimer interface and motifs 2 and 3 recognizing the nucleotide placed at the active site (Figure 2, Figure 3A).
The structure of the bacterial PylRS·tRNA Pyl complex shows a tRNA binding stoichiometry of two tRNAs per enzyme dimer [9b] (Figure 2). Each tRNA binds along the surface of one of the monomers. Cognate tRNA recognition is ensured through interactions with specific nucleotides (i.e. identity elements) and a core binding surface of the enzyme that recognizes the characteristic tRNA tertiary structure. The top of the acceptor stem (base pair G1:C72 and R73, Figure 2) is involved in the critical portion of the PylRS·tRNA Pyl interface. A concave structure on the enzyme surface sterically matches the acceptor helix and directs the tRNA Pyl 3′-end into the catalytic site. [9b,12] The idiosyncratic core of tRNA Pyl is recognized dominantly by the elements of one subunit, and fewer contacts are made with the other monomer. The PylRS-specific tRNA-binding domain 1 makes the majority of the interactions with the tertiary core of tRNA Pyl. The PylRS·tRNA Pyl structure confirms earlier biochemical data that although the bases at positions 33 and 37 surrounding the anticodon constitute identity elements, [13] PylRS does not use the tRNA anticodon as an identity element per se.

Properties and Engineering of tRNA Pyl
[9b] It has been proposed that in vivo M. barkeri tRNA Pyl is less modified than other cellular tRNAs and harbors only 2 modified nucleotides: a 4-thiouridine and a 1-methyl-pseudouridine. [14] The structure of D. hafniense tRNA Pyl shows that deletion of invariant uridine 8 (U8) abolishes the formation of a highly conserved tertiary base pair (U8:A14) and leads to unusual pairing of G14 with T-loop residue C59. Furthermore, U8 deletion allows the base at position 9 to flip away from the tRNA body. [9b]

Pyrrolysine
In nature, Pyl-OTS and Pyl are confined to a small number of organisms including methanogenic archaea and some bacterial taxa. [15] In these organisms, three consecutive enzymes (PylB, PylC and PylD) form a Pyl biosynthesis cluster responsible for the transformation of two Lys molecules into Pyl. [16] In the first step, PylB converts Lys to (3R)-methyl-D-ornithine (3MO), which is then ligated to the Nε-amino group of a second Lys molecule by PylC, giving rise to L-lysine-Nε-(3R)-methyl-D-ornithine. PylD then oxidizes this intermediate, which cyclizes to form Pyl. Pyl is subsequently attached to tRNA Pyl by PylRS. In Methanosarcinaceae, tRNA Pyl CUA is a natural suppressor tRNA that decodes a UAG stop codon present as an internal stop codon in a set of methylamine methyltransferase genes [18] and a tRNA His guanylyltransferase. [19] The incorporation of Pyl in the methyltransferase is critical for enzyme catalysis and enables growth on methylamine as a sole energy source. [20] In contrast, in Methanosarcina acetivorans tRNA His guanylyltransferase the Pyl residue is not essential and can be exchanged with Trp without a loss of enzymatic activity. [19]

PylRS Active Site Architecture
The PylRS active site contains a large amino acid binding pocket where Pyl is mainly recognized through (nonspecific) hydrophobic interactions. [8,21] The secondary carbonyl group of Pyl forms hydrogen bonds with Asn-346. A comparison of PylRS:Pyl and various PylRS:ncAA complexes [8,22] reveals that variations in the ncAA side chain are highly tolerated, particularly when they are located in the part of the active site responsible for pyrroline recognition. [23] This property of PylRS contrasts with the general features of aaRSs, which are selective against non-cognate substrates. Both structural and in vivo studies demonstrate that PylRS can use substrates that possess a carbamate, carbonyl or amide moiety, which can be positioned to interact with Asn-346 or Cys-348.
The structure of M. mazei PylRS [8,21] shows that Arg-330 forms a hydrogen bond with the primary carbonyl of Pyl, and this interaction is seen in complexes with both Pyl and Pyl analogs. Most interestingly, the primary amino group is not directly recognized but is bound by a water molecule (Figure 3A). This water forms hydrogen bonds with the amide oxygen of Asn-346 and two backbone amide nitrogen atoms of Leu-301 and Ala-302. [22a,24] In concordance with this flexible mode of α-amino group recognition come reports showing acylation of tRNA Pyl with an α-hydroxy acid, a D-amino acid and an Nα-methyl amino acid. [25] Furthermore, it has been reported that several α-hydroxy acids can be incorporated into proteins by the PylRS·tRNA Pyl o-pair. [26] Although the importance of Asn-346 and other Pyl binding residues has been demonstrated through mutational analysis, [22a] there is no coherent kinetic data that might disclose the effect of these residues on PylRS aminoacylation catalysis (Table 1). Steady state parameters for Pyl activation have been measured mostly in the absence of tRNA Pyl, using an ATP-[32P]PPi exchange assay (Table 1). In addition, enzyme-kinetic analyses of PylRS were hampered by the lack of synthetically available Pyl, necessitating the use of Pyl analogs such as Nε-((cyclopentyloxy)carbonyl)-L-lysine more frequently than Pyl.
Figure 3 (caption fragment). The wild-type enzyme is shown in magenta and the triple Y306G/Y384F/I405R mutant in cyan. Pyl-AMP and norbornene-AMP (nonhydrolyzable analogs) are given as stick representations in yellow and in green, respectively, with transparent surfaces indicating the substrate-binding pocket. In the triple mutant the Y306G mutation at the base of the AA binding pocket enlarges the binding site, thus allowing placement of the norbornene head group (this space is normally occupied by the Tyr-306 side chain, see wild-type structure). In the wild-type enzyme Asn-346 forms a hydrogen bond with the carbonyl-amide linkage of the Pyl side chain. In contrast, the carbamate linkage of the norbornene does not form a bond with Asn-346 but with Cys-348.

Steady state kinetic parameters with Pyl for full length M. mazei PylRS and M. barkeri PylRS show that these aaRSs bind the amino acid with a Km of ∼50 μmol dm⁻³ and a kcat of 0.1-0.3 s⁻¹ in the ATP-[32P]PPi exchange assay. In the presence of tRNA Pyl (aminoacylation assay) the affinity for the cognate amino acid is improved (20 μmol dm⁻³), yet the kcat remains very low (0.008-0.03 s⁻¹). This kcat value is almost 3 orders of magnitude lower than that of many canonical aaRSs. [22b]
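To put the quoted steady-state values into perspective, the catalytic efficiency kcat/Km can be computed directly. The Python sketch below is a worked calculation only. It assumes that the 20 μmol dm⁻³ figure quoted for the aminoacylation assay can be treated as a Km, and the helper name `catalytic_efficiency` is illustrative.

```python
# Catalytic efficiency (kcat/Km) from the steady-state values quoted in the text.
# Assumption: the 20 umol dm^-3 value from the aminoacylation assay is used as a Km.

def catalytic_efficiency(kcat_per_s, km_micromolar):
    """Return kcat/Km in dm^3 mol^-1 s^-1 for a Km given in umol dm^-3."""
    return kcat_per_s / (km_micromolar * 1e-6)


# ATP-[32P]PPi exchange assay: Km ~50 umol dm^-3, kcat 0.1-0.3 s^-1
exchange = [catalytic_efficiency(kcat, 50) for kcat in (0.1, 0.3)]

# Aminoacylation assay: Km ~20 umol dm^-3, kcat 0.008-0.03 s^-1
aminoacylation = [catalytic_efficiency(kcat, 20) for kcat in (0.008, 0.03)]

print("exchange:       {:.0f} to {:.0f} dm^3 mol^-1 s^-1".format(*exchange))
print("aminoacylation: {:.0f} to {:.0f} dm^3 mol^-1 s^-1".format(*aminoacylation))
```

With these inputs the efficiency is on the order of 2 × 10³ to 6 × 10³ dm³ mol⁻¹ s⁻¹ for amino acid activation and 4 × 10² to 1.5 × 10³ dm³ mol⁻¹ s⁻¹ for aminoacylation, so the tighter apparent binding in the presence of tRNA Pyl does not compensate for the lower turnover number.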

Amino Acid Substrate Range
Although low turnover numbers make PylRS a fairly poor catalyst in vitro, to date it has been adapted for incorporation of more than 100 ncAA substrates (full list in [27]). Regardless of the low turnover, in vivo suppression efficiency can amount to almost 80 % with engineered enzymes, making this enzyme a powerful tool for GCE experiments. [28] As PylRS can tolerate replacement of the pyrroline ring with a similar substituent, more than 20 lysine derivatives were reported to be suitable substrates for PylRS without active site engineering. [23,27] In addition to lysine derivatives, Pyl-OTS can be used to incorporate Phe derivatives, [27] although a number of Phe derivatives have already been successfully incorporated using M. jannaschii and E. coli TyrRS.
The lack of specificity towards the functional group at the Nε-atom has allowed for incorporation of Lys analogs bearing reactive chemical groups that can undergo a selective reaction with a desired label. Because the label does not cross-react with cellular (macro)molecules, the reaction can occur within a living cell. The earliest reports of introducing a chemical tag via Pyl-OTS used wild-type PylRS. Alkyne (1) and azide (2) derivatives (Figure 4) for Cu(I)-catalyzed alkyne-azide triazole-forming reactions (CuAAC, a type of reaction commonly referred to as "click chemistry") were introduced into myoglobin as proof-of-principle. [29] For the purposes of monitoring conformational changes of a protein, PylRS was used to install a reactive dye for distance-dependent Förster resonance energy transfer (FRET). While the first member of the FRET pair was generated by alkylation of a Cys residue, alkyne-containing Pyl analogs (3 and 4, Figure 4) were introduced into calmodulin via the SCS strategy and subsequently labelled with azidocoumarin through a CuAAC reaction. [30] To circumvent the inherent cytotoxicity of Cu(I), activated cyclooctyne derivatives can also be introduced via Pyl-OTS. Such compounds can react with azide-containing labels without additional catalysts. The M. mazei PylRS double mutant (Y306A/Y384F) was used to introduce trans-cyclooctene (5) [31] and cyclooctyne (6 and 7) derivatives into target proteins in E. coli and mammalian cells. [32] The norbornene-bearing ncAA (8) is convenient as it can undergo two different types of conjugation: it can be reacted with azides (Cu(I) catalysis) or with tetrazines (inverse-electron-demand Diels-Alder reaction). Human carbonic anhydrase II containing a norbornene-derivatized amino acid was produced by using the triple M. mazei PylRS mutant (Y306G/Y384F/I405R). Subsequently, the norbornene-bearing sites were exploited for polyethylene glycol (PEG) attachment. [33]
A comparison of the active site architectures (Figure 3B) shows that the mutation of Tyr-306 is critical to enlarge the hydrophobic binding pocket and to accommodate the bulky head group of the norbornene-containing compound.
In addition to chemical groups that can render a protein target susceptible to selective chemistry, many photoreactive groups act as substrates for both wild-type PylRS and PylRS variants. Photo-reactive ncAAs are attractive; they ensure 'protected' co-translational incorporation into target proteins, and subsequent facile deprotection will generate the desired 'natural' posttranslational protein modifications. An important modified AA, Nε-methyl-L-lysine, cannot be selectively incorporated via Pyl-OTS as it lacks the side chain carbonyl oxygen that is recognized by the enzyme. However, protected versions of Nε-methyl-L-lysine contain a carbamate and can serve as substrates for PylRS. Upon incorporation they can be deprotected to leave Nε-methyl-L-lysine as the incorporated residue (Figure 5).
In contrast to Nε-methyl-L-lysine, another important lysine posttranslational modification, Nε-acetyl-L-lysine (AcK), has been successfully incorporated by an evolved PylRS, as the carbonyl oxygen is present in this compound and the evolved PylRS does not lose its selectivity. Pyl-OTS mediated incorporation of AcK has been demonstrated in E. coli and mammalian cells. An evolved M. barkeri PylRS·tRNA pair was used to specifically incorporate AcK in myoglobin [34] and histones. [35] [22b,36] Interestingly, there are several variants of both M. mazei and M. barkeri PylRS that are specific for AcK. It has been proposed that the equivalent efficiency of different PylRS variants originates from the nature of the substrate head group (Table 1). Since the acetyl group of AcK is much smaller than the pyrroline ring of Pyl, mutations in the Pyl binding site need to occupy the vacant space to compensate for the loss of the bulky pyrroline ring. [34]

What Is/Determines Orthogonality?
The feasibility of site-specific incorporation of an ncAA depends on the mutual interaction of the host translational machinery with the orthogonal aaRS·tRNA pair. Ideally, the o-tRNA would be efficiently recognized by the host elongation factor and ribosome (Figure 1), but not by the host aaRSs. Conversely, the o-aaRS should not recognize the host tRNAs or charge standard AAs. The conserved tertiary features of tRNAs and the relatively low complexity of nucleotide determinants often result in engineered tRNAs that interact with the host translation machinery. Attempts to improve orthogonality via tRNA engineering often become a complex challenge, as adding or removing nucleotide determinants can introduce a determinant for another host aaRS.
[37a] PylRS and E. coli TrpRS share a number of common identity determinants, which affect the cross-reactivity of tRNA Pyl UCA. In E. coli, the tRNA anticodon positions C34 and A36, as well as G73, act as minor determinants for TrpRS, and all of them are present in tRNA Pyl UCA. [37a] While the natural amber suppressor tRNA Pyl CUA is orthogonal in E. coli, in S. cerevisiae the M. barkeri tRNA Pyl CUA was shown to be a substrate for alanyl-tRNA synthetase (AlaRS) [38] due to the presence of a positive identity determinant, G3:U70. Mutation of this base pair to A3:U70 changed the sequence of the M. barkeri tRNA Pyl to that of M. mazei tRNA Pyl, and orthogonality was obtained. [38] In contrast to the known interaction of tRNA Pyl with the host aaRSs, PylRS shows little to no cross-reactivity with the host tRNAs, regardless of the host organism. However, a significant challenge for genetic code expansion with PylRS is maintaining orthogonality toward the cellular pool of standard AAs. For instance, only 2 active site residues need to be replaced in PylRS to introduce specificity for L-phenylalanine. [39] These active site residues (Asn-346 and Cys-348) are frequent targets of randomization in directed evolution experiments as they bind the carbamate of Pyl or the ncAA substrate. [39]

Examples of Applications
In mammals and other eukaryotes, protein-protein interactions and posttranslational modifications play crucial roles in regulating the function of proteins. Transient protein-protein interactions can be explored in vivo by using photoactive ncAA crosslinkers. [1a,40] The crosslinker 3-(3-methyl-3H-diazirin-3-yl)-propamino-carbonyl-Nε-L-lysine (DiZPK, 9) has an advantage over p-benzoyl-L-phenylalanine (pBpa) because of its flexibility and length, and it has been incorporated via Pyl-OTS into proteins in bacteria, yeast and mammalian cells. [41] Novel ways of studying posttranslational modifications in mammalian cell lines have been explored by introducing photo-caged AAs into proteins. Incorporation of these AA derivatives with the Pyl-OTS has allowed for a number of investigations that aimed to distinguish how AA modifications influence the behaviour of proteins in vivo. Wild-type PylRS utilizes some of the Boc-protected lysine derivatives without the need for engineering of the enzyme active site. For example, the ncAAs Nε-(tert-butoxycarbonyl)-L-lysine (10) and Nε-(tert-butoxycarbonyl)-Nε-methyl-L-lysine (11) have been incorporated, with subsequent deprotection resulting in proteins containing Nε-methylated Lys residues. [29,42] To specifically charge the photocaged lysine derivative (12), a variant of the M. barkeri PylRS was selected through directed evolution. This PylRS·tRNA Pyl CUA pair was used to introduce a critical lysine residue into the kinase MEK1 in a photo-caged state. As the kinase is part of a signalling pathway, a cellular network could be activated by light irradiation and subsequent release of the caging group. [43] An evolved version of M. barkeri PylRS was used to introduce a photo-caged version of Tyr into the STAT1 protein (a part of the interferon-mediated signal transduction network), thus enabling studies of Tyr phosphorylation in vivo. [44]
Using the SCS strategy for the incorporation of lysine derivatives into histone proteins has allowed for novel approaches to gain insight into the role of histone posttranslational modifications. Using the Pyl-OTS, ncAAs 10 and 11 were introduced into histone H3 and employed to generate Nε,Nε-dimethyl-L-lysine-containing H3 proteins. After ncAA insertion and protein isolation, the recombinant histone was subjected to a number of chemical modification steps that enabled methylation at the previously protected site. This approach was undertaken to circumvent the inability of wild-type PylRS to activate Nε,Nε-dimethyl-L-lysine and the lack of specificity of evolved variants of this enzyme. [42] Accordingly, a monomethylated lysine residue (Nε-methyl-L-lysine) was introduced in the photocaged state (13) into myoglobin by a heterologous Pyl o-pair (M. barkeri PylRS and M. mazei tRNA), and the system was shown to be functional in both E. coli and mammalian cells. [45] That and a similar photocaged version of monomethylated lysine (14) were synthesized and introduced into green fluorescent protein (GFP) and Z domain protein. Here, the decaging process included UV photolysis (13) or hydrogenation (14). [46] A similar approach was utilized on a separate occasion, in this case with the M. mazei enzyme, where a Pyl-OTS was evolved to facilitate Nε-((allyloxy)carbonyl)-Nε-methyl-L-lysine (15) incorporation in bacteria; this AA can be converted to Nε-methyl-L-lysine with a ruthenium catalyst. [46] In addition to introducing posttranslational modifications by decaging translated ncAAs, very large moieties can be incorporated by exploiting the native chemical ligation strategy. The strategy involves a thiol group in the first protein that can attack a C-terminal thioester of a second protein. As 1,2-aminothiols can react with thioesters to form an amide bond, the ncAA δ-thiol-L-lysine (16) was introduced by Pyl-OTS for protein ubiquitination. [47] The protein containing 16 was then reacted with a ubiquitin thioester and, after desulfurization of the complex, ubiquitin conjugates were isolated.
The challenge of incorporating ncAAs in multicellular organisms was first met by using Pyl-OTS in C. elegans. [48] tRNA Pyl was used to decode the amber stop codon and introduce Nε-(tert-butoxycarbonyl)-L-lysine (10) and Nε-((prop-2-yn-1-yloxy)carbonyl)-L-lysine (1). Since the ncAA-containing protein was produced from stop codon-containing transcripts, the nonsense-mediated decay (NMD) pathway, which targets transcripts with internal termination codons for destruction, had to be circumvented. This was achieved by using a smg2 knockout strain, where the product of the smg2 gene is a component of the NMD pathway. [49] In D. melanogaster, genetic code expansion was also achieved using Pyl-OTS. Site-specifically labeled proteins were produced containing Nε-((bicyclo[2.2.1]hept-5-en-2-ylmethoxy)carbonyl)-L-lysine (8) and reacted with a tetrazine probe. [50] Moreover, the same ncAA was stochastically incorporated into proteins in specific tissues and at specific times. This was achieved by crossing flies that expressed Pyl-OTS and a gene of interest with flies expressing Gal4 in specific tissues and at specific times. The transcriptional activator Gal4 is expressed using a tissue-specific promoter and, when present, activates the expression of transgenes under Gal4 upstream activating sequences (Gal4 UAS). Both pylS and the gene of interest with an internal stop codon were placed under Gal4 UAS, and in this manner site-specific incorporation of the ncAA was enabled only when Gal4 was present, i.e. in a specific tissue and at a specific time. [50] While the incorporation of ncAAs into proteins with the Pyl-OTS has been extensively reported in eukaryotes, one notable exception has been yeast, where there have been only two reports of Pyl-OTS use in S. cerevisiae for proof-of-principle and optimization purposes. To date Pyl-OTS has not been used in any yeast species other than S. cerevisiae. The first reported use of the Pyl-OTS in yeast utilized the M. mazei PylRS and M. mazei tRNA Pyl pair. [51] The authors used an M. mazei PylRS variant that had been selected in E. coli to utilize Nε-(tert-butyloxycarbonyl)-L-lysine (10). [3a] A weak phenotype dependent on the presence of the ncAA was observed, demonstrating both the orthogonality and function of the M. mazei PylRS·tRNA Pyl pair in S. cerevisiae. Although a weak phenotype was observed, a reporter protein was not isolated and the ncAA incorporation was not confirmed through MS, making it difficult to assess the efficiency of incorporation.
Further work to characterize and optimize the Pyl-OTS in S. cerevisiae successfully utilized the M. barkeri PylRS·M. mazei tRNA Pyl pair and derivatives for the incorporation of several lysine analogs. [38] Initially, the authors used the M. barkeri PylRS·M. barkeri tRNA Pyl pair, which had successfully been used in other organisms; [40] however, due to a lack of orthogonality the M. barkeri tRNA Pyl was mutated to that of M. mazei tRNA Pyl. Expression of the tRNA Pyl was a challenge, and the use of external RNA polymerase III promoters that have been effective in heterologous tRNA expression in yeast did not produce sufficient amounts of functional tRNA Pyl for effective suppression. This was overcome through engineering of the yeast dicistronic tRNA operon encoding tRNA Arg and tRNA Asp, where the two mature tRNAs are produced from a dimeric precursor RNA. [52] In this instance the tRNA Asp gene was replaced by the gene for tRNA Pyl, and both tRNA genes were transcribed from the internal promoter of the tRNA Arg gene. Using the M. mazei tRNA Pyl with variants of M. barkeri PylRS resulted in Gal4 read-through based phenotypes dependent on the required ncAA. While incorporation was confirmed for 3 of the 5 analogs utilized, the yields of reporter protein obtained (30-100 µg/L) were nearly two orders of magnitude below what can be obtained with other ncAAs and orthogonal pairs in yeast. [53] While only two studies have been reported, it is clear that both the expression and the orthogonality of tRNA Pyl in yeast differ from those in other eukaryotic cells. The distinct challenges of using this OTS in yeast, together with the known low solubility of PylRS in other organisms and the low activity of PylRS and its variants toward ncAAs, likely account for the low incorporation efficiency observed with this orthogonal pair in yeast.

New Routes to Pyl
A complete understanding of the kinetic features of the Pyl-OTS is still lacking due to the difficult nature of Pyl chemical synthesis. The first reported chemical synthesis of Pyl involved coupling of (4R,5R)-4-methylpyrroline-5-carboxylic acid to lysine. [54] During synthesis, the sensitive carboxypyrroline ring is exposed to strongly acidic conditions, which may be responsible for pyrroline ring degradation and low Pyl yields. To avoid this, an alternative synthetic route was developed in which the pyrroline ring is generated in the penultimate step of the synthesis, which resulted in a 2 times higher yield of Pyl. [55] An appealing alternative to chemical synthesis is the transplantation of the Pyl biosynthetic cluster into E. coli for Pyl production. Because Pyl is synthesized from Lys, no additional amino acid needs to be supplemented in the culture media. This strategy was used to produce Pyl-containing proteins in E. coli, as the Pyl-OTS can be co-transformed into the same strain. [56] Interestingly, supplementing the culture medium with D-ornithine results in the production of pyrroline-carboxy-lysine (Pcl, 17) instead of Pyl. Furthermore, different 3′-substituted D-ornithine derivatives (3MO analogs) can be added to create Pyl analogs with the help of an incomplete Pyl biosynthetic cluster. [57] Only the genes for PylC and PylD are needed to produce Pyl if 3MO or a 3MO analog is added. For this reason, adding D-ornithine or (3S)-ethynyl-D-ornithine to the media results in the production of desmethylpyrrolysine (dmPyl, 18) and ethynyl-pyrrolysine (ePyl, 19), respectively. Although the production of Pyl analogs was streamlined for amber suppression and incorporation, [57] manipulating the Pyl biosynthetic cluster opens new perspectives on intracellular ncAA production.

Stop Codon Suppression vs. Sense Codon Reassignment
In-frame stop codons targeted for ncAA incorporation in a gene of interest are not the only such stop codons in the transcriptome of a model organism (with the exception of recoded E. coli strains), [1b,58] as the same codons occur as natural stop codons in a variety of the host's cellular proteins. Decoding such sites with the aid of an introduced o-pair may have deleterious consequences, as illustrated by the Sep-OTS in E. coli [59] or the Pyl-OTS in certain mammalian cell lines. [60] In E. coli, natural stop codons are not equally susceptible to o-tRNA mediated decoding, and this susceptibility depends upon the presence of the appropriate release factor and/or the introduction of ribosomal proteins with lower affinity for the same release factor. [61] In mammalian cell lines, the transcriptional response to amber suppression varies greatly between different cell lines, with the transcription of over 1000 genes dysregulated after introducing Pyl-OTS (or a derivative AcKRS-OTS) into mouse embryonic stem cells and only 11 in mouse embryonic fibroblasts. [60] Apart from the toxic effect that a suppressor tRNA may exert on cellular fitness, protein production via SCS suffers from variable yields as a consequence of the competition between the suppressor tRNA and the cellular release factors for binding to the same stop codon. This problem is partially alleviated by using strains with release factor deletions. [58] An alternative to using a release factor deletion strain is to target sense codons. As PylRS does not recognize the tRNA anticodon, any sense codon can in principle be targeted for ncAA incorporation by Pyl-OTS. Sense codon reassignment was previously attempted in Mycoplasma capricolum. As this organism's genome contains only 6 instances of the CGG (arginine) codon and lacks the corresponding tRNA Arg CCG, this codon was considered to be an "open" sense codon. Unfortunately, the engineered tRNA Pyl CCG was shown not to be orthogonal in this organism as it was recognized by M. capricolum ArgRS. [62]
This problem was circumvented in E. coli by introducing serine and leucine anticodons into tRNA Pyl. [28] As neither E. coli LeuRS nor SerRS uses the tRNA anticodon as a recognition element, these enzymes do not cross-react with tRNA Pyl CAG or tRNA Pyl ACU, resulting in orthogonal tRNA Pyl variants that are able to decode sense codons. Using the M. mazei PylRS IFRS variant, which aminoacylates tRNA Pyl with 3-iodo-L-phenylalanine (3-I-Phe), it was possible to quantify incorporation at targeted Ser AGU codons. The extent of 3-I-Phe incorporation at the targeted site in a GFP reporter protein was ~65 %. [28] The fact that the efficient IFRS can, in part, accomplish reassignment of a high-frequency serine codon [28] opens new opportunities for using this enzyme as an integral part of the E. coli translation machinery.
Another strategy to utilize sense codons for GCE purposes consists of targeting degenerate codons that occur in the genome at a low frequency (rare codons) and are decoded by a low-abundance tRNA isoacceptor. The approach requires elimination of the host's dedicated tRNA and the synonymous replacement of the targeted codons within the genome (at least in essential genes). Recently, a critical step toward successful emancipation of the rare AUA (Ile) codon was achieved by introducing Mycoplasma mobile tRNA Ile UAU into E. coli. M. mobile tRNA Ile UAU is able to translate AUA codons without modification of its anticodon, while E. coli tRNA Ile2 CAU depends on tRNA Ile-lysidine synthetase (encoded by tilS) to modify its C34 to 2-lysyl-cytidine for proper decoding of rare AUA codons. As M. mobile tRNA can rescue the lethal tilS deletion through direct AUA codon reading, it is now possible to introduce an orthogonal IleRS and reassign the AUA codon to a desired ncAA. [63] Successful reassignment of a rare Arg codon (AGG) was also achieved in E. coli with M. mazei PylRS•tRNA Pyl CCU. [64] In contrast to rare AUA codons, AGG codons are recognized by two native tRNAs, tRNA Arg4 UCU (which translates both AGA and AGG codons) and the low-abundance tRNA Arg5 CCU (devoted only to AGG decoding). Although tRNA Arg5 CCU is dispensable in E. coli, tRNA Arg4 UCU is required to decode AGA codons; however, the levels of tRNA Arg4 UCU needed to be reduced to avoid wobble decoding of AGG codons and enable preferential reading by the orthogonal tRNA Pyl CCU. [64b] Sense codon reassignment offers the exciting possibility of introducing artificial amino acids throughout the organism's proteome.
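The two bookkeeping steps that rare codon reassignment relies on, counting how often a codon occurs and replacing it synonymously, can be sketched in a few lines. This is a hedged illustration only: the helper names (`split_codons`, `count_codon`, `recode`) and the short example sequence are hypothetical, and the choice of CGU as the synonymous Arg codon is an assumption made for the example rather than a recommendation from the studies cited above.

```python
from collections import Counter


def split_codons(cds):
    """Split an in-frame coding sequence (DNA or RNA) into RNA codons."""
    seq = cds.upper().replace("T", "U")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def count_codon(cds_list, codon):
    """Count occurrences of one codon across a list of coding sequences."""
    counts = Counter()
    for cds in cds_list:
        counts.update(split_codons(cds))
    return counts[codon]


def recode(cds, target, synonym):
    """Replace every in-frame `target` codon with a synonymous codon."""
    return "".join(synonym if c == target else c for c in split_codons(cds))


# Hypothetical mini-gene with two rare AGG (Arg) codons.
gene = "AUGAGGCGUAGGUAA"
print(count_codon([gene], "AGG"))      # 2

# AGG and CGU both encode Arg, so the protein sequence is unchanged.
freed = recode(gene, "AGG", "CGU")
print(freed)                           # AUGCGUCGUCGUUAA
print(count_codon([freed], "AGG"))     # 0
```

Once a codon no longer occurs in the recoded genes, any copy reintroduced into a gene of interest is read only at the intended site, which is the rationale for the synonymous replacement step.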

Is Orthogonality Impairing Functionality?
Although the unusual structure of tRNA Pyl helps maintain orthogonality to host aaRSs and tRNAs, ncAA-tRNA Pyl still has to be efficiently recognized by the host elongation factor and the ribosome. It was recently recognized that tRNA Pyl might not be an optimal substrate for the bacterial EF-Tu. Improved tRNA Pyl binding was achieved through mutagenesis of nucleotides in the acceptor and T-stems (which are expected to interact with the elongation factor). [65] Using the optimized version of tRNA Pyl (tRNA Pyl -opt), the authors were able to show increased incorporation of multiple AcK residues into three different proteins by an engineered PylRS variant. In vitro kinetic measurements with PylRS variants showed that tRNA Pyl -opt is essentially indistinguishable from wild-type tRNA Pyl as an aminoacylation substrate, suggesting that the improved suppression efficiency of tRNA Pyl -opt originates from its improved interaction with EF-Tu.

CONCLUSION
The Pyl-OTS is a powerful tool for genetic code expansion. To a large extent this rests on the astonishing polyspecificity of PylRS toward ncAAs and the orthogonality of Pyl-OTS in the majority of common host organisms. While biochemical and structural studies have aided our understanding of Pyl and ncAA binding, a thorough kinetic analysis is still lacking due to the limited availability (through chemical synthesis) of Pyl and the unexplored possibility of large-scale preparation of Pyl from in vivo sources. As shown above, the Pyl-OTS offers a robust methodology for ncAA incorporation, and it has been widely applied. However, much remains to be discovered to make the Pyl-OTS a truly superior synthetic biology tool for GCE.

Figure 1. Schematic representation of the stop codon suppression strategy. PylRS aminoacylates an orthogonal suppressor tRNA (anticodon CUA) with a non-canonical amino acid (ncAA). The ncAA-tRNA Pyl CUA is delivered to the ribosome by the host elongation factor EF-Tu. At the ribosome, the ncAA-tRNA Pyl CUA decodes an internal stop codon (UAG) and the ncAA is incorporated into the protein of interest.

Figure 2. Structure of the PylRStRNA Pyl complex from D. hafniense (PDB ID 2ZNI) from two perspectives.Individual monomers of the PylRS are shown in grey and magenta, tRNA(I) in light blue and tRNA(II) pink.

Figure 3. PylRS topology and α-amino group recognition in the active site. The enzyme is composed of two identical monomers (grey and magenta) forming a homodimer. Characteristic class II aaRS motifs are highlighted: motif 1, which forms part of the dimer interface, is shown in green and the motif 2 loop (involved in tRNA 3′-CCA recognition) in blue. Pyl is bound in the active site and shown in yellow. (Inset) Idiosyncratic recognition of the Pyl α-amino group in M. mazei PylRS (PDB ID 2ZCE). Instead of directly recognizing the α-amino group of the AA substrate, PylRS uses a water molecule bound to Asn-346. The water molecule (cyan) is shown as a ball. Hydrogen bonds are shown as black dashed lines. (B) Malleability of the active site as illustrated by comparison of the wild-type enzyme and a norbornene-charging mutant (PDB ID 2Q7H and 4BWA, respectively). The wild-type enzyme is shown in magenta and the triple Y306G/Y384F/I405R mutant in cyan. Pyl-AMP and norbornene-AMP (nonhydrolyzable analogs) are shown as sticks in yellow and green, respectively, with transparent surfaces indicating the substrate-binding pocket. In the triple mutant, the Y306G mutation at the base of the AA binding pocket enlarges the binding site, thus allowing placement of the norbornene head group (this space is normally occupied by the Tyr-306 side chain; see the wild-type structure). In the wild-type enzyme, Asn-346 forms a hydrogen bond with the carbonyl-amide linkage of the Pyl side chain. In contrast, the carbamate linkage of the norbornene does not form a bond with Asn-346 but with Cys-348.

Table 1. Steady-state kinetic parameters for various ncAAs and PylRS enzymes.
(c) All structures are of M. mazei enzymes.