Skip to main content
Advertisement
  • Loading metrics

Decomposing bulk signals to reveal hidden information in processive enzyme reactions: A case study in mRNA translation

  • Nadin Haase,

    Roles Formal analysis, Investigation, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Leibniz University Hannover, Institute of Cell Biology and Biophysics, Hannover, Germany

  • Wolf Holtkamp,

    Roles Investigation, Writing – review & editing

    Affiliations Max Planck Institute for Multidisciplinary Sciences, Department of Physical Biochemistry, Göttingen, Germany, Paul-Ehrlich-Institut, Division of Allergology, Langen, Germany

  • Simon Christ,

    Roles Data curation, Software, Visualization, Writing – review & editing

    Affiliation Leibniz University Hannover, Institute of Cell Biology and Biophysics, Hannover, Germany

  • Dag Heinemann,

    Roles Supervision, Writing – review & editing

    Affiliations Leibniz University Hannover, Hannover Centre for Optical Technologies (HOT), Hannover, Germany, Leibniz University Hannover, Institute of Horticultural Production Systems, Hannover, Germany, Leibniz University Hannover, PhoenixD Cluster of Excellence, Hannover, Germany

  • Marina V. Rodnina,

    Roles Investigation, Resources, Supervision, Writing – review & editing

    Affiliation Max Planck Institute for Multidisciplinary Sciences, Department of Physical Biochemistry, Göttingen, Germany

  • Sophia Rudorf

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing

    rudorf@cell.uni-hannover.de

    Affiliation Leibniz University Hannover, Institute of Cell Biology and Biophysics, Hannover, Germany

Abstract

Processive enzymes like polymerases or ribosomes are often studied in bulk experiments by monitoring time-dependent signals, such as fluorescence time traces. However, due to biomolecular process stochasticity, ensemble signals may lack the distinct features of single-molecule signals. Here, we demonstrate that, under certain conditions, bulk signals from processive reactions can be decomposed to unveil hidden information about individual reaction steps. Using mRNA translation as a case study, we show that decomposing a noisy ensemble signal generated by the translation of mRNAs with more than a few codons is an ill-posed problem, addressable through Tikhonov regularization. We apply our method to the fluorescence signatures of in-vitro translated LepB mRNA and determine codon-position dependent translation rates and corresponding state-specific fluorescence intensities. We find a significant change in fluorescence intensity after the fourth and the fifth peptide bond formation, and show that both codon position and encoded amino acid have an effect on the elongation rate. This demonstrates that our approach enhances the information content extracted from bulk experiments, thereby expanding the range of these time- and cost-efficient methods.

Author summary

Ensemble (bulk) in-vitro experiments are less complex and expensive than single-molecule measurements, but their read-outs are blurred due to the stochastic nature of biomolecular processes. Our study investigates how information about individual steps in processive reactions can be deduced from ensemble experiments, despite these inherent challenges. We introduce a least-squares method for decomposing ensemble signals from synchronized enzymatic reactions, allowing us to reveal distinct features of individual reaction steps. Using mRNA translation as a showcase, we propose a systematic approach to assess the feasibility of ensemble signal decomposition. To demonstrate the efficacy and limits of our approach, we decompose simulated fluorescence signatures of in-vitro mRNA translation processes. We then apply our method to experimental data of LepB mRNA translation, where a codon-by-codon analysis reveals that both the codon position and the encoded amino acid impact the translation dynamics. Our approach can be applied to assess bulk signals of other processive enzymatic reactions beyond mRNA translation, provided a comprehensive quantitative understanding of the process is available. The introduced method offers a robust means to extract detailed information from bulk experimental setups, thus helping to bridge the gap between cost-effective bulk experiments and detailed single-molecule studies.

Introduction

Many biochemical reactions are driven by processive enzymes that catalyze multiple rounds of reactions while staying attached to the same polymeric substrate molecule. Prominent examples of processive enzymes are DNA and RNA polymerases, ribosomes, cellulases, or endonucleases [1, 2]. To study the kinetics of the biochemical processes driven by processive enzymes, basically two distinct approaches can be taken: single-molecule experiments and ensemble studies [35]. In single-molecule experiments, the action of an individual enzyme is monitored over time, which results in a detailed picture of the step-by-step motion of this molecule (see Fig 1). Because thermal fluctuations cause differences in the molecular activities, single-molecule experiments have to be performed on a multitude of individual enzymes to obtain a complete picture of the biochemical process. This need for repetition in combination with an often complicated setup can make single-molecule experiments a technical and financial challenge. In comparison, ensemble (bulk) experiments are relatively cost- and time-efficient. Their results have a high statistical power, because they are performed on a large number of enzymes in parallel. However, the signal of an ensemble experiment is typically a superposition of all individual signals, each of which is emitted by a single enzyme in the reaction volume. Due to the stochasticity of processive biochemical processes, these individual signals differ from each other to some extend. Consequently, features or characteristics of the individual signals are blurred or even lost during superposition and, thus, are not present in the ensemble signal [6, 7]. The more time has passed after start of the experiment, the more pronounced is this blurring, even for initially synchronized ensemble reactions. Therefore, ensemble time traces need to be computationally decomposed to reveal hidden information about the underlying enzymatic process [8]. A meaningful computational analysis of ensemble data relies on a mathematical model that fulfills two requirements: First, the model has to reproduce the experimental data and, second, it must do so with an appropriately limited set of parameters. In general, an under-parameterized model lacks the complexity required to reflect the true biochemical process, whereas model over-parameterization leads to overfitting, which causes ambiguous and unreliable results [9, 10]. However, which number of parameters is appropriate is highly case-specific.

thumbnail
Fig 1. Schematic representation of the decomposition of ensemble time traces.

a): The action of an individual processive enzyme is monitored over time. Changes in the signal are interpreted to obtain a detailed picture of step-by-step transitions and related kinetics of this molecule. b): Ensemble time trace for a large number of enzymes in parallel. The detected signal is a superposition of all individual signals. Due to the stochasticity of biochemical processes, these individual signals differ from each other to some extend. Consequently, the features of the individual signals are blurred or even lost during superposition. c): The ensemble time trace needs to be computationally decomposed to reveal the hidden information on the transitions of a single molecule.

https://doi.org/10.1371/journal.pcbi.1011918.g001

In the context of molecular biology and biochemistry, fluorophores are used for different purposes, e.g., for visualization of cell compartments or localization of compounds [11, 12]. Because the intensity of the light emitted by a fluorophore depends on the physico-chemical properties of environment via quenching and de-quenching, fluorophores can also be applied as molecular probes for state transitions in biochemical reactions [1318]. For the latter case, it is required that to each state of the reaction process a specific fluorescence intensity can be assigned. We refer to these state-specific fluorescence intensities as “intrinsic fluorescence intensities” (IFIs). The aim of the decomposition of an ensemble fluorescence time trace is to determine these state-specific signal intensities for the individual states. For the sake of clarity, we restrict our analysis to processes that can be assumed to have a fixed (non-stochastic) number of rounds of reactions, i.e., where fluctuations in the processivity can be neglected (e.g., template-directed reactions such as mRNA translation or nucleic acid polymerization). Furthermore, we assume that the state-specific signal intensities are the only parameters to be determined by the decomposition procedure and that the kinetic rates related to the action of the enzyme do not need to be inferred from the particular ensemble time trace that is to be decomposed. Of course, determination of IFIs by signal decomposition can be part of an extended experimental and model fitting strategy that in addition also allows for the fitting of kinetic parameters, as we have shown [8].

A well-known example for a processive biochemical process is messenger RNA (mRNA) translation, i.e., the biosynthesis of peptides by molecular machines called ribosomes [19]. Briefly, during translation initiation a ribosome binds to mRNA, i.e., single-stranded RNA which consists of a sequence of trinucleotides called codons that encode for the different amino acids. After binding to the mRNA, the ribosome moves from codon to codon catalyzing the peptide bond formation of the corresponding amino acid molecule and the nascent peptide chain. Translation of the mRNA is finished when the ribosome reaches the stop codon and gets released from the mRNA. Premature drop-off of ribosomes from mRNAs is a rare event (probability of roughly 10−4 per codon [20]), therefore the processivity of a translating ribosome is mostly simply determined by the length of the mRNA, i.e., the number of codons the mRNA consists of. Except for a few specific cases [2126], directly assessing the kinetics of mRNA translation in its natural in-vivo environment, i.e., in the living cell, is currently infeasible without heavily disturbing the system. However, the movement of ribosomes along mRNAs can be indirectly monitored in in-vitro translation experiments by using fluorescent molecules that are attached to the N-termini of the nascent peptides, see ref. [8] for details. The experiments are constructed in a way such that each mRNA is translated only a single time: Once a ribosome reaches the end of the truncated mRNA, it is bound to remain there and does not engage in another round of translation. This yields a unique fluorescence time trace for each translated mRNA that we refer to by the mRNA’s “fluorescence signature”. All ribosomes are initially synchronized and are simultaneosly translating identical mRNA sequences [8]. The ensemble fluorescence time trace (or ensemble fluorescence signature) is a superposition of all fluorescence signals. Translation is a stochastic process, thus the ensemble fluorescence signature obtained from a bulk experiment is much smoother than the individual fluorescence time traces generated by single mRNA translation processes, which consist of discrete gradations [8]. The goal is to find the fluorescence intensities that are associated with the individual states of the translation process from the ensemble fluorescence signature.

In this work, we investigate the feasibility of the decomposition of ensemble time traces that result from the simultaneous action of a multitude of identical, initially synchronized ribosomes. In particular, we consider the typical processivity of the enzymes and analyze its impact on the meaningfulness and truthfulness of ensemble signal decomposition. At first, we show that time traces of low-processivity reactions can be decomposed with good fidelity, in contrast to reactions with higher processivity. We then analyze this phenomenon in terms of the condition number associated to the underlying biomolecular process. Finally, we show that the application of a regularization method can expand the applicability of ensemble signal decomposition to reactions with higher processivity.

Results

Decomposition of time traces from low-processivity reactions: Fluorescence signatures of mRNAs consisting of only four codons can be fitted by standard least-squares approach

At first, we study the consecutive translation of only four codons by the ribosome, which can be seen as an enzymatic action with comparably low processivity. To assess the quality of the decomposition results and to precisely quantify the deviations of output (fitted) from input (assigned) parameter values, we use simulated ensemble fluorescence signatures instead of experimentally obtained data: We first generate artificial ensemble fluorescence signatures from mRNAs that consist of four identical codons. Then, we “forget” all IFI values that were used to generate the data and, instead, apply fluorescence signature decomposition to obtain estimates for the IFIs.

For data generation, we simulate translation as a continuous-time Markov process where the states comprising the Markov chain represent the steps that a ribosome undergoes during translation. Each state in the Markov process is associated with a respective IFI. The Markov model used in this work includes 6n + 2 states and n + 3 assigned IFIs, where n is the number of codons of each mRNA. Therefore, the Markov model for translation of an mRNA consisting of four codons includes 26 states and seven IFIs. The IFIs are assigned to the states in a way such that they represent expected changes in fluorescence intensity during the biochemical process of translation, e.g., after translocation of the ribosome to the next codon. Therefore and in the interest of fit parameter reduction some states are assigned the same IFI value. For further information on the Markov model and the assignment of the IFIs, see [8] and Methods. Using our Markov model of translation, we calculate the time-dependent state occupancy probabilities P, see Methods. These probabilities express how likely it is to find a ribosome at a specific time in a certain state. Furthermore, we construct IFI input vectors consisting of n + 3 random positive real numbers. The simulated input ensemble fluorescence signature is then given by the sum over all components of the IFI vector weighted by the time-dependent state occupancy probabilities P of the Markov process: (1)

To imitate experiments in which thermal fluctuations modulate the fluorescence signal, we overlay the simulated input fluorescence signature by a Gaussian random noise such that resulting final simulated data are given by , see Methods.

To test our decomposition strategy for the simulated ensemble fluorescence signature, we pretend that all state-specific IFIs are unknown. Analogously to experimental data analysis, where short-term fluctuations are smoothed out, the simulated data are also smoothed by a moving average operation, i.e., . Fig 2a) shows an example for a smoothed simulated ensemble fluorescence signature . The smoothing step is necessary to fit the model using a least-squares solver to obtain the output IFI vector (2) see Methods for details.

thumbnail
Fig 2. Fluorescence signature of an mRNA that consists of four identical codons for random IFI input vector.

a): The simulated fluorescence signature is compared to the best theoretical fit in terms of least squares. The theoretical model and simulated data curves are in perfect agreement. b): Fitted IFIs obtained from the analysis of the fluorescence signature compared to the given IFI input vector.

https://doi.org/10.1371/journal.pcbi.1011918.g002

Ideally, a perfect fit would reveal the IFI values that were used to generate the fluorescence time traces, i.e., . However, due to the added noise, the output and input IFI values and may slightly deviate. Figure A in S1 Text shows that the uncertainty is lower in fitted IFI values that correspond to the states in the beginning and towards the end of the studied process. This inhomogeneity in the fit quality is a consequence of the constraints given by the particular “imitated” experimental setup [8]: First, the translation processes are initially synchronized in the sense that all ribosomes simultaneously start to translate the mRNA molecule that they are bound to. By definition, the intrinsic fluorescence intensity that corresponds to the initial state is set to IFI1 = 1, i.e., it serves as a reference for the normalization of all fluorescence values. Second, all ribosomes end up and remain in the same last “end” state. Therefore, the IFI corresponding to the “end” state can be well determined as long as the process is monitored long enough, see also Fig B in S1 Text.

In summary, time traces from initially synchronized biochemical reactions with low processivity can be decomposed by standard least-square approaches, and even subtle differences in intrinsic fluorescence intensities of different states can be detected, see Fig 2b).

Standard least-squares approach is not sufficient to decompose time traces from higher-processivity reactions

In this paragraph, we show that signal time traces from reactions with higher processivity generally cannot be decomposed by standard least-square methods and that, instead, more elaborate data analysis methods are required to obtain meaningful fits. To show the limits of the standard least-square approach, we proceed as described in the previous paragraph. However, here we analyze simulated ensemble fluorescence signatures corresponding to translation of mRNAs consisting of 26 identical codons, see Fig 3a), instead of just four codons as in the previous paragraph. The Markov model includes 6n + 2 = 158 states and n + 3 = 29 assigned IFIs, with n = 26 the number of codons on the mRNA. Decomposition of these ensemble fluorescence signatures by a standard least-square approach reveals IFI values that greatly deviate from the randomly chosen input IFI values that were used to generate the fluorescence data, see Fig 3b). The components of the fitted IFI vector exhibit strong oscillations, span several orders of magnitude, and have negative values. These unreasonable fitting results indicate that the present least-squares problem is ill-posed [27]. To validate this assumption, the singular values σi and the condition number κ(P) of matrix P are calculated, see S2 Text for definitions and further details: The singular value decomposition of matrix P reveals that the singular values σi decay gradually to zero, see Fig C in S1 Text. Furthermore, the condition number κ(P) of matrix P is very large, with a value of κ(P) = 4.8 × 109. Both criteria imply that the matrix P is ill-conditioned, which means that the computed solution of the ill-posed problem Eq (2) is potentially very sensitive to perturbations in the input data, i.e., the presence of thermal noise in the simulated ensemble fluorescence signature [27, 28]. Fig 4a) shows that the condition number grows exponentially with the level of processivity, i.e., the number of repeats of the enzymatic action performed by the processive enzyme. This exponential growth is accelerated when the number of repeats is larger than five. The condition number corresponding to translation of mRNAs consisting of 26 codons is several orders of magnitude higher than the condition number corresponding to translation of four-codons mRNAs. Therefore, solving Eq (2) for the unknown IFI values in the sense of least-squares is feasible for signals from the translation of 4-codon-mRNAs, but is an ill-posed problem for longer mRNAs, e.g., in the case of mRNAs with 26 codons. In addition, the condition number depends on the kinetics of the biochemical reaction, see Fig 4b). Here, the kinetics are determined by the elongation rate of the translating ribosomes, which is reflected in the state occupancy probabilities matrix P.

thumbnail
Fig 3. Fitting fluorescence signatures of longer mRNAs reveals signs of an ill-posed problem.

a): Fluorescence signature of an mRNA that consists of 26 identical codons for a random IFI input vector. The simulated fluorescence signature is compared to the best theoretical fit in terms of least squares. No regularization is applied. b): Fitted IFIs obtained from the decomposition of the fluorescence signature in a) compared to the given IFI input vectors. No regularization is applied.

https://doi.org/10.1371/journal.pcbi.1011918.g003

thumbnail
Fig 4. Characteristics of the occupancy probabilities matrix P.

a): Condition number κ for translation of mRNAs consisting of n identical codons with elongation rates ωelo = 10s1 and ωelo = 20s1. b): Condition number κ for translation of an mRNA that consists of 26 identical codons for various elongation rates ωelo.

https://doi.org/10.1371/journal.pcbi.1011918.g004

Application of Tikhonov regularization to restore decomposability of signals from high-processivity processes

To find a meaningful solution of a linear ill-posed problem by the method of least squares, the inclusion of additional information is required, which is commonly done by applying Tikhonov regularization [29]. The purpose of regularization is to include side constraints and provide robust methods for choosing the weight given to these side constraints to solve the ill-posed problem, see S2 Text for an introduction to the method. Briefly, a damping is added to filter out the components corresponding to the small singular values σi of the matrix P [28]. The amount of applied filtering is controlled by the regularization parameter α. For larger values of the regularization parameter α, stronger regularization is applied, i.e., more filtering is introduced, the fitted solution vector gets smoother and changes only little with higher α. Simultaneously, the regularization error increases, which means that some physically meaningful components of the unknown solution are filtered out (regularization error). For a small regularization parameter α, the fitted solution vector is dominated by terms corresponding to the smallest singular values σi and its entries show oscillatory sign changes (perturbation error). In addition, small changes in α result in strong changes in the solution . To choose an optimal value for the regularization parameter α, the norm of the regularized solution and the residual norm are plotted against each other. For discrete ill-posed problems, the plot in log-log scale shows a characteristic L-shaped curve with the optimal regularization parameter α corresponding to the corner of the curve [27, 30]. The optimal regularization parameter α balances the regularization and the perturbation error to find a stable solution.

Fig 5a) shows an example of the L-curve for the decomposition of the fluorescence signature of an mRNA that consists of 26 identical codons. For three different choices of α, the corresponding fitted IFI vectors are shown in Fig 5b), together with the random input IFI vector . For all choices, Tikhonov regularization led to much more reliable fit results compared to the standard least-square approach (Fig 3b)). However, high-frequency features of the input vector are filtered out and thus cannot be recovered by the fitting. For small regularization parameters (see α = 0.005 in Fig 5b)), the perturbation error increases and oscillatory sign changes appear. Nonetheless, the fitted IFI values are still smoother than the input values. Because of this effect, the fit quality depends on the smoothness of the input IFI values. Furthermore, for the same reasons as discussed above for the decomposition of fluorescence signatures of 4-codon-mRNAs, IFI values corresponding to the first and the last states of the translation process are fitted best, see also Fig D in S1 Text.

thumbnail
Fig 5. Decomposition of fluorescence signatures of mRNAs consisting of 26 identical codons with Tikhonov regularization.

a): For different values of α, the norm of the regularized solution is plotted versus the residual norm . The log-log plot shows the characteristic L-shaped curve. The optimal regularization parameter in terms of trade-off between regularization and perturbation error is α = 0.01 and is found by locating the corner of the curve. b): Input and fitted IFI vectors for three different regularization parameter values.

https://doi.org/10.1371/journal.pcbi.1011918.g005

Additional simulation results can be found in Figs F—S in S1 Text and Figs A—H in S2 Text for a) the decomposition of fluorescence signatures of mRNAs with different lengths, b) translation processes with uniform and non-uniform translation rates, and c) processes where an unknown kinetic rate is determined simultaneously with the unknown IFI vector.

Decomposition of experimentally determined fluorescence signatures of LepB mRNA translation

In this paragraph, we apply our method to fluorescence signatures of LepB mRNA translation. As the kinetics of LepB mRNA translation are not known at codon resolution, the simultaneous fitting of unknown kinetic rates and IFI values is necessary (see Determination of kinetic rate ω45 in S1 Text for more details). To this end, we follow an approach introduced previously for the analysis of the translation of short poly(U) mRNAs [8]: Translation of truncated mRNAs (denoted here as LepB2 to LepB11) of increasing length in terms of codons is monitored via fluorescent molecules that are attached to the N-termini of the nascent peptide chains and move along the Escherichia coli ribosomal exit tunnel. The resulting fluorescence signatures are decomposed, where for short mRNAs the simultaneous fitting of kinetic rates and IFI values yields a unique solution. In contrast, for longer mRNAs the analysis gets ambiguous (see S2 Text) and requires to take into account additional information. A longer mRNA—e.g. LepB11—cannot be directly decomposed to reveal the IFI values together with the kinetics of the process. A step-by-step analysis of the translation of increasingly longer mRNAs is required, where the results obtained for a specific mRNA can be retained for each subsequent analysis.

As an example, Fig 6 shows the measured and fitted fluorescence signatures of translation for LepB3 and LepB11 (see Figs I to K in S2 Text for all nine analyzed mRNAs). During fluorescence signature decomposition, Tikhonov regularization was applied for mRNAs with five or more codons. The fitted intrinsic fluorescence intensities of the different states are depicted in Fig 7. The IFI that belongs to the translation states between binding of Phe and binding of the second Ala is considerably lower than all other IFI values (IFI5, see Fig 8b) for definition). The corresponding fitted kinetic rates are listed in Table 1. Notably, within the initial five codons (after the start codon), the translation process exhibits a significant slowdown at the fourth codon (Phe). This observation aligns with our earlier findings derived from the analysis of short poly(U) mRNA translation [8]. Furthermore, codons coding for Ile are the slowest in the studied mRNAs (positions 7 and 10), and codons coding for identical amino acids have very similar elongation rates (positions 1 and 5, 6 and 8). Together with our previous study [8], these results indicate that both the position of the codon and the encoded amino acid influence the translation rate.

thumbnail
Fig 6. Decomposition of fluorescence signatures of LepB mRNAs.

Measured fluorescence signatures of the in-vitro translation of LepB3 and LepB11 mRNA (colored lines) and best fits in terms of least squares (black lines).

https://doi.org/10.1371/journal.pcbi.1011918.g006

thumbnail
Fig 7. Fitted IFIs obtained from the analysis of the fluorescence signatures of truncated LepB mRNAs of different lengths.

The IFIs correspond to the different states of the translation process (see Fig 8 and [8]). The lowest IFI (IFI5) is associated to the states after binding of Phe and before binding of the second Ala to the nascent peptide chain. The IFIs associated to the artificial state of a ribosome at the end of a truncated mRNA are not shown.

https://doi.org/10.1371/journal.pcbi.1011918.g007

thumbnail
Fig 8. Representation of mRNA translation elongation as a Markov process and assigned IFIs for a mRNA that consists of four codons.

The Markov model for translation of a short mRNA consisting of n = 4 codons includes 6n + 2 = 26 states and n + 3 = 7 assigned IFIs. Each state in the process corresponds to one biochemically resolved step: State 01 represents the initiation complex with the start codon in the ribosomal A site. States 01-41 describe initial selection, including ternary complex binding and recognition, GTPase activation, GTP hydrolysis and rearrangement of EF-Tu. This is followed by tRNA accommodation and peptide bond formation (51). Afterwards, the ribosome translocates to the second codon (02). The elongation cycle is repeated for the next three codons before the ribosome reaches the end of the truncated mRNA (E4) where the P site is occupied by the fourth codon while the A site remains empty. See refs. [3945] for more details on the translation process. Possible transitions between the states are indicated by arrows and occur with transitions rates ωij, which is exemplified by the ternary complex binding rate ω01 and the unbinding rate ω10. The average rate of translation for one codon is denoted by ωelo. For the first codon, changes in the state-specific IFIs are assumed to occur both after peptide bond formation and translocation, and states assigned with identical IFI values have the same color. Note that due to initial synchronization of ribosomes, the two IFI changes on the first codon become visible in the ensemble fluorescence signature. For all following codons only one change in fluorescence intensity after peptide bond formation (after transition from state (4k) to state (5k)) is considered to avoid overfitting. The final intrinsic fluorescence intensity E4 corresponds to the artificial state of the ribosome at the end of the truncated mRNA. Previously published as the 6n2/n3 model in [8]. Nomenclature of IFIs for a): a generic mRNA, and b): the specific case of LepB5 mRNA.

https://doi.org/10.1371/journal.pcbi.1011918.g008

thumbnail
Table 1. Codon position-specific translation rates calculated from fitted in-vitro rates.

https://doi.org/10.1371/journal.pcbi.1011918.t001

Discussion

We presented a least-squares approach to decompose time-dependent ensemble signals of initially synchronized processive enzymatic reactions and analyzed the limits of the method. The decomposition allows to identify state-specific signal intensities for the individual steps of a reaction, which can be valuable information already by itself or part of a larger model-fitting procedure [8]. To demonstrate the method with a practical example, we modeled mRNA translation as a Markov process and assigned a specific intrinsic fluorescence intensity (IFI) to each state in the Markov chain. We computed the time-dependent fractions of ribosomes in the different states of the translation process and determined the corresponding ensemble fluorescence signatures of mRNA translation, including thermal noise. These simulated ensemble fluorescence signatures were then used to test our decomposition method by comparison of input and fitted output IFI values. Additionally, we applied our method to decompose experimental fluorescence signatures of LepB mRNA translation. We found that for reactions with low processivity (corresponding to seven different IFI values in our mRNA translation example), a standard least-squares approach is sufficient to reliably fit IFI values with good precision. In contrast, investigation of model properties in terms of singular values and condition numbers revealed clear signs of ill-posed problems for reactions with higher processivity. However, by using least-square fitting with Tikhonov regularization it is still possible to obtain meaningful solutions. From these observations we draw the conclusion that even before a bulk experiment with initially synchronized processive enzymes is conducted, the following steps can be taken to assess the feasibility of ensemble signal decomposition: First, a model that appropriately describes the stochastic biochemical process of interest needs to be developed, including the assignment of IFIs to the different states of the model. Second, based on this model, the matrix of state occupancy probabilities and its condition number have to be computed (see Eq (4) and S2 Text). For small condition numbers, a standard least-squares approach might be sufficient to decompose ensemble time traces, given that no other factors, such as a low signal-to-noise ratio, prevent a meaningful analysis of the data. Otherwise, regularization is required or, alternatively, strategies must be developed to help decrease the condition number, including, for example, additional measurements to reduce the number of free parameters. Bulk experiments are often much more cost- and time-efficient than methods probing single molecules directly, but they lack the depth of information that the latter approaches achieve. Decomposition of ensemble signals can expand the range of bulk methods by providing a way to extract more detailed information from the same experimental setup. For example, analysis of the codon-by-codon translation of LepB mRNA presented here demonstrates the differences of elongation rates at different codons and supports the notion that translation rate drops at the fourth translated codon (i.e., the last codon of LepB5 mRNA), consistent with earlier data [8]. We also observe slow translation at positions 7 and 10. The reasons for the observed slow down are unknown and may be related to both translation per se or protein folding in the narrow exit tunnel of the ribosome, which may regulate the translation rate. Of note, the stalling codons at positions 7 and 10 encode for Ile, which is one of the amino acids prevalent in inhibitory bicodons [31]. These results in combination with our previous findings [8] show that both the codon position and the encoded amino acid have an effect on the elongation rate. Also IFI analysis reveals an anisotropic environment of the N-terminal reporter group, which in the future can be used to study the effect of individual mutations in the peptide or mutations in the ribosome, e.g., on translational rates or co-translational protein folding. Our approach extends beyond the decomposition of fluorescence signatures in mRNA translation and can be applied to assess various bulk signals originating from biological processes. To achieve this, a prerequisite is a comprehensive understanding of the underlying process, including its potential states and the kinetic rates governing transitions between these states. The subsequent conscientious assignment of state-specific fluorescence intensities ensures biological relevance throughout the analysis.

Materials and methods

Modeling of mRNA translation as a Markov process

In the literature, different approaches are used for the modeling of mRNA translation and the determination of kinetic rates ([15, 3236]). In this work, we modeled mRNA translation as a time-continuous Markov process as is described in ref. [8, 37], and [38]. For the sake of consistency both in terms of modeling approach and model parameterization, like in our earlier work, the Markov model used here describes a large fraction of the translation-elongation cycle steps that were biochemically resolved and of which the corresponding kinetic rates were investigated by Rodnina and co-workers over the past decades [3945]. Fig 8 shows as an example the Markov model for translation of an mRNA that consists of four codons. By solving the master equations of the Markov process (3) the time evolution of the occupancy probability Pi(t), which is the probability to find a ribosome in state i = 1, …N at time t, is obtained for a given set of transition rates ωij from state i to state j (see Table 2). The average rate of translation ωelo for one codon can be calculated by first step analysis [46] using the transition rates ωij [37]. According to the experimental setup in [8], the translation processes are initially synchronized at state i = 0 at time t = 0, i.e., the initial condition is

The matrix of occupancy probabilities P is defined as (4) where N is the number of states in the Markov process and M is the number of considered time points. The time points of the simulation were selected based on the experimental temporal resolution [8].

Generation and fitting of test data

The random IFI input vector is created using the MATLAB function rand() for uniformly distributed random numbers as follows: , where n is the number of required IFI components. The values of the IFI input vector range between 0 and 2, where the first IFI is set to 1. The simulated ensemble fluorescence signature contains m = 1...M fluorescence values, one for each time point tm. Each individual component of is given by the sum over all components of the IFI input vector weighted by the time-dependent state occupancy probabilities P of the Markov process (see Eqs (1) and (4)). The simulated fluorescence signature is superimposed with a Gaussian random noise with a standard deviation of 0.005. The influence of the chosen standard deviation on the fitted IFI vector is shown in Fig E in S1 Text. The time points in the simulation and the random noise levels are based on experimental data [8]. The data are smoothed using the MATLAB function smooth() with the specified method moving and a subset size of 30 data points to obtain . See Fig 9 for an exemplary representation of , , and . We use the non-negative least-squares solver lsqnonneg in MATLAB to solve the system of linear Eq (2) with the components of the IFI vector as adjustable parameters.

thumbnail
Fig 9. Generation of simulated data.

a): Simulated ensemble fluorescence signature constructed from the random IFI input vector weighted by the time-dependent state occupancy probabilities P of the Markov process. b): Superimposition of Gaussian noise to imitate thermal fluctuations. c): The simulated data curve is smoothed using a moving average. Note: To make differences in and visible, a Gaussian noise with a standard deviation of 0.01 and a moving average with a non-optimal subset size of 200 steps were used in this figure.

https://doi.org/10.1371/journal.pcbi.1011918.g009

Buffer and reagents

All experiments were carried out in HiFi buffer (50 mM Tris-HCl, pH 7.5, 70 mM NH 4Cl, 30 mM KCl and 3.5 mM MgCl2, 8 mM putrescine, 0.5 mM spermidine, and 1 mM DTT) [47]. Chemicals were from Roche Molecular Biochemicals, Sigma Aldrich or Merck, nucleotide triphosphates from Jena Bioscience. Radioactive compounds were from Hartmann Analytic. Total E. coli tRNA was from Roche. EF-Tu, initiation factors, [3H]Met-tRNA fMet, and Bodipy-[3H]Met-tRNA fMet were prepared from E. coli as described [47]. 70S ribosomes were prepared from E. coli according to the published protocol [48]. 70S initiation complexes were prepared by incubating 70S ribosomes (typically 0.5 μM) with a 1.5-fold excess of IF1, 2, and 3, 1.5-fold excess of Bodipy-[3H]Met-tRNA fMet, and a 4-fold excess of the mRNA of the indicated length in the presence of 1 mM GTP, 3 mM phosphoenolpyruvate and 1 μg μL-1 pyruvate kinase for 15 min at 37°C. Translation was performed in the stopped-flow apparatus by mixing 70S initiation complexes (0.025 μM final concentration) with EF-Tu (15 μM), purified total aminoacyl-tRNA (40 μM) and EF-G (0.05 μM) in the presence of 1 mM GTP, 3 mM phosphoenolpyruvate and 1 μg μL-1 pyruvate kinase. The fluorophore Bodipy was exited at 470nm and the emission monitored after passing a 500 nm cut-off filter. The time courses are averages of 6–8 technical replicates.

Supporting information

S1 Text. File containing additional simulation results.

https://doi.org/10.1371/journal.pcbi.1011918.s001

(PDF)

S2 Text. File containing additional simulation results and further supplementary information.

https://doi.org/10.1371/journal.pcbi.1011918.s002

(PDF)

Acknowledgments

We thank Reinhard Lipowsky for valuable comments and fruitful discussions. Plots were produced using the julia programming language [49] and Plots.jl [50].

References

  1. 1. Breyer WA, Matthews BW. A structural basis for processivity. Protein Sci. 2001;10(9):1699–711. pmid:11514661
  2. 2. Hippel PHV, Fairfield FR, Dolejsi MK. On the processivity of polymerases. Ann N Y Acad Sci. 1994;726:118–31.
  3. 3. Zheng X, Bi C, Li Z, Podariu M, Hage DS. Analytical methods for kinetic studies of biological interactions: A review. J Pharm Biomed Anal. 2015;113:163–80. pmid:25700721
  4. 4. van Oijen A. Cutting the forest to see a single tree? Nature chemical biology. 2008;4:440–3. pmid:18641617
  5. 5. Yakovlieva L, Walvoort M. Processivity in Bacterial Glycosyltransferases. ACS Chem Biol. 2020;15(1):3–16. pmid:31750644
  6. 6. T I Jr, G RL Jr. Biological mechanisms, one molecule at a time. Genes Dev. 2011;25(12):1205–1231.
  7. 7. Lai WC, Ermolenko DN. Ensemble and single-molecule FRET studies of protein synthesis. Methods. 2018;137:37–48. pmid:29247758
  8. 8. Haase N, Holtkamp W, Lipowsky R, Rodnina MV, Rudorf S. Decomposition of time-dependent fluorescence signals reveals codon-specific kinetics of protein synthesis. Nucleic Acids Research. 2018;46(22).
  9. 9. Gutenkunst RN, Waterfall JJ, Casey FP, Brown KS, Myers CR, Sethna JP. Universally sloppy parameter sensitivities in systems biology models. PLoS Comput Biol. 2007;3(10). pmid:17922568
  10. 10. Lillacci G, Khammash M. Parameter estimation and model selection in computational biology. PLoS Comput Biol. 2010;6(3):e1000696. pmid:20221262
  11. 11. DeVree BT, Steiner LM, Głazowska S, Ruhnow F, Herburger K, Persson S, et al. Current and future advances in fluorescence-based visualization of plant cell wall components and cell wall biosynthetic machineries. Biotechnology for Biofuels. 2021;14(1):78. pmid:33781321
  12. 12. Lelek M, Gyparaki MT, Beliu G, Schueder F, Griffié J, Manley S, et al. Single-molecule localization microscopy. Nature Reviews Methods Primers. 2021;1(39). pmid:35663461
  13. 13. Gong P, Campagnola G, Peersen OB. A quantitative stopped-flow fluorescence assay for measuring polymerase elongation rates. Anal Biochem. 2009;391(1):45–55. pmid:19406094
  14. 14. Choi J, Puglisi J. Three tRNAs on the ribosome slow translation elongation. Proceedings of the National Academy of Sciences. 2017;114(52):13691–13696. pmid:29229848
  15. 15. Prabhakar A, Krahn N, Zhang J, Vargas-Rodriguez O, Krupkin M, Fu Z, et al. Uncovering translation roadblocks during the development of a synthetic tRNA. Nucleic Acids Res. 2022;50(18):10201–10211. pmid:35882385
  16. 16. Sarkar I, Mishra AK. Fluorophore tagged bio-molecules and their applications: A brief review. Applied Spectroscopy Reviews. 2018;53(7):552–601.
  17. 17. Wang HY, Yang H, Holm M, Tom H, Oltion K, Al-Khdhairawi AAQ, et al. Synthesis and single-molecule imaging reveal stereospecific enhancement of binding kinetics by the antitumour eEF1A antagonist SR-A3. Nature Chemistry. 2022;14(12):1443–1450. pmid:36123449
  18. 18. Asher WB, Geggier P, Holsey MD, Gilmore GT, Pati AV, Meszaros J, et al. Single-molecule FRET imaging of GPCR dimers in living cells. Nature Methods. 2021;18(4):397–405. pmid:33686301
  19. 19. An E, Friend K. mTORC1 Enhances Early Phase Ribosome Processivity. Frontiers in Molecular Biosciences. 2020;7. pmid:32656229
  20. 20. Sin C, Chiarugi D, Valleriani A. Quantitative assessment of ribosome drop-off in E. coli. Nucleic acids research. 2016;44(6):2528–2537. pmid:26935582
  21. 21. Mustafi M, Weisshaar JC. Simultaneous Binding of Multiple EF-Tu Copies to Translating Ribosomes in Live Escherichia coli. mBio. 2018;9(1):e02143–17. pmid:29339430
  22. 22. Bakshi S, Siryaporn A, Goulian M, Weisshaar JC. Superresolution imaging of ribosomes and RNA polymerase in live Escherichia coli cells. Mol Microbiol. 2012;85(1):21–38. pmid:22624875
  23. 23. Metelev M, Lundin E, Volkov IL, Gynnå AH, Elf J, Johansson M. Direct measurements of mRNA translation kinetics in living cells. Nature Communications. 2022;13(1):1852. pmid:35388013
  24. 24. Tanenbaum ME, Gilbert LA, Qi LS, Weissman JS, Vale RD. A protein tagging system for signal amplification in gene expression and fluorescence imaging. Cell. 2015;159(3).
  25. 25. Morisaki T, Stasevich TJ. Quantifying Single mRNA Translation Kinetics in Living Cells. Cold Spring Harb Perspect Biol. 2018;10(11):a032078. pmid:30385605
  26. 26. Wang C, Han B, Zhou R, Zhuang X. Real-Time Imaging of Translation on Single mRNA Transcripts in Live Cells. Cell. 2016;165(4):990–1001. pmid:27153499
  27. 27. Hansen C. Rank-Deficient and Discrete Ill-Posed Problems: Numerical Aspects of Linear Inversion (Monographs on Mathematical Modeling and Computation). Society for Industrial and Applied Mathematics; 1998.
  28. 28. Golub GH, Hansen C, O’Leary DP. Tikhonov Regularization And Total Least Squares. SIAM Journal on Matrix Analysis and Applications. 1999;21(1):185–194.
  29. 29. Hochstenbach ME, Reichel L. An iterative method for Tikhonov regularization with a general linear regularization operator. Journal of Integral Equations and Applications. 2010;22(3).
  30. 30. Regińska T. A Regularization Parameter in Discrete Ill-Posed Problems. Siam Journal on Scientific Computing. 1996;17(3):740–749.
  31. 31. Arpat AB, Liechti A, Matos MD, Dreos R, Janich P, Gatfield D. Transcriptome-wide sites of collided ribosomes reveal principles of translational pausing. Genome Res. 2020;30(7):985–999. pmid:32703885
  32. 32. Frank J, Sengupta J, Gao H, Li W, Valle M, Zavialov A, et al. The role of tRNA as a molecular spring in decoding, accommodation, and peptidyl transfer. FEBS Lett. 2005;579(4):959–62. pmid:15680982
  33. 33. v d Haar T. MATHEMATICAL AND COMPUTATIONAL MODELLING OF RIBOSOMAL MOVEMENT AND PROTEIN SYNTHESIS: AN OVERVIEW. Computational and Structural Biotechnology Journal. 2012;1(1).
  34. 34. Nissley D, O’Brien E. Timing Is Everything: Unifying Codon Translation Rates and Nascent Proteome Behavior. Journal of the American Chemical Society. 2014;136(52). pmid:25486504
  35. 35. Dykeman EC. A stochastic model for simulating ribosome kinetics in vivo. PLoS Comput Biol. 2020;16(2):e1007618. pmid:32049979
  36. 36. Dykeman EC. Modelling ribosome kinetics and translational control on dynamic mRNA. PLoS Comput Biol. 2023;19(2):e1010870. pmid:36689464
  37. 37. Rudorf S, Thommen M, Rodnina MV, Lipowsky R. Deducing the Kinetics of Protein Synthesis In Vivo from the Transition Rates Measured In Vitro. PLoS Computational Biology. 2014;10(10):e1003909. pmid:25358034
  38. 38. Rudorf S, Lipowsky R. Protein Synthesis in E. coli: Dependence of Codon-Specific Elongation on tRNA Concentration and Codon Usage. PLOS ONE. 2015;10(8):1–22.
  39. 39. Mittelstaet J, Konevega AL, Rodnina MV. A Kinetic Safety Gate Controlling the Delivery of Unnatural Amino Acids to the Ribosome. Journal of the American Chemical Society. 2013;135(45):17031–17038. pmid:24079513
  40. 40. Fischer N, Konevega AL, Wintermeyer W, Rodnina MV, Stark H. Ribosome dynamics and tRNA movement by time-resolved electron cryomicroscopy. Nature. 2010;466:329–333. pmid:20631791
  41. 41. Kothe U, Rodnina MV. Delayed Release of Inorganic Phosphate from Elongation Factor Tu Following GTP Hydrolysis on the Ribosome. Biochemistry. 2006;45(42):12767–12774. pmid:17042495
  42. 42. Gromadski KB, Daviter T, Rodnina MV. A Uniform Response to Mismatches in Codon-Anticodon Complexes Ensures Ribosomal Fidelity. Molecular cell. 2006;21(3):369–77. pmid:16455492
  43. 43. Gromadski KB, Rodnina MV. Kinetic Determinants of High-Fidelity tRNA Discrimination on the Ribosome. Molecular cell. 2004;13(2):191–200. pmid:14759365
  44. 44. Pape T, Wintermeyer W, Rodnina MV. Complete kinetic mechanism of elongation factor Tu-dependent binding of aminoacyl-tRNA to the A site of the E.coli ribosome. The EMBO Journal. 1998;17(24):7490–7497. pmid:9857203
  45. 45. Rodnina MV, Pape T, Fricke R, Kuhn L, Wintermeyer W. Initial Binding of the Elongation Factor Tu-GTP-Aminoacyl-tRNA Complex Preceding Codon Recognition on the Ribosome. Journal of Biological Chemistry. 1996;271(2):646–652.
  46. 46. Taylor HM, Karlin S. An introduction to stochastic modeling. Academic Press; 1985.
  47. 47. Samatova E, Konevega AL, Wills NM, Atkins JF, Rodnina MV. High-efficiency translational bypassing of non-coding nucleotides specified by mRNA structure and nascent peptide. Nat Commun. 2014;5 (4459).
  48. 48. Rodnina MV, Wintermeyer W. GTP consumption of elongation factor Tu during translation of heteropolymeric mRNAs. PNAS. 1995;92(6):1945–1949. pmid:7892205
  49. 49. Bezanson J, Edelman A, Karpinski S, Shah VB. Julia: A Fresh Approach to Numerical Computing. SIAM Review. 2017;59(1):65–98.
  50. 50. Christ S, Schwabeneder D, Rackauckas C, Borregaard MK, Breloff T. Plots.Jl—A User Extendable Plotting API for the Julia Programming Language. Ubiquity Press. 2023;11(1):5.