Introduction

Biological cells continuously process environmental cues, allowing them to make critical decisions quickly, e.g. whether to move or stay, whether to express a certain protein, or whether to divide1. These decisions are generally made based on the level of one or more key proteins, which constitute the internal representation of the extracellular stimulus. The higher the amount of environmental information encoded in this intracellular representation, the more reliable the response. However, cell-external and internal noise may reduce this reliability. Hence, a biological system under evolutionary pressure is expected to evolve to optimally transmit information under biologically relevant constraints2, at least when information is a limiting factor3,4,5.

To formalize this optimization problem, cells can be considered input-output devices, where extracellular stimulus concentrations are the input, receptors (or the entire pathway) are the communication channel, and the intracellular concentration of a key protein (or the final behavior of the cell) is the output. In contrast to engineered physical systems, the distinction between input, channel, and output is not always clear in biological systems with feedback. Take for instance Escherichia coli chemotaxis, a well-characterized pathway allowing bacteria to sense chemicals and to swim towards nutrients6,7. The intracellular level of the phosphorylated protein CheY (CheYp) represents the extracellular concentration of a chemical and regulates the motors (clockwise or counterclockwise rotation) and hence motility (‘run’ or ‘tumble’) (Fig. 1). The swimming behavior clearly affects the input as cells change their location, making information flow a circular problem. Hence, the question emerges of how to tackle such problems.

Figure 1

Connection between environmental cues and chemotactic response. Chemotactic bacteria live in complex microenvironments in which the input distributions of chemical concentrations are shaped by the swimming behavior (top left), chemical sources and sinks (top middle), and competition with other bacteria (top right). Inputs are processed by the cell-internal chemotaxis pathway, which can be viewed as an input-output device (bottom). Specifically, input-output curves are measured experimentally as noisy dose-response curves. The resulting final behavior feeds back into the environment. Evolution is assumed to select the input-output curves that maximize fitness. The chemotaxis pathway is a two-component system and, for modeling purposes, is divided into two information-transmission channels: receptors sense the external concentration of stimuli, and their activity regulates the protein CheYp (receptor channel, bottom left). CheYp is the internal representation of the external stimulus and regulates motor switching (clockwise or counterclockwise rotation) and thus bacterial motility (straight swimming via a ‘run’ or random reorientation via a ‘tumble’; motility channel, bottom right). Note that there is additional adaptation both at the receptors75 and at the motors52.

Shannon’s mutual information is generally used to quantify information transmission, capturing the statistical (linear and nonlinear) dependencies between inputs and outputs8. Specifically, the mutual information describes the average ability to reconstruct the input distribution after repeatedly measuring the output8,9. Often, maximal mutual information is assumed, either to reflect biological function or because the mutual information cannot be calculated otherwise10,11. How should the mutual information be maximized? Maximizing with respect to the (generally unknown) distribution of inputs leads to the channel capacity. Such an approach was used, e.g., to study transcriptional regulation in the developing fruit-fly embryo12,13. While in this case the mother organism may be able to tune its maternal factors to the optimal input distribution to match the ‘expectation’ of the embryo, the channel capacity may not generally be a valid approach. Alternatively, it is possible to assume a fixed input distribution and to maximize the mutual information with respect to the input-output curve14,15. The underlying idea is that the input-output curve is adjusted by evolution to match the distribution of inputs. This approach might apply to organisms confined to certain environments, e.g. bacteria living in specific niches15. In addition to these two optimization procedures (and some attempts to combine them for specific types of input-output curves16,17,18), the Fisher information from estimation theory may also be used to predict the input distribution19,20. Hence, with multiple approaches available, there is considerable uncertainty as to which optimization procedure to use.

Even the principle of maximal information transmission may be questioned. Recently, maximizing information transmission at the E. coli receptors was shown to lead to maximal drift of cells swimming up a chemical gradient and hence to optimal chemotactic behavior15. Although a reasonable result, there are two potential problems when considering the whole pathway. Firstly, the dose-response curves of the motors are measured to have Hill coefficients of up to 20 (ref. 21). Hence, such switch-like motors may only transmit about one bit of information, distinguishing only low and high levels of CheYp. Secondly, the CheYp level that maximizes the drift is not in the sensitive region of the motor22,23,24. Both issues may lead one to suggest that high information transmission at the receptors is wasted downstream at the motors, and that an entirely different principle may guide cell behavior25,26.

Here, we address two key questions: (1) How should the mutual information be calculated in a biological context, and (2) does the bacterial chemotaxis pathway maximize information transmission? Specifically, we reconcile the various optimization procedures, leading to a new way of maximizing the mutual information that is particularly useful for biological systems. Assuming general external and internal noise, and a fixed range of sensitivity (as any biological or physical system is necessarily limited), we analytically derive both the optimal input distribution and the optimal input-output curve. Unlike previous approaches16,17,18, this general solution does not assume specific input-output curves, such as Hill functions. Surprisingly, we find a universal optimal input distribution, dependent only on the input noise. Furthermore, numerically and with the help of simulations, we extend our formalism to multiple outputs (or inputs), greatly broadening its applicability to biological systems. As an illustrative example, we focus on the E. coli chemotaxis pathway, using previously estimated noise and measured dose-response curves at the receptors and the motors. By deriving analytical results for multiple output motors, we show that although the optimal response is not a Hill function, the measured Hill coefficients naturally emerge from our optimal prediction. Overall, our results confirm the idea of maximal information transmission in E. coli chemotaxis and demonstrate that maximal information transmission at the receptors is critical for the whole pathway despite the ultrasteep motor dose-response curves.

Results

Maximizing mutual information: comparison of different approaches

Information transmission between an input, X, and an output, Y, is often quantified by the mutual information, a measure of statistical dependency that reflects the average ability to infer the input after measurements of the output (see refs 8,9,10,11,27,28,29 for extended reviews). For continuous random variables X and Y, the mutual information is defined by

$$\mathcal{I}[X,Y] := \int \mathrm{d}y\,\mathrm{d}x\; p(y|x)\,p(x)\,\log_2\!\left[\frac{p(y|x)}{p(y)}\right],$$
(1)

where p(y|x) is the conditional probability of observing Y = y given X = x, encoding the input-output curve and the noise. The quantities p(x) and p(y) represent the input and output distributions, respectively, which are mathematically connected by the conservation of probability, p(x)dx = p(y)dy, valid in the small-noise limit. Considering small Gaussian noise with conditional probability \(p(y|x)=\exp[-(y-\bar{y})^{2}/(2\sigma_{\mathrm{T}}^{2})]/\sqrt{2\pi\sigma_{\mathrm{T}}^{2}}\), with mean \(\bar{y}(x)\) and standard deviation \(\sigma_{\mathrm{T}}(x)\), Eq. (1) becomes

$$\mathcal{I}[X,Y]=-\int_{x_{\mathrm{on}}}^{x_{\mathrm{off}}}\mathrm{d}x\,p(x)\,\log_2\!\left[\frac{\sqrt{2\pi e}\,\sigma_{\mathrm{T}}(x)}{\bar{y}'(x)}\,p(x)\right],$$
(2)

where xon and xoff delimit the sensitive region, i.e. the dynamic range of inputs14,15 (similar equations appear in refs 12,13,20). In addition to p(x), Eq. (2) depends on the gain \(\bar{y}'(x)\), i.e. the first derivative of the input-output curve \(\bar{y}(x)\), and on the total noise \(\sigma_{\mathrm{T}}(x)\).
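
For concreteness, Eq. (2) can be evaluated numerically on a grid over the sensitive region. The following Python sketch uses a hypothetical increasing Hill curve, a flat input distribution, and uniform noise; all parameter values are illustrative only:

```python
import numpy as np

def mutual_information(x, p_x, y_bar, sigma_T):
    """Discretized Eq. (2): small-noise Gaussian mutual information in bits."""
    gain = np.abs(np.gradient(y_bar, x))          # |y_bar'(x)|, the gain
    integrand = p_x * np.log2(np.sqrt(2 * np.pi * np.e) * sigma_T / gain * p_x)
    return -np.trapz(integrand, x)                # note the overall minus sign

# Illustration: increasing Hill curve, flat inputs, uniform total noise
x = np.linspace(0.115, 0.323, 2000)               # sensitive region [x_on, x_off]
p_x = np.ones_like(x) / (x[-1] - x[0])            # normalized flat input distribution
y_bar = x**5 / (0.2**5 + x**5)                    # hypothetical Hill curve, n_H = 5
sigma_T = 1e-2 * np.ones_like(x)                  # constant sigma_T
print(f"I[X,Y] = {mutual_information(x, p_x, y_bar, sigma_T):.2f} bits")
```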

To understand whether biological systems maximize information flow, we need to maximize the mutual information and then either derive general principles or compare with data. One common route is the channel capacity, i.e. the mutual information maximized with respect to the input distribution12,13. Alternatively, the mutual information can be maximized with respect to the input-output curve, assuming a fixed input distribution14,15. The former method attempts to deal with the often unknown input distribution, while the latter is based on the idea that the biological channel can be modified by evolution. Additionally, the two approaches were combined for specific Hill and Hill-like input-output functions16,17,18. However, is there a general way to unify the different methods without making assumptions about the functional form of the input-output curve?

Formally, we maximize the mutual information, Eq. (2), with respect to p(x) and \(\bar{y}(x)\) by writing

$$\max_{p(x)}\,\mathcal{I}[X,Y]\;\to\;\frac{\partial\mathcal{L}}{\partial p}=0,$$
(3)
$$\max_{\bar{y}(x)}\,\mathcal{I}[X,Y]\;\to\;\frac{\partial\mathcal{L}}{\partial\bar{y}}-\frac{\mathrm{d}}{\mathrm{d}x}\frac{\partial\mathcal{L}}{\partial\bar{y}'}=0,$$
(4)

where the right-hand sides of Eqs. (3) and (4) are the Euler-Lagrange equations from the calculus of variations with Lagrangian \(\mathcal{L}(x,p,\bar{y},\bar{y}')=p\,\log_2[\frac{\sqrt{2\pi e}\,\sigma_{\mathrm{T}}}{\bar{y}'}\,p]\), i.e. the integrand of Eq. (2). Equation (3) represents the channel capacity applied to a Gaussian channel (i.e. Gaussian conditional probability p(y|x); see refs 12,13 for examples in gene regulation). In contrast, Eq. (4) yields the optimal input-output curve for a given input distribution (see refs 14,15 for examples in sensory systems). For completeness, we provide the solutions of the individual maximizations of Eqs. (3) and (4) in S1 Text, Secs. 1.1–1.2, with a discussion of the sensitive region in Sec. 1.8.

When the noise is uniform (σT constant), Eqs. (3) and (4) coincide. Specifically, \(\frac{\partial\mathcal{L}}{\partial\bar{y}}=0\) in Eq. (4), so that \(\bar{y}\) is a cyclic variable. As a result, \(\frac{\mathrm{d}}{\mathrm{d}x}\frac{\partial\mathcal{L}}{\partial\bar{y}'}=0\), so that \(p/\bar{y}'=\mathrm{const}\) is conserved (not in time but across input space), following Noether’s theorem. In this case, maximizing the mutual information leads to a simple matching relationship, \(\bar{y}'\propto p\), so that the input-output curve is the cumulative integral of the input distribution (see S1 Text, Sec. 1.1 and Fig. S1)30. In general, however, when both input and output noise matter, the noise is a function of the input and of the input-output curve, \(\sigma_{\mathrm{T}}=\sigma_{\mathrm{T}}(x,\bar{y},\bar{y}')\). Assuming independent cell-external and internal noise, we consider

$$\sigma_{\mathrm{T}}(x,\bar{y},\bar{y}')=\sqrt{\sigma_x^{2}\,\bar{y}'^{2}+\sigma_y^{2}},$$
(5)

which follows from error propagation. Specifically, σx is the input noise, which depends on x only and is amplified by the gain \(\bar{y}'(x)\), and σy is the output noise, which depends on \(\bar{y}(x)\) only. In the case of negligible input (σx ≈ 0) or output (σy ≈ 0) noise, Eq. (3) again converges to Eq. (4), and the system can be solved for any input-output curve \(\bar{y}(x)\) (see S1 Text, Secs. 1.2–1.3). As a result, the input and output distributions predicted by the two optimization approaches become identical (Fig. 2A). In general, however, the two equations differ, and the resulting optimal input and output distributions are very different in the two approaches (Fig. 2B). In particular, the output distributions can be uni- or bimodal, with details described in S1 Text, Sec. 1.3.
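
In the uniform-noise limit discussed above, the matching relationship can be implemented directly: the optimal input-output curve is the normalized cumulative integral of the input distribution. A minimal sketch, with a hypothetical input distribution:

```python
import numpy as np
from scipy.integrate import cumulative_trapezoid

def matched_curve(x, p_x):
    """Uniform-noise matching relationship: y_bar' is proportional to p(x), so
    the optimal input-output curve is the normalized cumulative integral of p."""
    y_bar = cumulative_trapezoid(p_x, x, initial=0.0)
    return y_bar / y_bar[-1]                      # outputs span [0, 1]

x = np.linspace(0.01, 1.0, 1000)
p_x = np.exp(-0.5 * (np.log(x) + 2.0)**2)         # hypothetical input distribution
p_x /= np.trapz(p_x, x)                           # normalize
y_bar = matched_curve(x, p_x)                     # steepest where inputs are most likely
```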

Figure 2

Conventional ways of maximizing mutual information. Comparison of the channel capacity, i.e. solving the maximization problem Eq. (3) for p(x) using a fixed \(\bar{y}(x)\) and noise (red), and maximization with respect to the input-output curve, i.e. solving the maximization problem Eq. (4) for \(\bar{y}(x)\) using a fixed p(x) and noise (blue). (For specific examples of the solutions of Eqs. (3) and (4) and a discussion of the bimodality of the output distribution, see S1 Text, Sec. 1.) In both cases, the noise is provided as a function of the input distribution and the input-output curve (top row), with the input-output curve \(\bar{y}(x)=[1+{(x/k_d)}^{n_H}]^{-1}\) assumed to be a Hill function for simplicity, with Hill coefficient nH and threshold kd (inset). (A) For small input noise, the two approaches converge, i.e. the red and blue input (middle left) and output (bottom left) distributions match. (B) For large input noise, the two approaches predict different input (middle right) and output (bottom right) distributions. Using Eq. (5) for the noise, the input noise is \(\sigma_x^2=\alpha_1 x\), while the output noise \(\sigma_y^2=\alpha_2\bar{y}(1-\bar{y})+\alpha_3\bar{y}+\alpha_4\) has three different contributions, with \(\bar{y}\) the input-output curve. The parameters are chosen to provide an overall similar level of noise, given by \(\alpha_1=10^{-8}\), \(\alpha_2=2\cdot 10^{-6}\), \(\alpha_3=10^{-7}\), \(\alpha_4=10^{-8}\) (panel A) and \(\alpha_1=10^{-7}\), \(\alpha_2=10^{-8}\), \(\alpha_3=10^{-7}\), \(\alpha_4=10^{-8}\) (panel B). For the red model, nH = 5 and kd = 0.1. For the blue model, the input distribution is fixed by normalizing the derivative of a Hill function with nH = 5 and kd = 0.1, with the input-output curve free to change according to the maximization. The sensitive region is set by \(x_{\mathrm{on}}=0\) and \(x_{\mathrm{off}}=+\infty\).

It is worth noting that in Bayesian statistics the Fisher information is linked to the channel capacity19,20. A key problem in Bayesian statistics is choosing a prior distribution for a given stochastic process (i.e. p(y|x))31. The idea of a prior that does not affect the posterior distribution (i.e. p(x|y)) is linked to maximal mutual information, given by the average Kullback-Leibler divergence between prior and posterior distributions. This prior is called the reference prior19, given by \(p(x)\propto\sqrt{\mathcal{F}(x)}\) with Fisher information \(\mathcal{F}(x)=\int\mathrm{d}y\,p(y|x)\,{(\partial\log p(y|x)/\partial x)}^{2}\). As shown in ref. 20, the Fisher information results from maximizing the equivalent of Eq. (2) for a general (not necessarily Gaussian) conditional probability distribution (see S1 Text, Sec. 2). Hence, the channel capacity and the approach based on the Fisher information are equivalent.
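
For a Gaussian channel whose noise does not depend on x, the Fisher information reduces to \(\mathcal{F}(x)=\bar{y}'(x)^2/\sigma^2\), so the reference prior can be sketched directly (the curve and parameters below are illustrative assumptions):

```python
import numpy as np

def reference_prior(x, y_bar, sigma):
    """Reference prior p(x) proportional to sqrt(F(x)) for a Gaussian channel
    with constant noise sigma, where F(x) = (y_bar'(x)/sigma)^2."""
    p = np.abs(np.gradient(y_bar, x)) / sigma     # square root of Fisher information
    return p / np.trapz(p, x)

x = np.linspace(0.01, 1.0, 1000)
y_bar = x**5 / (0.2**5 + x**5)                    # hypothetical Hill curve
p_ref = reference_prior(x, y_bar, sigma=0.01)     # peaks where the gain is largest
```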

Maximizing mutual information: a new approach

The difference between Eqs. (3) and (4) is that Eq. (3) assumes a fixed input-output curve and a variable input distribution, while Eq. (4) assumes a fixed input distribution and a variable input-output curve. There might be situations in which one approach is more appropriate than the other, but in a general biological context the two are intrinsically connected (Fig. 1). From a mathematical point of view, Eqs. (3) and (4) can be combined and solved together, i.e. \(\mathcal{I}[X,Y]\) maximized with respect to both p(x) and \(\bar{y}(x)\). Similar numerical double optimizations are common in rate-distortion theory using, e.g., the Blahut algorithm4,32.

In what follows, we provide the analytical solution for p(x) and \(\bar{y}(x)\) by solving Eqs. (3) and (4) together. We assume a fixed dynamic range of inputs set by xon and xoff (given by the receptor sensitivity), leading in turn to a fixed dynamic range of outputs from \(\bar{y}(x_{\mathrm{on}})=0\) to \(\bar{y}(x_{\mathrm{off}})=1\). We consider Eq. (2) with the noise given by Eq. (5). After a first integration, we obtain the formal solution

$$p=\frac{\bar{y}'}{Z\,\sigma_{\mathrm{T}}},$$
(6a)
$$\sigma_x^{2}\,\bar{y}'^{2}=Q\,\sigma_y^{2},$$
(6b)

where Z and Q are two constants set by normalization and boundary conditions, respectively (see Materials and Methods and S1 Text, Sec. 1.7).

Equation (6a) for the input distribution extends the matching relationship found in ref. 30 to nonuniform noise. In this case, the optimal input distribution weights reliable inputs more heavily than uncertain ones13,20,33. Equation (6b) determines the input-output curve that maximizes the mutual information given the noise. A solution of the system of equations exists if the transmitted input noise can be expressed in terms of the output noise, or vice versa (see S1 Text, Sec. 1.7). While such a solution may seem very specific, it is certainly plausible that, given enough time, evolution eventually finds it.

How may evolution find this solution? To mimic evolution, we envision an adaptive algorithm, allowing the pathway to iteratively reach optimal information transmission. Given an environment, and hence a distribution of inputs p1(x) (Fig. 3A), evolution selects the optimal internal input-output curve y1(x) (Fig. 3B). However, the distribution of inputs is susceptible to change, e.g. through a change of the organism’s behavior, even in the same environment. The new input distribution, p2(x), may again lead to an increase in information transmission at fixed input-output curve y1(x) (Fig. 3C). Subsequently, evolution selects a new input-output curve, y2(x), which enhances information transmission at fixed input distribution p2(x) (Fig. 3D). This cycle is repeated many times. If the optimal configuration is achievable and information transmission is a proxy for fitness, we expect the solution of Eqs. (6a) and (6b) to emerge naturally in the pathway. This is indeed the case for the examples studied here (see Fig. 3E,F).
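
The cycle can be mimicked numerically. In the sketch below (all noise parameters hypothetical), the ‘behavioral’ step applies Eq. (6a) at fixed input-output curve, while the ‘evolutionary’ step maximizes the discretized Eq. (2) at fixed input distribution over monotone curves obeying \(\bar{y}(x_{\mathrm{on}})=0\) and \(\bar{y}(x_{\mathrm{off}})=1\); each step cannot decrease the mutual information:

```python
import numpy as np
from scipy.optimize import minimize

X_ON, X_OFF = 0.115, 0.323
A1, A2, A3, A4 = 1e-7, 1e-7, 1e-7, 1e-8      # hypothetical noise parameters
x = np.linspace(X_ON, X_OFF, 150)

def curve(theta):
    """Monotone curve with y(x_on) = 0 and y(x_off) = 1, built from positive increments."""
    y = np.concatenate([[0.0], np.cumsum(np.exp(theta))])
    return y / y[-1]

def sigma_T(y, gain):
    """Total noise, Eq. (5), with Berg-Purcell input noise and three-term output noise."""
    return np.sqrt(A1 * x * gain**2 + A2 * y * (1 - y) + A3 * y + A4)

def info(p, theta):
    """Discretized Eq. (2) in bits."""
    y = curve(theta)
    gain = np.gradient(y, x)
    return -np.trapz(p * np.log2(np.sqrt(2*np.pi*np.e) * sigma_T(y, gain) / gain * p), x)

theta = np.zeros(len(x) - 1)                  # start from a linear curve
p = np.ones_like(x) / (X_OFF - X_ON)          # and a flat input distribution
for cycle in range(5):
    # 'behavioral' step: optimal input distribution at fixed curve, Eq. (6a)
    y = curve(theta)
    gain = np.gradient(y, x)
    p = gain / sigma_T(y, gain)
    p /= np.trapz(p, x)
    # 'evolutionary' step: optimal curve at fixed input distribution, Eq. (4)
    theta = minimize(lambda t: -info(p, t), theta, method="L-BFGS-B").x
    print(f"cycle {cycle + 1}: I = {info(p, theta):.3f} bits")
```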

Figure 3

Adaptive evolutionary algorithm approaches the analytical result. (A) Bacteria are assumed to live in a given environment and experience a given distribution of inputs. (B) Assuming that maximal information transmission enhances the chance of survival, evolution selects the mutations, and hence the phenotype, with the optimal input-output curve. (C) At a given input-output curve, the optimal input distribution (dark blue) generally differs from the initial input distribution (light blue) in (A). A change in behavior (and hence a change in the inputs) may provide an increase in information transmission (see Discussion for more details). (D) The input-output curve changes again to maximize information transmission for the new set of stimuli. (E,F) This iterative cycle continues and eventually converges to the solution given by Eqs. (6a) and (6b) (dashed red line). Parameters: \(\alpha_{1-3}=10^{-7}\), \(\alpha_4=10^{-8}\), \(x_{\mathrm{on}}=0.115\), \(x_{\mathrm{off}}=0.323\).

Information transmission at E. coli chemoreceptors

To apply our new approach, we use the chemotaxis pathway of E. coli as an explicit example, since it is relatively simple and well characterized in its molecular components6. Briefly, chemoattractant (ligand) binding turns receptors off, inhibits the kinase CheA, and hence reduces the phospho-transfer from CheAp to CheY. This leads to ‘runs’, as only CheYp can bind the 6–8 motors to induce ‘tumbling’. There is also an adaptation mechanism, in which the addition of methyl groups to receptors compensates for an increased attractant concentration by increasing the receptor activity, and hence the CheYp level, to induce tumbling. Removal of methyl groups has the opposite effect34,35. To study signaling in fixed adaptational states, the adaptation enzymes can be removed from the chromosome and the receptors expressed with specific, genetically engineered modification levels that mimic receptor methylation (see S1 Text, Sec. 4.3)34,35.

Specifically, we consider the instantaneous information transmission between the chemoattractant methylaspartate (MeAsp) as the input and the response regulator CheYp as the output. Hence, we consider the information transmitted by the initial (fast) response for a given adaptational state (which changes only slowly); at later times this response is removed by adaptation and is hence transient only. Note that, unlike ref. 36, we do not assume small Gaussian inputs but natural stimuli drawn from broad, potentially asymmetric input distributions p(x). When the input distributions of cells simulated in gradients of different strengths match the optimal information-theoretical input distribution for the same receptor modification levels, the drift velocity up the gradient is maximized, and hence chemotaxis is optimal15. As this matching of input distributions occurs anywhere in the gradient, these initial responses describe chemotaxis in the whole gradient and thus implicitly include adaptation. Indeed, the predicted distributions of inputs are scale invariant (when normalized by the adapted concentration) and reproduce Weber’s law and logarithmic sensing15. The latter can also be captured by the predictive mutual information37.

The functional form of the noise is assumed to be known and is derived from microscopic theory as in ref. 38 (see S1 Text, Secs. 3 and 4.2 for the noise estimation and the sensitivity to noise parameters, respectively). In short, the input variance is taken to be proportional to the input strength, \(\sigma_x^2=\alpha_1 x\), with x in units of the ligand concentration to which the cell is adapted. Furthermore, \(\alpha_1\propto (DN\tau)^{-1}\) is given by the Berg-and-Purcell limit39, where D is the diffusion constant of the ligand molecules, N is the number of receptors acting cooperatively in a cluster, and τ is the averaging time, assuming a spherical cell39. The output noise has three contributions, namely signaling noise, switching noise due to on/off changes of the receptor state, and a constant background noise, leading to \(\sigma_y^2=\alpha_2\bar{y}(1-\bar{y})+\alpha_3\bar{y}+\alpha_4\), with the phosphorylated fraction \(\bar{y}\) in units of the total CheY level, YT, an intrinsic dependence on the (unknown) input-output curve, and additional parameters \(\alpha_{2-4}\) defined in S1 Text, Sec. 3. Note that these effective noise terms are time-averaged due to their dependence on chemical reactions with finite rate constants38. Despite \(\sigma_y^2\) being specific, this noise should apply to many receptor-signaling pathways, including other two-component pathways40.

Using this noise, the explicit solution of Eq. (6b) is

$$\bar{y}(x)=\frac{\alpha_2+\alpha_3+\sqrt{(\alpha_2+\alpha_3)^2+4\alpha_2\alpha_4}\,\sin\!\left[\sqrt{\alpha_2}\left(C+2\sqrt{\frac{Qx}{\alpha_1}}\right)\right]}{2\alpha_2},$$
(7)

where C and Q are constants set by imposing the fixed boundary conditions \(\bar{y}(x_{\mathrm{on}})=0\) and \(\bar{y}(x_{\mathrm{off}})=1\) (Fig. 4A). Note that x appears only with the prefactor Q/α1, so that Q and α1 enter through this ratio alone; α1 is thus absorbed by the boundary conditions and can be seen as setting the units of x. We consider two special cases. Simplifying the output noise by setting α2 = 0, we obtain

$$\bar{y}(x)=\frac{\alpha_3^{2}C'^{2}-4\alpha_4}{4\alpha_3}+\alpha_3 C'\sqrt{\frac{Qx}{\alpha_1}}+\frac{\alpha_3 Qx}{\alpha_1},$$
(8)

where the solution is again independent of α1 after imposing the boundary conditions. For α2,3 = 0, we obtain

$$\begin{array}{rcl}\bar{y}(x) & = & 2Q\sqrt{\frac{\alpha_4}{\alpha_1}}\sqrt{x}+C''\\ & = & -\frac{\sqrt{x}}{\sqrt{x_{\mathrm{on}}}-\sqrt{x_{\mathrm{off}}}}+\frac{\sqrt{x_{\mathrm{on}}}}{\sqrt{x_{\mathrm{on}}}-\sqrt{x_{\mathrm{off}}}},\end{array}$$
(9)

which depends on neither α1 nor α4 once Q and C″ are set by the boundary conditions. The optimal input distribution, obtained by inserting Eq. (6b) into Eq. (6a), is \(p=(Z\sqrt{(1+Q)/Q}\,\sigma_x)^{-1}\). In particular, for our choice of external noise and fixed sensitivity range, the input distribution converges to \(p(x)=[2(\sqrt{x_{\mathrm{off}}}-\sqrt{x_{\mathrm{on}}})\sqrt{x}]^{-1}\sim 1/\sqrt{x}\), independently of \(\alpha_{1-4}\) (see S1 Text, Sec. 1.4.6 and Fig. S3). Importantly, this is a general result for Berg-and-Purcell-type input noise, and hence should be valid for many signaling pathways. This result was previously found numerically16, and such an input distribution of glutamine was suggested to optimize nitrogen sensing33.
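
These closed-form results are straightforward to verify numerically. The sketch below checks the boundary conditions of Eq. (9), the normalization of \(p(x)\sim 1/\sqrt{x}\), and the constancy of \(\sigma_x\bar{y}'\) implied by Eq. (6b) (parameter values are illustrative):

```python
import numpy as np

X_ON, X_OFF = 0.115, 0.323
x = np.linspace(X_ON, X_OFF, 1000)

# Optimal curve for alpha_2 = alpha_3 = 0, Eq. (9): linear in sqrt(x)
y = (np.sqrt(X_ON) - np.sqrt(x)) / (np.sqrt(X_ON) - np.sqrt(X_OFF))
assert np.isclose(y[0], 0.0) and np.isclose(y[-1], 1.0)    # boundary conditions

# Optimal input distribution p(x) ~ 1/sqrt(x) on the sensitive region
p = 1.0 / (2.0 * (np.sqrt(X_OFF) - np.sqrt(X_ON)) * np.sqrt(x))
print(np.trapz(p, x))                                      # ~1.0 (normalized)

# With sigma_x^2 = a1*x and sigma_y^2 = a4, Eq. (6b) demands sigma_x * y' = const
a1 = 1e-7
sx_gain = np.sqrt(a1 * x) * np.gradient(y, x)
print(np.std(sx_gain) / np.mean(sx_gain))                  # ~0 (constant)
```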

Figure 4

Robustness of the adaptive algorithm. Comparison between our analytical result (blue lines) and the corresponding result when the input-output curve is constrained to a Hill function with adjustable Hill coefficient nH (red lines). (A,B) Input distributions (A) and input-output curves (B). (C) Comparison between the adaptive algorithm for the unconstrained (blue lines) and Hill-function-constrained (red solid line) optimizations. The adaptive algorithm moves the input-output curves to the optimal value nH ≈ 5, steadily increasing the mutual information. The convergence towards the analytical solution is robust to different initial Hill coefficients (dashed blue lines). The optimal Hill coefficient is compatible with the corresponding experimental curve from ref. 35 (see also Fig. S9). When constraining the input-output curve to a Hill function, the highest mutual information at a given Hill coefficient is shown by the red dashed line. Noise parameters: \(\alpha_{1-3}=10^{-4}\), \(\alpha_4=10^{-5}\), \(x_{\mathrm{on}}=0.115\), and \(x_{\mathrm{off}}=0.323\) (see S1 Text, Sec. 3 for details of the noise).

Extracting the sensitive regime of E. coli receptors for a fixed modification level (which resembles the receptor methylation level; see S1 Text, Sec. 4.3)35,41, we test the convergence of the adaptive algorithm to the solution in Eq. (7). After a few iterative cycles the system indeed converges (Fig. 4A), increasing the mutual information at each step (Fig. 4C, blue line). This convergence to the analytical solution occurs from different initial conditions, demonstrating the robustness of our algorithm. Note that in Fig. 4C the solution \(\bar{y}(x)\) is fitted to Hill functions for convenience of presentation, allowing the mutual information to be plotted as a function of a single parameter (the Hill coefficient n). The optimal curve selects a Hill coefficient compatible with the experimental measurements from FRET data, at least for larger receptor modification levels (Fig. S9)35. Applying the adaptive algorithm instead to Hill-function-constrained input-output curves produces the same optimal Hill coefficient n, albeit with a smaller mutual information (Fig. 4C, red solid line; Fig. 4B compares the corresponding optimal input distributions and input-output curves). Note that for a fixed Hill function the optimal mutual information is calculated directly using the input distribution from Eq. (6a), resulting in

$$\mathcal{I}[X,Y]=\log_2\!\left[\frac{Z}{\sqrt{2\pi e}}\right]\quad\mathrm{with}\quad Z=\int_{x_{\mathrm{on}}}^{x_{\mathrm{off}}}\mathrm{d}x\,\frac{\bar{y}'(x)}{\sigma_{\mathrm{T}}(x)},$$
(10)

as shown in Fig. 4C (red dashed line).
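
Equation (10) also makes the Hill-constrained optimization (red dashed line in Fig. 4C) easy to reproduce in outline: for each Hill coefficient, insert the Hill curve and the noise into Z and take the logarithm. A sketch with illustrative parameter values (the threshold 0.189 is borrowed from Fig. 5):

```python
import numpy as np

def info_hill(n_H, k_d=0.189, x_on=0.115, x_off=0.323,
              a1=1e-4, a2=1e-4, a3=1e-4, a4=1e-5):
    """Eq. (10) for a Hill-constrained curve with the optimal p(x) of Eq. (6a)."""
    x = np.linspace(x_on, x_off, 5000)
    y = x**n_H / (k_d**n_H + x**n_H)              # increasing Hill curve
    gain = np.gradient(y, x)
    sigma = np.sqrt(a1 * x * gain**2 + a2 * y * (1 - y) + a3 * y + a4)
    Z = np.trapz(gain / sigma, x)                 # Z from Eq. (10)
    return np.log2(Z / np.sqrt(2 * np.pi * np.e))

for n in (2, 5, 10, 20):                          # scan Hill coefficients
    print(f"n_H = {n:2d}: I = {info_hill(n):.2f} bits")
```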

However, experimental dose-response curves of CheYp are thought to be well approximated by Hill functions rather than by the sine function in Eq. (7)34. There are several possible reasons for this discrepancy. For instance, in our model receptors are either fully sensitive or fully insensitive, and the solution given by Eq. (7) is only valid in the sensitive region and constant otherwise. In reality, receptors have a smooth sensitivity curve spanning the ligand-dissociation constants of the off and on states (such as dF/dlog(x) in ref. 41, where F is the receptor free-energy difference between the on and off states). This may lead to a smooth, Hill-function-like response. One way of imposing smooth input-output curves is to introduce the additional constraint of zero first derivatives at the boundaries. In this case, however, we only obtain sigmoidal input-output curves in the absence of internal switching noise (α3 = 0) (see S1 Text, Sec. 1.4.4 and Fig. S3E). Moreover, very asymmetric (or even bimodal) input distributions might be uncommon in natural environments15, potentially favoring log-normal input distributions and hence Hill-function-like responses42. Finally, E. coli needs to account for many other constraints, and the pathway performs other tasks at the same time, such as sensing temperature and pH43,44,45. Hence, the E. coli sensory system might be in a suboptimal configuration for transmitting information about chemicals in order to accommodate all the other tasks. In S1 Text, Sec. 1.4.5, we also solve the inverse problem and derive the optimal noise that leads to an exact Hill function (see Fig. S4); in this case, the predicted input and output noises are no longer independent. In summary, the mismatch between the experimental Hill functions and the solution in Eq. (7) is not an artifact of our assumptions on the sources of noise; it emerges when considering independent input and output noise and when maximizing the mutual information with respect to both the input distribution and the input-output curve.

Information transmission along the E. coli chemotaxis pathway

Now that we understand the optimization of the mutual information better, we can tackle the second question: does the chemotaxis pathway maximize information transmission? Previous work suggests that the higher the information transmission at the receptors, the higher the drift velocity in the direction of the gradient15. Is this finding compatible with the recently observed ultrasensitive response of the motor to changes in internal CheYp21, or does such a steep response prevent the cell from achieving high information transmission? To answer this question, we extend our analysis to the whole chemotaxis pathway.

We consider a minimal model of two channels: a receptor channel for sensing by the chemoreceptors and a motor channel for the flagellar motors. For the receptor channel, the external chemical concentration x is the input, and the internal CheYp concentration, y, is the output. For the motor channel, y is the input and the motor clockwise (CW) bias z (for tumbling) is the output (Fig. 5A). To simplify the problem and to closely resemble the experimental dose-response curves, we now restrict the curves to Hill functions, with Hill coefficients n and m for the receptors and motors, respectively. The noise expressions for the receptor and motor channels are \(\sigma_{\mathrm{yT}}=\sqrt{\alpha_1 x G_y^2+\alpha_2\bar{y}(1-\bar{y})+\alpha_3\bar{y}+\alpha_4}\) and \(\sigma_{\mathrm{zT}}=\sqrt{\sigma_{\mathrm{yT}}^2 G_z^2+\beta_2\bar{z}(1-\bar{z})+\beta_3\bar{z}+\beta_4}\), respectively, where Gy and Gz are the gains of the receptor and motor channels. The parameters \(\beta_{2-4}\) represent the noise of the motors and are kept generic due to a lack of characterization, but may reflect analogous biological processes, including adaptation of the motors21,46,47 (see S1 Text, Sec. 3 for further details and for the robustness of the results to changes in the β values). Due to the immense gain at the motors21, the noise is generally higher at the motor than at the receptor (see S1 Text, Sec. 4.5 for a discussion of the two limits). Only n and m are considered adjustable parameters in our model.
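
To sketch how the two channels can be evaluated, each can be treated with Eq. (10) using Hill curves and the noise expressions above. In the following, all numerical values are illustrative, and the upstream CheYp noise entering the motor channel is replaced by a constant stand-in:

```python
import numpy as np

X_ON, X_OFF, KDN, KDM = 0.115, 0.323, 0.189, 0.39    # thresholds as in Fig. 5
A1, A2, A3, A4 = 1e-4, 1e-4, 1e-4, 1e-5              # receptor noise parameters
B2, B3, B4 = 7e-4, 7e-4, 1e-4                        # motor noise parameters

def channel_info(u, v, sigma):
    """Eq. (10): I = log2(Z / sqrt(2*pi*e)) with the optimal input of Eq. (6a)."""
    Z = np.trapz(np.abs(np.gradient(v, u)) / sigma, u)
    return np.log2(Z / np.sqrt(2 * np.pi * np.e))

def receptor_info(n):
    x = np.linspace(X_ON, X_OFF, 4000)
    y = x**n / (KDN**n + x**n)                       # receptor Hill curve
    G = np.gradient(y, x)
    s = np.sqrt(A1 * x * G**2 + A2 * y * (1 - y) + A3 * y + A4)
    return channel_info(x, y, s)

def motor_info(m, sigma_y=0.01):                     # sigma_y: CheYp noise stand-in
    y = np.linspace(0.01, 0.99, 4000)                # CheYp in units of Y_T
    z = y**m / (KDM**m + y**m)                       # motor Hill curve (CW bias)
    G = np.gradient(z, y)
    s = np.sqrt(sigma_y**2 * G**2 + B2 * z * (1 - z) + B3 * z + B4)
    return channel_info(y, z, s)

for n, m in [(5, 6), (5, 20)]:
    print(f"n={n}, m={m}: I[x,y]={receptor_info(n):.2f}, I[y,z]={motor_info(m):.2f} bits")
```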

Figure 5

Optimal information transmission in the E. coli chemotaxis pathway. (A) Minimal model of the E. coli chemotaxis pathway, using two concatenated channels. The extracellular stimulus concentration x is the input of the receptor channel, and the CheYp concentration, y, is the output. For the motor channel, CheYp is the input and the tumble bias z is the output. Hill functions with adjustable Hill coefficients n and m represent the input-output curves of the receptor and motor channels, respectively. The dissociation constants for the receptor and motor channels are kdn = 0.189/(1 μM) and kdm = 0.39 [YT], respectively. (B) Heat maps of the separately calculated mutual information of the receptor channel, \(\mathcal{I}[x,y]\) (left), and of the motor channel, \(\mathcal{I}[y,z]\) (right), for a single motor as a function of the Hill coefficients n and m. Vertical dashed lines correspond to the maximal value of \(\mathcal{I}[x,y]\) (circles correspond to arrows in C). (C) The mutual information as a function of the Hill coefficient m at n = 5 (dashed black lines in panel B). While for a single motor the optimal mutual information of the receptor channel (opt rec, blue solid line) is higher than the optimal mutual information at the motor (opt mot, red solid line), increasing the number of motors (see legend) enhances the optimal mutual information of the motor channel (opt 1 mot, dashed red line, for two motors). Arrows point to the predicted m values: the purple arrow indicates the optimal m for a single motor (purple circle in B), and the green and orange arrows point to the optimal m values for two motors (green and orange circles in B). The mutual information is further increased when the optimal mutual information for two motors is calculated (opt 2 mot, orange dashed line; see S1 Text, Sec. 4). Parameters: \(\alpha_{1-3}=10^{-4}\), \(\alpha_4=10^{-5}\), \(\beta_2=7\cdot 10^{-4}\), \(\beta_3=7\cdot 10^{-4}\), and \(\beta_4=10^{-4}\).

The data processing inequality, which characterizes the flow of information in a Markov chain, states that at each additional processing step information can only be lost, never gained48. For instantaneous information transmission this means that the mutual information between the external concentration and the motor bias cannot exceed the minimum of the mutual informations of the receptor and motor channels, i.e. \(\mathcal{I}[x,z]\le\min\{\mathcal{I}[x,y],\,\mathcal{I}[y,z]\}\). A strategy to increase the overall mutual information is then to maximize the limiting mutual information.

We start by considering a single motor, represented by a single output z. We calculate the maximal mutual information of the receptor (Fig. 5B, left) and motor (Fig. 5B, right) channels, treating the optimization of the two channels separately. The mutual information at the motor, \(\mathcal{I}[y,z]\), always limits the overall information transmission for any Hill coefficients n of the receptors and m of the motor. Hence, single-motor cells should optimize the motor rather than the receptor channel. This result is not unexpected for the chemotaxis pathway, since the ultrasensitive motor enhances the downstream noise, which is generally larger than the upstream noise (here the total CheYp noise is the input noise for the motor channel). The resulting optimal information transmission corresponds to relatively low n and m (≈6; red area in the right panel of Fig. 5B). Experimentally, the Hill coefficient of the receptor channel agrees with our prediction, ranging from 6 to 12 in Tar-only cells35. In contrast, the ultrasteep motor response curve with m ≈ 20 is in stark contradiction to our single-motor model. Hence, the single-motor model indicates higher information transmission at the receptors (see Figs 5, S6, S7 and S1 Text, Secs. 3 and 4.2 for the dependence on noise). However, E. coli has multiple motors, which might affect information transmission.

We now extend our model to multiple (K) motors, allowing the cell to make multiple measurements of the internal CheYp concentration. There is now a single input, y, and multiple outputs, z1, ..., zK, the CW biases. The chain rule for the mutual information allows us to calculate \(\mathcal{I}[y;z_1,\ldots,z_K]\). In the extreme case of fully coupled motors, the mutual information does not increase with growing motor number, \(\mathcal{I}[y;z_1,\ldots,z_K]=\mathcal{I}[y;z_1]\). In contrast, for completely uncoupled motors (i.e. motors that are simultaneously independent and conditionally independent given y), the mutual information increases linearly with the number of motors, \(\mathcal{I}[y;z_1,\ldots,z_K]=K\,\mathcal{I}[y;z_1]\) (see S1 Text, Sec. 4.5). Real motors show evidence of partial coupling49, and we thus assume conditionally independent motors, i.e. \(p(z_1,\ldots,z_K|y)=\prod_{i}^{K}p(z_i|y)\). This means that all motors depend on the common y level but independently select their CW bias. For two motors, the mutual information becomes \(\mathcal{I}[y;z_1,z_2]=\mathcal{I}[z_1;y]+\mathcal{I}[z_2;y|z_1]=\mathcal{I}[z_1;y]-H(z_2|y)+H(z_2|z_1)\). For K identical motors this generalizes to

$$\mathcal{I}[y;z_1,\ldots,z_K]=\mathcal{I}[z_1;y]-(K-1)\,H(z_1|y)+\sum_{i=2}^{K}H(z_i|z_{i-1},\ldots,z_1).$$
(11)

Using the small-noise Gaussian approximation for p(zi|y), the conditional entropy is given by \(H(z_1|y)=\int\mathrm{d}y\,p(y)\log(\sqrt{2\pi e}\,\sigma_{\mathrm{T}}(y))\), and \(H(z_2|z_1)\approx 0\) (Fig. 5C; see also S1 Text, Sec. 4.5). Thus, Eq. (11) becomes

$$\mathcal{I}[y;z_1,\ldots,z_K]=\mathcal{I}[z_1;y]-(K-1)\int\mathrm{d}y\,p(y)\log(\sqrt{2\pi e}\,\sigma_{\mathrm{T}}(y)).$$
(12)

We numerically tested that the conditional independence of the motors holds for two motors, despite the fact that motors compete for the binding of intracellular CheYp molecules, which can introduce negative correlations (see S1 Text, Sec. 4.6 and Fig. S10). Therefore, \(\mathcal{I}[y;z_1,z_2]>\mathcal{I}[y;z_1]\) and, more generally, \(\mathcal{I}[y;z_1,\ldots,z_K]>\mathcal{I}[y;z_1,\ldots,z_{K-1}]\) for K conditionally independent motors50.
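
A sketch of Eq. (12) for K conditionally independent, identical motors follows below. For simplicity, the upstream contribution to the motor noise is dropped, entropies are computed in bits, and all parameter values are illustrative; in the small-noise regime \(H(z_1|y)<0\), so the mutual information grows with every added motor:

```python
import numpy as np

KDM, M = 0.39, 20                          # hypothetical motor threshold and Hill coefficient
B2, B3, B4 = 7e-4, 7e-4, 1e-4              # hypothetical motor noise parameters

y = np.linspace(0.01, 0.99, 4000)          # CheYp level in units of Y_T
z = y**M / (KDM**M + y**M)                 # motor dose-response (CW bias)
G = np.gradient(z, y)
sigma = np.sqrt(B2 * z * (1 - z) + B3 * z + B4)    # motor output noise only

p = G / sigma                              # optimal CheYp distribution, Eq. (6a)
p /= np.trapz(p, y)

I1 = -np.trapz(p * np.log2(np.sqrt(2*np.pi*np.e) * sigma / G * p), y)   # Eq. (2)
H_cond = np.trapz(p * np.log2(np.sqrt(2*np.pi*np.e) * sigma), y)        # H(z1|y) < 0

for K in (1, 2, 4, 8):
    print(f"K = {K}: I = {I1 - (K - 1) * H_cond:.2f} bits")             # Eq. (12)
```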

Hence, for conditionally independent motors, the mutual information at the motors eventually overtakes the mutual information at the receptors as the number of motors increases. Consequently, the mutual information at the receptors becomes the limiting factor for information transmission (Fig. 5C; see ref. 50 for the case of Gaussian input distributions). In other words, for a small number of motors the cell has high information transmission at the receptors, which is wasted at the motors (cf. red and blue solid lines). In contrast, for a large number of motors the information transmission at the motors exceeds that at the receptors without overall improvement. In the intermediate case, however, receptors and motors equally limit the transmission of information (cf. red dashed and blue solid lines in Fig. 5C). As E. coli chemotaxis seems to avoid bottlenecks and to optimally allocate resources51, this latter case is the most advantageous. Hence, the ultrasteep Hill function of the motor (m ≈ 20) can be explained by this matching of the information transmission at the receptors and motors (orange arrow in Fig. 5C). Note that in addition to the high-Hill-coefficient solution of the motors there is also a corresponding low-m solution (green arrow in Fig. 5C). However, the latter is not robust to changes in m: a small change in m, as can emerge from varying the number of FliM molecules of the motor52, can lead to a drastic reduction of information transmission. In addition, note that the mutual information shown in Fig. 5C with a red dashed line is calculated assuming that the two motors are optimized separately; the mutual information is further increased by optimizing the two motors simultaneously (dashed orange line in Fig. 5C). Our overall result that a high mutual information can be achieved with a high Hill coefficient of the motors remains valid (see S1 Text, Sec. 4). Between m ≈ 1 and m ≈ 20, part of the information transmission of the motor is wasted, as the receptors are information-flow limiting. In conclusion, multiple ultrasensitive motors are only useful when the motors are sufficiently independent. Any residual coupling among motors may be the result of close motor proximity or mechanical coupling of the flagella.

Discussion

This study presents a new approach to maximize the mutual information, particularly suitable for evolving biological systems subject to random mutations and selection. Previously, the channel capacity, i.e. the mutual information maximized with respect to the input distribution, was widely used for electronic and biological communication channels12,13,20,33. However, this method fails to capture possible changes of the internal input-output curve (e.g. by mutations). Conversely, the mutual information maximized with respect to the input-output curve neglects the biologically relevant feedback of the output on the input14,15. Here, we reconciled these two approaches by maximizing the mutual information with respect to both the input distribution and the input-output curve for Gaussian channels with small noise. Only when the total noise is uniform, or when the input or the output noise is negligible, are the two approaches identical. Unlike previous joint optimizations16,17,18, our input-output curves are not restricted to Hill or Hill-like functions. Our adaptive algorithm demonstrates how evolution might implement this joint optimization iteratively.

Our analytical solution of the joint optimization provides a number of new insights into optimal information transmission. First, the optimal input distribution is universal, depending only on the input noise. For Berg-and-Purcell-type input noise, we specifically obtain \(p(x)\sim 1/\sqrt{x}\). Hence, organisms are optimized for environments in which low-intensity stimuli occur with high frequency. This is sensible, as their high frequency would compensate for their large relative noise levels. Second, the optimal input-output curve is invariant to up- or down-scaling of the input noise (parameter α1), which sets the units of the input. Hence, only the shape of the input noise (i.e. its functional dependence on the input x) affects the input-output curve. Third, our optimal input-output curve is rather linear (Figs 3D and 4A). While this does not match the sigmoidal Hill functions suggested by models of E. coli chemotaxis34,53, a near-linear input-output curve makes best use of a given dynamic range. Furthermore, enforcing either zero slope of the input-output curve at the boundaries of the sensitive region or Hill functions as input-output curves leads to assumptions on the noise that are hard to justify biologically. Hence, Hill functions are incompatible with independent input and output noise (see S1 Text, Sec. 1.7 for details).

How can cells actively influence and optimize their distribution of sensory inputs? Genetic changes in the downstream pathway and motor can clearly change chemotactic behavior and hence the experienced input stimuli. For instance, increases in the motor speed lead to larger changes in stimulus and hence broader distributions of inputs. Similarly, faster adaptation leads to narrower distributions. The notion that cells influence their microenvironments is, however, best supported by the important role of niches in stem cell differentiation, cancer development, gut microbiota, and host-pathogen interactions54,55,56,57,58. Once inside the gut, the E. coli-related pathogen C. rodentium in mice (and similarly EPEC/EHEC in humans) injects effector proteins into the epithelial host cells. In response, these cells secrete increased levels of oxygen, in return allowing the pathogen to perform aerobic metabolism59. Hence, its aerotaxis machinery, inherited from E. coli and based on the Aer and Tsr receptors, experiences an increased frequency of oxygen stimuli, which the pathogen itself actively induced. If we take the assumption of maximal information transmission seriously, then cells not only actively influence but also optimize their environment.

Applying our information-theoretical approach, we showed analytically that the entire E. coli chemotaxis pathway can maximize the instantaneous mutual information between chemical concentration and motor bias despite the ultrasteep dose-response curve of the motors. Briefly, ultrasensitive motors do not restrict information transmission, since a collection of motors boosts information transmission, in addition to providing other chemotactic advantages in the soil or the animal intestine60. In particular, our model identifies the number of motors and their conditional independence as key quantities for transmitting large amounts of information in peritrichous bacteria.

What is the additional information at the motors used for if the ultimate behavioral output is just binary runs and tumbles? We speculate that the tumble angle, torque, and filament handedness could be regulated61,62. Indeed, real-time imaging of E. coli with fluorescent flagella showed that tumble angles increase with the number of clockwise-turning motors, allowing for differential cell responses61. Having non-identical motors with different Hill coefficients and thresholds may further increase the information transmission (e.g. as produced by different numbers of FliM in the motor ring)16, but this may not be feasible in the bacterial chemotaxis pathway, as the adapted activity sets the operating point of the motors. For instance, different threshold values would lead to some motors always rotating clockwise and others always counterclockwise. Our model also predicts that, to transmit large amounts of information, chemotactic bacteria with a single motor should prefer either a relatively low Hill coefficient at the motor or multiple response regulators feeding into a motor with a high Hill coefficient. This prediction could be tested with uni-flagellated bacterial species, such as Rhodobacter sphaeroides, Pseudomonas aeruginosa, or monotrichous marine bacteria60,63. In support of our theory, R. sphaeroides is known to have multiple CheYs64,65.

While applicable to many biological systems, our model makes a number of simplifications (in addition to the assumptions on noise and receptor sensitivity). Our results are based on independent maximizations of the receptor and motor channels. However, in S1 Text, Sec. 4.5, we discuss the general case, providing estimates of the mutual information \(\mathcal{I}[x;z]\) between the ligand input and the final motor output. This analysis suggests, once again, that a high Hill coefficient for multiple motors can support high information transmission. In particular, we analytically identify the two expected limits: (i) when the receptor noise is much smaller than the motor noise, we obtain \(\mathcal{I}[x;z]\approx\mathcal{I}[y;z]\), and (ii) when the motor noise is much smaller than the receptor noise, we have \(\mathcal{I}[x;z]\approx\mathcal{I}[x;y]\). Our analysis of the chemotactic pathway primarily focuses on the Hill coefficient. However, the dissociation constant kdm of the motor response is known to be larger than the adapted CheYp concentration. In S1 Text, Sec. 4.5.4, we explicitly study the role of kdm and find a relatively weak dependence of the mutual information on it. We also find that, after fixing the Hill coefficient m and using the optimized output distribution of the receptor channel, the kdm that maximizes the mutual information at the motor matches the experimentally measured value (which is larger than the adapted CheYp level; see Fig. S8).

Another simplification is that our model deals with instantaneous information transmission and hence does not explicitly include any history dependence36,37,66,67. Our approach should therefore be highly suitable for the slow genetic response in quorum sensing68,69, a system in which the input-output relation has been measured but the input distributions were simply guessed rather than predicted. Another area of application is eukaryotic chemotaxis, where cells move slowly while actively shaping their chemical gradient by ligand secretion70 and degradation71. In all these examples, the input distributions and cell behaviors need to match the input-output relations to allow for optimal information gathering. Nevertheless, our model is valid for information transmission by the initial transient chemotactic responses, and as this applies anywhere in the gradient, our model describes chemotaxis even including adaptation15. We expect our model to work even in relatively steep gradients, where, in addition to adaptation, long-history effects are important, such as those caused by receptor saturation and rotational diffusion24. The main assumption in ref. 15 is that gradients can be linearized over the range of input distributions; we do not, however, assume small Gaussian-distributed inputs. A drawback of our model is that it neglects cell-to-cell variability, which can be substantial72,73, so that in effect our theory focuses on a certain subpopulation of cells. This cell-to-cell variability may lead to advantages in terms of bet-hedging strategies not directly related to information processing74.

In conclusion, we provide a biologically inspired adaptive algorithm, with an analytical solution, for complex problems of information transmission in sensory systems. Future studies may need to account for time dependencies explicitly, including adaptation both at the receptors and at the motors. This may be achieved by considering trajectories of molecular concentrations and/or cell behavior, which may also help establish a link between chemotactic performance (e.g. drift velocity), information transmission, and the energetic cost of chemotaxis. This link may reveal interesting tradeoffs and new design principles29. A methodological contribution might be necessary, as calculations of the mutual information based on trajectories are hampered by the high-dimensional phase space of all possible trajectories.

Materials and Methods

Maximization of mutual information with respect to input distribution and input-output curve

To maximize Eq. (2) assuming the noise in Eq. (5), we focus on the integrand \(\mathcal{L}(x,p,\bar{y},\bar{y}')=p\,\log_2[\frac{\sqrt{2\pi e}\,\sigma_{\mathrm{T}}}{\bar{y}'}\,p]\) and use the standard Lagrange formalism from the calculus of variations, where x is the independent variable while \(p=p(x)\), \(\bar{y}=\bar{y}(x)\), and \(\bar{y}'=\bar{y}'(x)\) are the dependent variables. Note that \(p'(x)\) does not appear in the Lagrangian \(\mathcal{L}\). To find the maximum of \(\mathcal{I}\) with respect to p and \(\bar{y}\), we need to solve Eqs. (3) and (4) together.

Equation (6a) is the solution of Eq. (3), which can be rewritten through differentiation as

$$\frac{p'}{p}=\frac{\bar{y}''}{\bar{y}'}-\frac{\sigma_{\mathrm{T}}'}{\sigma_{\mathrm{T}}}.$$
(13)
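
Equation (13) can be checked symbolically by inserting Eq. (6a); a minimal sympy sketch:

```python
import sympy as sp

x = sp.symbols('x')
Z = sp.symbols('Z', positive=True)
y_bar = sp.Function('y_bar')(x)
sigma = sp.Function('sigma_T')(x)

p = sp.diff(y_bar, x) / (Z * sigma)        # Eq. (6a)
lhs = sp.diff(p, x) / p                    # p'/p
rhs = sp.diff(y_bar, x, 2) / sp.diff(y_bar, x) - sp.diff(sigma, x) / sigma
print(sp.simplify(lhs - rhs))              # 0, confirming Eq. (13)
```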

To derive Eq. (6b), we evaluate the following derivatives of the Lagrangian (for brevity we use the natural logarithm, since the constant factor 1/ln 2 drops out of the stationarity conditions):

$$\frac{\partial\mathcal{L}}{\partial\bar{y}}=\frac{p}{\sigma_{\mathrm{T}}}\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}},$$
(14)
$$\frac{\partial\mathcal{L}}{\partial\bar{y}'}=\frac{p}{\sigma_{\mathrm{T}}}\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}-\frac{p}{\bar{y}'},$$
(15)
$$\frac{\mathrm{d}}{\mathrm{d}x}\frac{\partial\mathcal{L}}{\partial\bar{y}'}=-\frac{p'}{\bar{y}'}+\frac{p\,\bar{y}''}{\bar{y}'^{2}}+\frac{p'}{\sigma_{\mathrm{T}}}\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}-\frac{p\,\sigma_{\mathrm{T}}'}{\sigma_{\mathrm{T}}^{2}}\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}+\frac{p}{\sigma_{\mathrm{T}}}\frac{\mathrm{d}}{\mathrm{d}x}\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}.$$
(16)

Using Eqs. (13)–(16), Eq. (4) becomes

$$2\frac{\bar{y}''}{\bar{y}'}-2\frac{\sigma_{\mathrm{T}}'}{\sigma_{\mathrm{T}}}+\frac{\frac{\mathrm{d}}{\mathrm{d}x}\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}}{\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}}=-\frac{1}{\bar{y}'}\,\frac{\frac{\partial\sigma_{\mathrm{T}}}{\partial x}}{\frac{\partial\sigma_{\mathrm{T}}}{\partial\bar{y}'}}.$$
(17)

Now assuming independent cell-external and internal noises as in Eq. (5), Eq. (17) becomes Eq. (6b).
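
This last step can be verified symbolically for the special case \(\sigma_x^2=\alpha_1 x\) and \(\sigma_y^2=\alpha_4\) (cf. Eq. (9)), where Eq. (6b) implies \(\bar{y}'=c/\sqrt{x}\) with a constant c. The sympy sketch below confirms that this candidate satisfies Eq. (17):

```python
import sympy as sp

x, a1, a4, c = sp.symbols('x alpha_1 alpha_4 c', positive=True)

# candidate optimum: y'(x) = c/sqrt(x), making sigma_x^2 * y'^2 constant (Eq. (6b))
yp = c / sp.sqrt(x)                        # y_bar'
ypp = sp.diff(yp, x)                       # y_bar''

w = sp.symbols('w', positive=True)         # placeholder for y_bar' inside sigma_T
sigma = sp.sqrt(a1 * x * w**2 + a4)        # total noise, Eq. (5), for this noise model

d_sigma_dyp = sp.diff(sigma, w)            # partial derivative w.r.t. y_bar'
d_sigma_dx = sp.diff(sigma, x)             # partial derivative w.r.t. x

on_sol = lambda f: f.subs(w, yp)           # evaluate on the candidate solution
lhs = (2 * ypp / yp
       - 2 * sp.diff(on_sol(sigma), x) / on_sol(sigma)
       + sp.diff(on_sol(d_sigma_dyp), x) / on_sol(d_sigma_dyp))
rhs = -on_sol(d_sigma_dx) / (yp * on_sol(d_sigma_dyp))
print(sp.simplify(lhs - rhs))              # 0, so Eq. (6b) indeed solves Eq. (17)
```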