Phenotypic equilibrium as probabilistic convergence in multi-phenotype cell population dynamics

Da-Quan Jiang; Yue Wang; Da Zhou

doi:10.1371/journal.pone.0170916

Abstract

We consider the cell population dynamics with n different phenotypes. Both the Markovian branching process model (stochastic model) and the ordinary differential equation (ODE) system model (deterministic model) are presented, and exploited to investigate the dynamics of the phenotypic proportions. We will prove that in both models, these proportions will tend to constants regardless of initial population states (“phenotypic equilibrium”) under weak conditions, which explains the experimental phenomenon in Gupta et al.’s paper. We also prove that Gupta et al.’s explanation is the ODE model under a special assumption. As an application, we will give sufficient and necessary conditions under which the proportion of one phenotype tends to 0 (die out) or 1 (dominate). We also extend our results to non-Markovian cases.

Citation: Jiang D-Q, Wang Y, Zhou D (2017) Phenotypic equilibrium as probabilistic convergence in multi-phenotype cell population dynamics. PLoS ONE 12(2): e0170916. https://doi.org/10.1371/journal.pone.0170916

Editor: Zhen Jin, Shanxi University, CHINA

Received: September 12, 2016; Accepted: January 12, 2017; Published: February 9, 2017

Copyright: © 2017 Jiang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper.

Funding: Da-Quan Jiang is supported by National Natural Science Foundation of China 11271029 and National Natural Science Foundation of China 11171024 (http://www.nsfc.gov.cn/publish/portal1/). Da Zhou is supported by National Natural Science Foundation of China 11401499 and the Natural Science Foundation of Fujian Province of China (No. 2015J05016) (http://www.fjkjt.gov.cn/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

With the same genetic background, cell population may have different cellular phenotypes. This has been one of the major topics in the research of cell population dynamics [1, 2]. Very recently much attention has been paid to the stochastic conversions between different phenotypes [3, 4]. For example, we know that cancer stem cells can give rise to cancer non-stem cells, but cancer non-stem cells can also transform back to cancer stem cells [5, 6]. Generally, we can use a branching process (stochastic model) [7–11] or an ODE system (deterministic model) [12] to describe the dynamics of such cell population with multiple phenotypes. However, in many experimental settings, it is difficult or even impossible to count the total cell population [10, 11, 13]. Thus in the last fifty years, people began to consider the proportions of cell individuals with distinct phenotypes instead of the absolute numbers of cells of various phenotypes [7].

We know that through multistep accumulation gene mutations, healthy cells gradually transform to malignant cancer cells, which is the current view of cancer progression [14]. Among those mutations, most of them are neutral (“passenger mutations”) and have no effect on cell proliferation. Only a small portion of mutations will bring growth advantage (“driver mutations”) [15]. Since the emerging of mutations can be regarded as purely stochastic, we can model such procedure with multitype branching processes (cf. Bozic et al.’s model [15, 16]). There have been some results about the emerging time for certain number of driver mutations, and the relation between the number of emerged driver mutations and emerged passenger mutations [15, 17].

In the experiments on breast cancer cell lines, Gupta et al. [4] found that the proportion of each phenotype will tend to a certain constant regardless of the initial population states (“phenotypic equilibrium”). They built a Markovian model, assuming that the evolution of the phenotypic proportions satisfies an n-state Markov chain, and used the ergodicity of the Markov chain to explain this phenomenon [4]. However, we find that the Markovian model is just the ODE system model under a special condition. We determine this condition and its biological meaning. Furthermore, we try to remove this condition and explain the experimental phenomenon in [4] under more general context.

In the deterministic model (ODE system), we only consider the average behavior of cell population dynamics (which requires a large initial population). However, using the stochastic model (branching processes), we can study the trajectory behavior. We prove that the proportions will converge not only on average, but also almost surely. This implies that even with a small initial population, we can still observe “phenotypic equilibrium”.

In the theory of multitype branching processes, people have observed similar proportion convergence phenomenon and proved such phenomenon in several limit theorems under different conditions [18–20]. Those are possible ways to explain “phenotypic equilibrium”, but those required conditions may not be satisfied in experiments. Thus we improve those limit theorems by dropping redundant conditions. We will see that the conditions we need are all biologically reasonable. Therefore, we give a stochastic explanation of “phenotypic equilibrium”. This result may also be of interests to probabilists.

Generally we only consider Markovian branching processes, but sometimes the biological process is not memoryless, thus we need to consider non-Markovian branching processes. We show that under some conditions, the non-Markovian branching processes can be transformed into Markovian branching processes. Using this trick, we demonstrate similar results for non-Markovian branching processes.

In Section 2, we will define some notations, and give the mathematical description of our models, which is based on [8] and [20]. In Section 3, we will describe under which condition the deterministic model becomes the Markovian model in [4]. In Section 4, we will prove that under some mild conditions, the “phenotypic equilibrium” phenomenon will always happen in the Markovian branching process model. Specifically, we will improve a limit theorem about proportion convergence in multitype branching processes. We will also apply our results to Bozic et al.’s model. In Section 5, as an application of our conclusions, we will investigate under what conditions one of the phenotypes will die out or dominate. In Section 6, we will show that the above conclusions are still valid in more general cases.

2 Notations and model description

2.1 Notations

Boldface letter, like A, represents matrix. A′ means the matrix transpose of A. I is the identity matrix. Letter with arrow above, like , represents row vectors. and represent all ones and all zeros vectors. Consider the population of cells with n phenotypes: Y₁, Y₂, ⋯, Y_n. In the stochastic model, is the population at time t, where X_i(t) is the population of phenotype Y_i. is the proportion of phenotype Y_i, as long as the denominator is not zero. . In the deterministic model, is the expected population at time t. We only consider the case where each x_i(t) is nonnegative, and at least one of them is positive (to guarantee ). is the total population. is the proportions of different subpopulations among the total population.

2.2 Stochastic model

Assume that the population of cells have n phenotypes: Y₁, Y₂, ⋯, Y_n. Assume that all the cells evolve independently. (During the exponential growth period, this assumption is almost true [21]) We can present the generalized cell divisions, death and phenotypic conversions as the following reaction form:

It means that for an Y_i cell, it will live an exponential time (we will consider non-exponential lifetime in Section 6) with expectation 1/α_i and turn into d_i1 Y₁ cells, d_i2 Y₂ cells, ⋯, d_in Y_n cells, where d_i1, d_i2, ⋯, d_in are random variables taking nonnegative integer values. d_i1, d_i2, ⋯, d_in are not necessarily independent, but they are assumed to be independent with the exponential reaction time of any cell.

For example, an asymmetric division Y₁ → Y₁ + Y₂ means (d₁₁, d₁₂, d₁₃, ⋯, d_1n) = (1, 1, 0, ⋯, 0). A conversion Y₁ → Y₂ means (d₁₁, d₁₂, d₁₃, ⋯, d_1n) = (0, 1, 0, ⋯, 0). So the probability distribution on the possible reactions gives the joint probability distribution of d_i1, d_i2, ⋯, d_in.

In fact this is a multitype continuous-time Markovian branching process with state space , each component of which represents the population of a phenotype, as defined in [19] and [20]. For example, if one Y₁ cell splits symmetrically, the process will move from the state to the state . We require that , ∀i, j. (In experiments, d_ij is bounded, thus is always true). Then this process will always have finite value in finite time (non-explosion) with probability one [19]. Section V.7.1, (3)-(4)].

2.3 Deterministic model

Now we consider the mathematical expectation of the populations, , which is nonnegative. Based on [19], Section V.7.2, (5)–(9)], we have the deterministic model, namely the following ODE system: (1) where , , (i ≠ j). Define . From Eq (1), we have .

3 The relation between the Markovian model and the deterministic model

In the Markovian model [4], it is assumed that the population proportions satisfies the Kolmogorov forward equations of an n-state Markov chain: (2) where Q is the transition rate matrix, satisfying . In this section, we will discuss whether such assumption can be satisfied in the deterministic model.

From Eq (1), we have (3)

If for some constant k, then Eq (3) becomes , and . Thus Eq (3) has the same form of Eq (2). If , there are non-zero quadratic terms of p_i(t) in Eq (3), implying that Eq (3) does not have the same form of Eq (2).

Notice that means (4)

Thus we have

Theorem 1. Eq (4) is the sufficient and necessary condition for that the proportions of different phenotypes in the deterministic model (1) satisfy the Kolmogorov forward equations of an n-state Markov chain.

Now we know that the Markovian model is a special case of the deterministic model. In biology, Eq (4) means that the growth rates (average number of descendants produced per unit time) of different phenotypes are the same. This condition might be well satisfied for breast cancer cells, which explains why the data fitting in [4] is satisfactory.

4 Asymptotic behavior in general cases

In general cases, Eq (4) is not satisfied since different phenotypes may differ in cell cycling time [22, 23], then the Markovian model is invalid. Thus we need other methods to study the asymptotic behavior of the population dynamics. In this section, we will prove that under some mild conditions, the proportions of different phenotypes will tend to some constants regardless of initial population states.

From Perron-Frobenius theorem [24, 25], we know that has a real eigenvalue λ₁ (called Perron eigenvalue), such that for any eigenvalue μ ≠ λ₁, Re μ < λ₁. λ₁ has a left eigenvector = (u₁, u₂, ⋯, u_n) (called Perron eigenvector), satisfying u_i ≥ 0, ∀i and . When λ₁ is simple, such is unique. We know that the set of all n-order real square matrices with repeated eigenvalue has measure 0 (as a subset of ) [26]. Thus it is reasonable to assume that λ₁ is simple.

4.1 Deterministic model

We have proved the following theorem in Appendix B of [26].

Theorem 2. Assume that λ₁ is simple. Starting from any initial value except for the point in some zero-measure set, we have (x₁(t), x₂(t), ⋯, x_n(t))/exp(λ₁ t) → as t → ∞, where c > 0 is a constant. In this case, the solution of Eq (4) will tend to as t → ∞. Thus Eq (4) has one and only one stable fixed point and no stable limit cycle.

This gives a satisfactory deterministic explanation of the phenotypic equilibrium phenomenon reported in [4].

Remark 1. If λ₁ is not simple, then the convergence result may not hold. Consider A with a_{i, j} = 0, ∀i ≠ j and a_{i, i} = 1, ∀i. Here λ₁ = 1 is not simple, x_i(t) = x_i(0)exp(λ₁t), and (x₁(t), x₂(t), ⋯, x_n(t))/exp(λ₁t) = (x₁(0), x₂(0), ⋯, x_n(0)) will never change. Convergence to a common point will not occur.

4.2 Stochastic model

Since 1960s, probabilists proved that for a continuous-time multitype branching process, under different conditions, where W is a nonnegative random variable. In [19, 27] and [28], it is required that λ₁ > 0 and A is irreducible (this implies λ₁ is simple). In [19] it is proved that W = 0 or W > 0 according to whether the population will become extinct. In [18], it is required that the branching process is discrete in time. In [10, 11] it is required that the initial population tends to infinity. Janson [20] requires that λ₁ > 0, λ₁ is simple, and assumes a special condition about communicating classes structure (see Remark 2). Based on [20] and [19], we will prove the convergence theorem without Janson’s last assumption (Theorem 3). We can see the benefit of this improvement in Section 5.

4.2.1 Preliminaries.

In this section, we assume that λ₁ is simple and positive. λ₁ > 0 means that the total cell population is increasing.

Sometimes, the transformation from one phenotype to another phenotype is not reversible. For example, a mature human red blood cell (which loses its nucleus) cannot transform back to a zygote. Thus we need to classify phenotypes according to communicating behaviors. In mathematical language, we need to study communicating classes of A when A is reducible.

If a cell of phenotype Y_i can produce (directly or indirectly) a cell of phenotype Y_j and vice versa, then we say phenotype Y_i communicates with phenotype Y_j (Y_i ↔ Y_j). Since “↔” is an equivalent relation, we can divide the n phenotypes into several disjoint sets (called communicating classes) according to A [29]. Then we can order the classes and rearrange the phenotypes suitably to make A block-triangular. (Each diagonal block corresponds to a communicating class). Thus the eigenvalues of A consist of all eigenvalues of diagonal blocks. Every eigenvalue corresponds to a diagonal block, and then corresponds to a communicating class. (See [20] and [18] for details).

Denote the communicating class corresponding to the Perron eigenvalue λ₁ by T.

For example, consider matrix , where each W represents a different nonnegative matrix (not 0). Assume that D₃ has the Perron eigenvalue λ₁, then D₃ corresponds to the communicating class T. Denote the other three communicating classes by C₁, C₂, C₄.

For two communicating classes C_i and C_j, we write C_i ⇒ C_j if there exist phenotype X_{k_i} ∈ C_i and X_{k_j} ∈ C_j such that a_{k_j, k_i} > 0. For two communicating classes C and D, we write C → D if there exist communicating classes C = C₁, C₂, ⋯, C_m = D such that C_i ⇒ C_i+1, ∀1 ≤ i < m. Stipulate that C_i ⇒ C_i and C_i → C_i.

Then we can illustrate the communicating classes in the example above as Fig 1.

Download:

Fig 1. Illustration of the example in Section 4.2.1.

https://doi.org/10.1371/journal.pone.0170916.g001

For a communicating class C, define . In other words, is the set of all phenotypes that can produce (directly or indirectly) phenotypes in C. In the example above .

For a communicating class C, define . In other words, is the set of all phenotypes that can be produced (directly or indirectly) by phenotypes in C. In the example above .

For the Markovian branching process , we say that a cell V with phenotype in becomes “essentially extinct” if at some time no cell of any phenotypes in is V or its descendants. In other words, V and its descendants become extinct inside . We say that a trajectory of the branching process becomes “essentially extinct” if at some time no cell of any phenotypes in remains. This means that we can never get a cell with phenotypes in T any more. If so, we cannot have the desired convergence property (Theorem 3), since the proportions of phenotypes in T should be positive (Lemma 2). Let the branching process start at any initial population as long as it has some cells with phenotypes in .

4.2.2 Results and proofs.

We now state the main result of this paper and then give the proof of it.

Theorem 3. Assume that λ₁ is simple and positive. Conditioned on essential non-extinction, we have almost surely ⋯, u_n) as t → ∞.

Lemma 1. Assume that λ₁ is simple. If for some i ≠ j, a_{i, j} > 0 in Eq (1), then u_j > 0 ⇒ u_i > 0.

Proof. Without loss of generality, let i = 1, j = 2. Assume u₁ = 0, u₂ > 0. Let in the first equation of Eq (3). Then it becomes . However is a fixed point of Eq (3) according to Theorem 2, thus we should have dp₁/dt = 0, which is a contradiction.

Lemma 2. Assume that λ₁ is simple. Then .

Proof. Apply the Perron-Frobenius theorem to , the restriction of A on , and let be its Perron eigenvector. w_T, the restriction of on T cannot be , otherwise λ₁ is an eigenvalue of , a contradiction. From Lemma 1 we know that is positive. Set u_i = w_i if , and u_j = 0 if , then is the Perron eigenvector of A. Thus .

Lemma 3. (Lemma 9.8 in [20]) Assume that λ₁ is simple and positive. Then we have almost surely as t → ∞, where W is a nonnegative random variable, and .

Lemma 4. (Lemma 9.7 (ii) and (iii) in [20], originated from Theorem V.7.2 in [19]) Assume that λ₁ is simple and positive, and contains all phenotypes, then W = 0 if and only if the branching process becomes essentially extinct almost surely.

Remark 2. Janson’s paper [20, Section 2] has six fundamental assumptions (A1)-(A6). Assumptions (A1)-(A5) have been satisfied in this paper (regarding (A5) as “the process is not essentially extinct at time 0”). Assumption (A6) “ contains all phenotypes” is only used in Lemma 4. In fact, we will prove (in Lemma 7) that Lemma 4 is still correct without Assumption (A6). Thus we can drop Assumption (A6) in the main result. Assumption (A6) implies all phenotypes should have the same exponential growth rate λ₁, and no phenotype will die out or dominate (see Section 5), which is not necessarily satisfied in experiments. For example, in Bozic et al.’s paper [15], they consider tumor cells which gradually cumulate mutations which accelerate cell growth. Since tumor cells with more accelerating mutations cannot switch back to tumor cells with less such mutations, they must have different growth rates. Also, the proportion of tumor cells with less accelerating mutations will gradually decrease to 0, which contradicts with assumption (A6).

The following lemma is a modification of the second Borel-Cantelli lemma. We base our proof on Theorem 2.3.6 in [30].

Lemma 5. Consider events B₁, B₂, ⋯, B_n, ⋯. If for any positive integers m < n, we have , where 0 < ϵ ≤ 1, then B_n) = 1. In other words, almost surely {B_n: n ≥ 1} will happen infinitely often.

Proof. Let 0 < M < N < ∞. as N → ∞. So for all M, and since it follows that .

Lemma 6. For almost every essentially non-extinct trajectory (according to Lemma 3, the set of such trajectories has positive probability), we can find an essentially non-extinct cell with phenotype in T within finite time. If we can find such cell at time t, then we can find such cell at any time τ > t.

Proof. If at some time t all cells with phenotypes in die out, then at least one of the remaining cells with phenotypes in T is not essentially extinct.

Otherwise, at each time t = k (), there exists one cell E_k with phenotype in . (For different k, E_k may be the same cell). Let B_k () be the event that during the time interval [k, k + 1), the cell E_k produces (directly or indirectly) at least one cell with phenotype in T.

If B_k happens, choose one such cell with phenotype in T and put it in a special set S. Consider any two cells F and G in S, and assume F is produced in the time interval [i, i + 1), G is produced in the time interval [j, j + 1), and i < j, where . Then E_j is the ancestor of G. Since E_j has phenotype in , and F has phenotype in T, F cannot be the ancestor of E_j. Since E_j is still alive at time t = j, when F has been produced, E_j cannot be the ancestor of F. Thus F cannot be the ancestor of G. Since G is produced after F, G cannot be the ancestor of F. In sum, one cell in S cannot be the ancestor of another cell in S. Thus all cells in S are independent.

Consider two phenotypes Y_i and Y_j, and assume a cell with phenotype Y_i can produce a cell with phenotype X_j directly, namely . Because of Markovian property, within a time span of 1/n, the probability for a cell with phenotype Y_i to produce a cell with phenotype Y_j directly is . Let . For a cell with phenotype in , it can produce a cell with phenotype in T within n steps. Thus the probability of B_k is no less than ηⁿ, regardless of what happens before time t = k.

Now we can use Lemma 5 with ϵ = ηⁿ, and there will be an infinite number of cells in S, except for a zero-measure set of trajectories. According to Lemma 3, the probability for one cell in S to become essentially extinct is less than 1, thus the probability for all cells in S to become essentially extinct is 0, and at least one cell in S is not essentially extinct, except for a zero-measure set of trajectories.

Lemma 7. Assume that λ₁ is simple and positive, then W = 0 if and only if the branching process becomes essentially extinct almost surely.

Proof. ⇐: For a trajectory outside the zero-measure exclusion set of Lemma 3, assume that at some time τ ≥ 0 (dependent on the trajectory), X_i(τ) = 0 for all . For any Y_j ∈ T, 0 = lim_{t → ∞} e^{− λ₁t}X_j(t) = Wu_j. From Lemma 2, u_j > 0. Thus W = 0 almost surely.

⇒: Assume that (W = 0 & the trajectory is not essentially extinct) = P₀ > 0. According to Lemma 6, we can find time t₀ > 0 large enough such that (W = 0 & the trajectory is not essentially extinct & there exists an essentially non-extinct cell with phenotype in T at time t₀) ≥ P₀/2 > 0. On this set, only consider this essentially non-extinct cell and its descendants from time t ≥ t₀, then the population is restricted on and we can use Lemma 4. Now we have W > 0 except for a zero-measure set of trajectories, which is a contradiction.

From Lemma 3 and Lemma 7, we can obtain Theorem 3.

Remark 3. The assumption of λ₁ > 0 is not too strong. If λ₁ < 0, then from Theorem 2, the expected populations decay to . Therefore this process will become extinct almost surely. For λ₁ = 0, consider an example that each cell always have exactly one child, and the child can be any phenotype with equal probability. Then the total population is fixed, and the proportions will always fluctuate, so there is no convergence [20].

For Gupta el al’s experiment, the initial cell population is very large in cancer cell lines, thus the probability of essential extinction is quite small. Therefore, the proportions will almost always tend to the same constants. This gives a satisfactory stochastic explanation of the phenotypic equilibrium phenomenon reported in [4].

The deterministic model only reflects the average behavior of many trajectories (or equivalently a large initial population). When the cell number is relatively small, the stochasticity is not negligible, and it is not very reasonable to assume the cell number changes continuously. So the stochastic model is more effective than deterministic model. That is why we also prove the same result for stochastic model.

Remark 4. In Bozic et al.’s model, we divide cells in groups by their number of driver mutations. The growth rate for cells with k driver mutations is 1 − (1 − s)^k/2 (s = 0.004 in [15]). Cells can acquire driver mutations, but cannot lose them. Also we do not consider cell death, so there is no essential extinction. If we only consider cells with no more than n driver mutations, then the population of cells with exactly n mutations will grow with exponential growth rate 1 − (1 − s)ⁿ/2, which is larger than that of cells with less driver mutations. So cells with n driver mutations will dominate exponentially fast. Generally, the population of cells with n driver mutations will grow exponentially with rate 1 − (1 − s)ⁿ/2, and as long as the next driver mutation emerges, its proportion will decay to 0 exponentially fast. Also, we should notice that such results are valid for almost every trajectories. Here we do not consider the difference in passenger mutations, since they have no effect on cell growth, and considering them will make the Perron eigenvalue not simple. The model in [15, 16] is discrete-time, but we can see from Section 6 that we can still apply our results.

5 When will one proportion tend to 0 or 1?

In population dynamics, we are also concerned about when one phenotype dies out or dominates. In terms of the notations in this paper, we need to consider when P_i(t) → 0 or P_i(t) → 1 as t → ∞.

In this section, we will still assume that the Perron eigenvalue λ₁ of A is simple and positive. Then from Theorem 3, we have almost surely in the stochastic model. Thus we can get the following corollaries from Lemma 2.

Corollary 4. .

Corollary 5. .

Remark 5. From Corollary 4 we can see that the sufficient and necessary condition under which no phenotype dies out, namely , is that contains all phenotypes. This is just Janson’s last assumption.

Remark 6. If we find that P_i(t) → 0, in an experiment, then we know that the phenotype Y_j will never transform to Y_i in any way. If we find that P_i(t) → 1, then we know that the phenotype Y_i will never transform to any other phenotypes.

6 Model generalization: non-exponential lifetime

In the previous sections, we assumed that the lifetime of a cell is exponentially distributed and independent of the type and number of its descendants. However, in real biological system, the lifetime distribution should be more like lognormal, gamma, Weibull, or exponentially modified Gaussian distribution [31, 32]. Furthermore, the time needed for division and conversion have different distributions [32]. In this way the process is a multitype Bellman-Harris branching process (also called age-dependent branching process) [33], no longer Markovian.

We can use the “device of stages” method to approximate a non-exponential random variable with several exponential random variables [34]. This indicates that through adding supplementary sub-phenotypes, we can simulate a non-Markovian branching process with a Markovian branching process. See the example illustrated in Fig 2.

Download:

Fig 2. Illustration of the example in Section 6.

https://doi.org/10.1371/journal.pone.0170916.g002

Here are supplementary sub-phenotypes. We artificially assume such supplementary sub-phenotypes exist just by technical reasons. They do not have biological meanings. When we count as Y₁, the process has the same distribution with the original one. If we have convergence with this new process, then we also have convergence for the original process. An Y₁ cell has probability to divide into Y₁ + Y₁, and probability to convert into Y₂. Here we set , , and to be large enough while keeping their proportions, so that the time needed for the first step is ignorable (exponential random variable with expectation ).

Now the time distribution for division Y₁ → Y₁ + Y₁ is approximately , where Ex(α) is the density function of exponential random variable with parameter α, and * means convolution. Similarly, the time distribution for conversion Y₁ → Y₂ is approximately .

According to [34], any non-negative random variable can be approximated to any accuracy by such combination of convolutions of exponential random variables. Thus we can simulate such non-Markovian branching processes to any precision with Markovian branching processes. Here the lifetime of a cell can be non-exponential, and the lifetime of a cell can depend on the type and number of its descendants.

Now we can apply Theorem 3 to those sub-phenotypes. The proportion of each sub-phenotype converges to a constant. Thus the proportion of each phenotype (including all its sub-phenotypes) converges to a constant. This proves the “phenotypic equilibrium” phenomenon in a more realistic stochastic model. In addition, the conclusions in Section 5 are still valid.

Remark 7. The most unrealistic aspect of exponential lifetime is that the density function reaches maximum at 0, but one cell cannot divide right after its birth. For the sum of several independent exponential variables, the density function at 0 is 0. By the law of large numbers, the density function of the sum of n independent exponential variables with parameter nλ will have a sharp peak near λ when n is large.

Remark 8. The proportion convergence theorem for non-Markovian (age-dependent) branching processes can be proved directly, but under stronger conditions [33].

7 Conclusion and discussion

We have presented a unified stochastic model for the population dynamics with cellular phenotypic conversions. We have given the sufficient and necessary condition under which the dynamical behavior of our model can be described by an n-state Markov chain. In general case, we have proved that the proportions of different phenotypes will tend to constants regardless of their initial values, and we have investigated the sufficient and necessary conditions under which one phenotype will die out or dominate. We also extend our model to non-Markovian case while keeping the above conclusions valid. In this way we explain experimental phenomenon in [4].

As remarked in Section 4.2, we improve a limit theorem in branching processes, which may be of theoretical interests.

Our results can also apply to cancer progression models with gene mutations, such as [15–17].

Since the phenotypic conversions have been reported in various cellular systems, such as E.coli [35] and cancer cells [6, 36], we hope that our model here could be applied as a general framework in the study of multi-phenotypic populations of cells.

With the improvement of experiment methods, we will accumulate more and more data for single cell. In single cell level, the stochasticity is not negligible anymore. Thus we should build detailed stochastic models (branching process would be a good framework) for cell population dynamics. Such models will provide new insights and predictions. In the meanwhile, stochastic process related theories should attract more attention.

There are some possible improvements about this research. First, we assume that the branching process is time homogeneous, namely the birth and death rates keep the same for all time. However, as time goes on, the cell density increases, and the birth and death rates should change [21]. Thus a possible improvement is to have time-dependent or density-dependent d_ij. Second, we only prove the convergence for t → ∞, but in experiments we only have finite observation time. Thus it is meaningful to estimate the convergence rate. Third, we only consider finite many phenotypes. We find that there is essential difficulty to build similar theory for infinite-type branching processes. However, there have been some works considering infinite many mutation states with multitype branching processes, such as [17].

Acknowledgments

We would like to thank Professor Min-Ping Qian, Da-Yue Chen, Svante Janson and anonymous reviewers for helpful advice and discussions. Y. W. would like to thank Lingxue Zhu and Mingda Zhang for a special and inspiring discussion.

Author Contributions

Conceptualization: D-QJ YW DZ.
Formal analysis: YW.
Funding acquisition: D-QJ DZ.
Investigation: YW.
Methodology: D-QJ YW DZ.
Writing – original draft: YW DZ.
Writing – review & editing: D-QJ YW DZ.

References

1. Altschuler SJ, Wu LF. Cellular heterogeneity: do differences make a difference? Cell 141(4) (2010) 559–563. pmid:20478246
- View Article
- PubMed/NCBI
- Google Scholar
2. Kussell E, Leibler S. Phenotypic diversity, population growth, and information in fluctuating environments. Science 309 (5743) (2005) 2075–2078. pmid:16123265
- View Article
- PubMed/NCBI
- Google Scholar
3. dos Santos RV, da Silva LM. The noise and the KISS in the cancer stem cells niche. J. Theor. Biol 335 (2013) 79–87. pmid:23827200
- View Article
- PubMed/NCBI
- Google Scholar
4. Gupta PB, Fillmore CM, Jiang G, Shapira SD, Tao K, Kuperwasser C, et al. Stochastic state transitions give rise to phenotypic equilibrium in populations of cancer cells. Cell 146 (4) (2011) 633–644. pmid:21854987
- View Article
- PubMed/NCBI
- Google Scholar
5. Zhou D, Wu D, Li Z, Qian MP, Zhang MQ. Population dynamics of cancer cells with cell state conversions. Quant. Biol. 1 (3) (2013) 201–208. pmid:26085954
- View Article
- PubMed/NCBI
- Google Scholar
6. Yang G, Quan Y, Wang W, Fu Q, Wu J, Mei T, et al. Dynamic equilibrium between cancer stem cells and non-stem cancer cells in human SW620 and MCF-7 cancer cell populations. Br. J. Cancer 106 (9) (2012) 1512–1519. pmid:22472879
- View Article
- PubMed/NCBI
- Google Scholar
7. Jagers P. The proportions of individuals of different kinds in two-type populations. A branching process problem arising in biology. J. Appl. Probab 6 (2) (1969) 249–260.
- View Article
- Google Scholar
8. Kimmel M, Axelrod DE. Branching Processes in Biology. Springer, New York, 2002, pp. 103–140.
9. Yakovlev AY, Mayer-Proschel M, Noble M. A stochastic model of brain cell differentiation in tissue culture. J. Math. Biol. 37 (1) (1998) 49–60. pmid:9710974
- View Article
- PubMed/NCBI
- Google Scholar
10. Yakovlev AY, Yanev NM. Relative frequencies in multitype branching processes. Ann. Appl. Probab. 19 (1) (2009) 1–14.
- View Article
- Google Scholar
11. Yakovlev AY, Yanev NM. Limiting distributions for multitype branching processes. Stoch. Anal. Appl. 28 (6) (2010) 1040–1060. pmid:24031155
- View Article
- PubMed/NCBI
- Google Scholar
12. Murray JD. Mathematical Biology I: An Introduction. Springer-Verlag, Berlin, Heidelberg, 2001, pp. 1–6, 101–105.
13. Clayton E, Doupé PD, Klein AM, Winton DJ, Simons BD, Jones PH. A single type of progenitor cell maintains normal epidermis. Nature 446 (7132) (2007) 185–189. pmid:17330052
- View Article
- PubMed/NCBI
- Google Scholar
14. Hanahan D, Weinberg RA. The hallmarks of cancer. Cell, 100(1), (2000) 57–70. pmid:10647931
- View Article
- PubMed/NCBI
- Google Scholar
15. Bozic I, Antal T, Ohtsuki H, Carter H, Kim D, Chen S, et al. Accumulation of driver and passenger mutations during tumor progression. Proc. Natl. Acad. Sci. 2010 Oct 26;107(43):18545–50. pmid:20876136
- View Article
- PubMed/NCBI
- Google Scholar
16. Kimmel M, Corey S. Stochastic hypothesis of transition from inborn neutropenia to AML: interactions of cell population dynamics and population genetics. Front. Oncol. 2013 Apr 29;3(89.10):3389.
- View Article
- Google Scholar
17. McDonald TO, Kimmel M. A multitype infinite-allele branching process with applications to cancer evolution. J. Appl. Probab. 2015;52(3):864–76.
- View Article
- Google Scholar
18. Kesten H, Stigum BP. Limit theorems for decomposable multi-dimensional Galton-Watson processes. J. Math. Anal. Appl. 17 (2) (1967) 309–338.
- View Article
- Google Scholar
19. Athreya KB, Ney PE. Branching Processes. Springer-Verlag, Berlin, 1972, pp. 199–206.
20. Janson S. Functional limit theorems for multitype branching processes and generalized Pólya urns. Stoch. Process. Appl. 110 (2) (2004) 177–245.
- View Article
- Google Scholar
21. Zwietering MH, Jongenburger I, Rombouts FM, van’t Riet K. Modeling of the bacterial growth curve. Appl. Environ. Microbiol. 56 (6) (1990) 1875–1881. pmid:16348228
- View Article
- PubMed/NCBI
- Google Scholar
22. Patrawala L, Calhoun T, Schneider-Broussard R, Zhou J, Claypool K, Tang DG. Side population is enriched in tumorigenic, stem-like cancer cells, whereas ABCG2⁺ and ABCG2⁻ cancer cells are similarly tumorigenic. Cancer Res. 65 (14) (2005) 6207–6219. pmid:16024622
- View Article
- PubMed/NCBI
- Google Scholar
23. Fillmore CM, Kuperwasser C. Human breast cancer cell lines contain stem-like cells that self-renew, give rise to phenotypically diverse progeny and survive chemotherapy. Breast Cancer Res. 10 (2) (2008) R25. pmid:18366788
- View Article
- PubMed/NCBI
- Google Scholar
24. Seneta E. Non-negative Matrices and Markov Chains. 2nd Edition. Springer, New York, 1981, p. 22.
25. Karlin S. Taylor HM, A First Course in Stochastic Processes. 2nd Edition. Academic Press, New York, 1975, pp. 547–551.
26. Zhou D, Wang Y, Wu B. A multi-phenotypic cancer model with cell plasticity. J. Theor. Biol. 357 (2014) 35–45. pmid:24819463
- View Article
- PubMed/NCBI
- Google Scholar
27. Athreya KB. Some results on multitype continuous time Markov branching processes. Ann. Math. Stat. 39(2) (1968) 347–357.
- View Article
- Google Scholar
28. Smythe RT. Central limit theorems for urn models. Stoch. Process. Appl. 65 (1) (1996) 115–137.
- View Article
- Google Scholar
29. Norris JR. Markov Chains. Cambridge University Press, Cambridge, 1997, pp. 11, 122.
30. Durrett R. Probability: Theory and Examples. 4th Edition. Cambridge University Press, Cambridge, 2010, pp. 58–59.
31. Hawkins ED, Turner ML, Dowling MR, van Gend C, Hodgkin PD. A model of immune regulation as a consequence of randomized lymphocyte division and death times. Proc. Natl. Acad. Sci. 104 (2007) 5032–5037. pmid:17360353
- View Article
- PubMed/NCBI
- Google Scholar
32. Golubev A. Exponentially modified gaussian (EMG) relevance to distributions related to cell proliferation and differentiation. J. Theor. Biol. 262 (2010) 257–266. pmid:19825376
- View Article
- PubMed/NCBI
- Google Scholar
33. Mode CJ. Multitype Branching Processes: Theory and Applications. Vol. 34, American Elsevier Pub. Co., New York, 1971, pp. 138–145.
34. Cox DR, Miller HD. The Theory of Stochastic Processes. Methuen & Co. Ltd., London, 1965, pp. 257–262.
35. Ozbudak EM, Thattai M, Lim HN, Shraiman BI, van Oudenaarden A. Multistability in the lactose utilization network of Escherichia coli. Nature 427 (6976) (2004) 737–740. pmid:14973486
- View Article
- PubMed/NCBI
- Google Scholar
36. Fidler IJ, Kripke ML. Metastasis results from preexisting variant cells within a malignant tumor. Science 197 (4306) (1977) 893–895. pmid:887927
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Altschuler SJ, Wu LF. Cellular heterogeneity: do differences make a difference? Cell 141(4) (2010) 559–563. pmid:20478246
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Kussell E, Leibler S. Phenotypic diversity, population growth, and information in fluctuating environments. Science 309 (5743) (2005) 2075–2078. pmid:16123265
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. dos Santos RV, da Silva LM. The noise and the KISS in the cancer stem cells niche. J. Theor. Biol 335 (2013) 79–87. pmid:23827200
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Gupta PB, Fillmore CM, Jiang G, Shapira SD, Tao K, Kuperwasser C, et al. Stochastic state transitions give rise to phenotypic equilibrium in populations of cancer cells. Cell 146 (4) (2011) 633–644. pmid:21854987
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Zhou D, Wu D, Li Z, Qian MP, Zhang MQ. Population dynamics of cancer cells with cell state conversions. Quant. Biol. 1 (3) (2013) 201–208. pmid:26085954
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Yang G, Quan Y, Wang W, Fu Q, Wu J, Mei T, et al. Dynamic equilibrium between cancer stem cells and non-stem cancer cells in human SW620 and MCF-7 cancer cell populations. Br. J. Cancer 106 (9) (2012) 1512–1519. pmid:22472879
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Jagers P. The proportions of individuals of different kinds in two-type populations. A branching process problem arising in biology. J. Appl. Probab 6 (2) (1969) 249–260.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref8] 8. Kimmel M, Axelrod DE. Branching Processes in Biology. Springer, New York, 2002, pp. 103–140.

[ref9] 9. Yakovlev AY, Mayer-Proschel M, Noble M. A stochastic model of brain cell differentiation in tissue culture. J. Math. Biol. 37 (1) (1998) 49–60. pmid:9710974
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref10] 10. Yakovlev AY, Yanev NM. Relative frequencies in multitype branching processes. Ann. Appl. Probab. 19 (1) (2009) 1–14.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref11] 11. Yakovlev AY, Yanev NM. Limiting distributions for multitype branching processes. Stoch. Anal. Appl. 28 (6) (2010) 1040–1060. pmid:24031155
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref12] 12. Murray JD. Mathematical Biology I: An Introduction. Springer-Verlag, Berlin, Heidelberg, 2001, pp. 1–6, 101–105.

[ref13] 13. Clayton E, Doupé PD, Klein AM, Winton DJ, Simons BD, Jones PH. A single type of progenitor cell maintains normal epidermis. Nature 446 (7132) (2007) 185–189. pmid:17330052
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref14] 14. Hanahan D, Weinberg RA. The hallmarks of cancer. Cell, 100(1), (2000) 57–70. pmid:10647931
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref15] 15. Bozic I, Antal T, Ohtsuki H, Carter H, Kim D, Chen S, et al. Accumulation of driver and passenger mutations during tumor progression. Proc. Natl. Acad. Sci. 2010 Oct 26;107(43):18545–50. pmid:20876136
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref16] 16. Kimmel M, Corey S. Stochastic hypothesis of transition from inborn neutropenia to AML: interactions of cell population dynamics and population genetics. Front. Oncol. 2013 Apr 29;3(89.10):3389.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref17] 17. McDonald TO, Kimmel M. A multitype infinite-allele branching process with applications to cancer evolution. J. Appl. Probab. 2015;52(3):864–76.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref18] 18. Kesten H, Stigum BP. Limit theorems for decomposable multi-dimensional Galton-Watson processes. J. Math. Anal. Appl. 17 (2) (1967) 309–338.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref19] 19. Athreya KB, Ney PE. Branching Processes. Springer-Verlag, Berlin, 1972, pp. 199–206.

[ref20] 20. Janson S. Functional limit theorems for multitype branching processes and generalized Pólya urns. Stoch. Process. Appl. 110 (2) (2004) 177–245.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref21] 21. Zwietering MH, Jongenburger I, Rombouts FM, van’t Riet K. Modeling of the bacterial growth curve. Appl. Environ. Microbiol. 56 (6) (1990) 1875–1881. pmid:16348228
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref22] 22. Patrawala L, Calhoun T, Schneider-Broussard R, Zhou J, Claypool K, Tang DG. Side population is enriched in tumorigenic, stem-like cancer cells, whereas ABCG2⁺ and ABCG2⁻ cancer cells are similarly tumorigenic. Cancer Res. 65 (14) (2005) 6207–6219. pmid:16024622
View Article
PubMed/NCBI
Google Scholar

[71] View Article

[72] PubMed/NCBI

[73] Google Scholar

[ref23] 23. Fillmore CM, Kuperwasser C. Human breast cancer cell lines contain stem-like cells that self-renew, give rise to phenotypically diverse progeny and survive chemotherapy. Breast Cancer Res. 10 (2) (2008) R25. pmid:18366788
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref24] 24. Seneta E. Non-negative Matrices and Markov Chains. 2nd Edition. Springer, New York, 1981, p. 22.

[ref25] 25. Karlin S. Taylor HM, A First Course in Stochastic Processes. 2nd Edition. Academic Press, New York, 1975, pp. 547–551.

[ref26] 26. Zhou D, Wang Y, Wu B. A multi-phenotypic cancer model with cell plasticity. J. Theor. Biol. 357 (2014) 35–45. pmid:24819463
View Article
PubMed/NCBI
Google Scholar

[81] View Article

[82] PubMed/NCBI

[83] Google Scholar

[ref27] 27. Athreya KB. Some results on multitype continuous time Markov branching processes. Ann. Math. Stat. 39(2) (1968) 347–357.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref28] 28. Smythe RT. Central limit theorems for urn models. Stoch. Process. Appl. 65 (1) (1996) 115–137.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref29] 29. Norris JR. Markov Chains. Cambridge University Press, Cambridge, 1997, pp. 11, 122.

[ref30] 30. Durrett R. Probability: Theory and Examples. 4th Edition. Cambridge University Press, Cambridge, 2010, pp. 58–59.

[ref31] 31. Hawkins ED, Turner ML, Dowling MR, van Gend C, Hodgkin PD. A model of immune regulation as a consequence of randomized lymphocyte division and death times. Proc. Natl. Acad. Sci. 104 (2007) 5032–5037. pmid:17360353
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref32] 32. Golubev A. Exponentially modified gaussian (EMG) relevance to distributions related to cell proliferation and differentiation. J. Theor. Biol. 262 (2010) 257–266. pmid:19825376
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref33] 33. Mode CJ. Multitype Branching Processes: Theory and Applications. Vol. 34, American Elsevier Pub. Co., New York, 1971, pp. 138–145.

[ref34] 34. Cox DR, Miller HD. The Theory of Stochastic Processes. Methuen & Co. Ltd., London, 1965, pp. 257–262.

[ref35] 35. Ozbudak EM, Thattai M, Lim HN, Shraiman BI, van Oudenaarden A. Multistability in the lactose utilization network of Escherichia coli. Nature 427 (6976) (2004) 737–740. pmid:14973486
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref36] 36. Fidler IJ, Kripke ML. Metastasis results from preexisting variant cells within a malignant tumor. Science 197 (4306) (1977) 893–895. pmid:887927
View Article
PubMed/NCBI
Google Scholar

[107] View Article

[108] PubMed/NCBI

[109] Google Scholar

Figures