Next Article in Journal
Quantifying the Effects of Global Warming on the Plankton Population: An Asymmetric Multifactor Mathematical Model-Based Approach
Previous Article in Journal
Environmental Sound Classification Framework Based on L-mHP Features and SE-ResNet50 Network Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Two-Dimensional Moran Model

by
Asma Althagafi
*,† and
Mohamed Abdelkader
Department of Statistics and Operations Research, King Saud University, Riyadh 11451, Saudi Arabia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Symmetry 2023, 15(5), 1046; https://doi.org/10.3390/sym15051046
Submission received: 8 April 2023 / Revised: 29 April 2023 / Accepted: 3 May 2023 / Published: 9 May 2023
(This article belongs to the Section Mathematics)

Abstract

:
In this paper, we consider a two-dimensional Moran model of random walks consisting of two queues evolving in parallel which can (at each unit of time) either increase by one or reset to 0. We analyze the joint law of their final altitude and prove that the asymptotic distribution of each component is a shifted geometric distribution and we analyze the maximum of these two components, also giving closed forms for the mean and variance.

1. Introduction

The concept of a random walk is very important and is used in many fields, such as in biology (random walk is applied to problems related to genetic diseases such as diabetes, cancer, or the movement of an animal in search of food), physics (the movement of a particle in a gas or liquid), chemistry, (a chemical reagent in a solution), economics (the stock market price), the maximum age of a machine in mechanics, the exploration of a territory by a robot in artificial intelligence, etc.
In this article, we consider discrete random walks. Many scientific studies used probability generating functions to obtain the mean, the variance, and the limiting distribution of some statistics such as the final altitude and the height of discrete random walks, e.g., via the kernel method and singularity analysis (see [1,2]), via some links with continued fractions [3], or via more probabilistic approaches (see [4], which also illustrates links with random trees).
For example in dimension one, if one focuses on articles which play a role in our analysis, Banderier and Flajolet have shown in [1] that the final altitude of a random meander of length n converges asymptotically to a Rayleigh distribution (drift δ = 0 ) and normal distribution ( δ > 0 ). Furthermore, in [5], Banderier and Nicodème have analyzed the height of discrete bridges/meanders/excursions for bounded discrete walks. Similar extremal parameters were also studied for trees in [4,6], and by Gafni [7] for the asymptotic distribution of the length of the longest run of consecutive equal parts. Finally, some models of walks with reset were considered by Banderier and Wallner, who treated some statistics for lattice paths with catastrophes such as the number of catastrophes of a random excursion of size n, which converges to Gaussian, Rayleigh, or discrete distribution depending on the drift (see Theorem 4.12 in [8]).
Then for higher dimensions, still in link with our model, we can mention the stochastic model for solitons, investigated by Itoh, Mahmoud, and Takahashi in [9], where they proved that the wavelength converges to a convolution of geometric random variables. Itoh and Mahmoud [10] also proved that the distribution for the lifetime of an individual converges to a (shifted) geometric distribution in law. Other articles related to the Moran process (in biology and population genetics) are involving similar higher-dimensional walk models; see e.g., [11,12,13].
In this paper, we study the two-dimensional Moran walk TDMW defined as follows: At time 0, the process starts from the origin. After one unit of time, we have four possibilities: (a) the process stops at (0,0) with probability p 0 ; (b) the first component shifts by one positive unit, but the second component returns to zero with probability p 1 ; (c) the second component shifts by one positive unit, but the first component returns to zero with probability p 2 ; or (d) the two components shift by one positive unit with probability p. We now provide four examples of the evolution of the TDMW process until time n = 8 :
( 0 , 0 ) p 1 ( 0 , 1 ) p ( 1 , 2 ) p 1 ( 0 , 3 ) p 2 ( 1 , 0 ) p 2 ( 2 , 0 ) p ( 3 , 1 ) p ( 4 , 2 ) p 1 ( 0 , 3 )
( 0 , 0 ) p 0 ( 0 , 0 ) p ( 1 , 1 ) p 1 ( 0 , 1 ) p 2 ( 1 , 0 ) p 2 ( 2 , 0 ) p ( 3 , 1 ) p 2 ( 4 , 0 ) p ( 5 , 1 )
( 0 , 0 ) p ( 1 , 1 ) p ( 2 , 2 ) p ( 3 , 3 ) p 2 ( 4 , 0 ) p 2 ( 5 , 0 ) p ( 6 , 1 ) p 2 ( 7 , 0 ) p 2 ( 8 , 0 )
( 0 , 0 ) p 2 ( 1 , 0 ) p ( 2 , 1 ) p 1 ( 0 , 2 ) p 1 ( 0 , 3 ) p 1 ( 0 , 4 ) p ( 1 , 5 ) p ( 2 , 6 ) p ( 3 , 7 ) .
In these four examples, the final altitude of components one and two are 0, 5, 8, 3, and 3, 1, 0, 7, respectively. The final altitude of the maximum is 3, 5, 8, and 7 and the height of the four walks is 4, 5, 8, and 7, respectively.
The main result of this work is the discovery of the probability generating function of the maximum of the two final altitudes, and the computation of the corresponding mean and variance. Furthermore, we prove that the asymptotic distribution of the two random walks converges to a shifted geometric distribution, and we give a closed form for the corresponding mean and variance.
This paper is organized as follows. In Section 2, we define some statistics, introduce our model in detail, and use some examples for different lengths of Moran walk according to the probability distribution. In Section 3, we state our main results on the probability generating function of the final altitude and their mean and variance. In Section 4, we use R software to simulate Moran walks of lengths: 10, 100, 500, and 1000, for different values of the probabilities p 0 , p 1 , p 2 , and p. In Section 5, we establish recursive equations for the sequence of multivariate polynomials and the probability generating function associated with the model. In Section 6, we prove that the limiting distributions of two Moran random walks Y n ( 1 ) and Y n ( 2 ) converge to shifted geometric distributions and find their means and variances. In Section 7, we give the proofs of our theorems. In Section 8, we present another method for obtaining the distribution of the final altitude Y n . Finally, we present some conclusions and perspectives.

2. Definitions and Presentation of the Model

In this section, we present some definitions concerning the statistical properties of discrete processes, such as the mean, the variance, and the probability generating function. Finally, we present our model and define some statistics, such as the final altitude and height.
Let us consider a stochastic process with a discrete time and finite state space without assuming that they are Markov chains. For brevity, we call such a process a discrete stochastic process. We start with the following classical definitions (see [14]).

2.1. Definitions

1.
Let U = ( U n , n = 0 , 1 , 2 , ) be a discrete stochastic process starting from time n = 0 , with U 0 being the initial state.
2.
The state space of the process is assumed to be finite and is denoted by S. Each U n takes its values in S.
3.
Let ( U , V ) = ( ( U n , V n ) , n = 0 , 1 , 2 , ) be a discrete stochastic process with two dimensions starting from time n = 0 , with ( U 0 , V 0 ) = ( 0 , 0 ) being the initial state.
4.
The state space of the process is assumed to be finite and is denoted by S × S . Each ( U n , V n ) takes their values in S × S .
5.
The probability distribution of U n is denoted by P n , which is identified with a vector in [ 0 , 1 ] S . We call these distributions the one-dimensional distributions of the process.
6.
We define the probability that is associated with the discrete process ( U n , V n ) as follows: for all r , s N , P n ( r , s ) = P ( U n = r , V n = s ) .
7.
Let U be a discrete random variable with a distribution P ( U = r ) = p r , r N . Then, the expected value and variance of U, denoted by E ( U ) and V ar ( U ) , respectively, is defined as
E ( U ) = r 0 r p r a n d V ar ( U ) = E U E ( U ) 2 = E ( U 2 ) E ( U ) 2 .
8.
Let U be a discrete random variable with distribution P ( U = r ) = p r , r N ; then, the probability generating function, denoted by G, of the variable U is defined by:
G U ( u ) = E ( u U ) = r = 0 u r p r ,
for all u R such that | u | < 1 .
Probability generating functions constitute an elegant tool due to their numerous uses. Mainly, the probability density functions associated with discrete stochastic processes and their moments can be obtained from the derivatives of the probability generating function. In fact, the mean and the variance of the process (the first and second centered moments of the distribution of U) are related to the derivatives of the probability generating function at u = 1 . More precisely, the next folklore lemma explains this link.
Lemma 1 
([2]). Let G U be the probability generating function for the discrete random process random U; then, for k N , the kth factorial moment of U is given by
k G U ( u ) k u | u = 1 = E U ( U 1 ) ( U 2 ) ( U k + 1 ) .
In addition, if the limit of G U ( u ) u and 2 G U ( u ) 2 u exists at u = 1 , then we have the following two important equations, which are related to the mean and variance of U and G U ( u ) :
E U = G U ( u ) u | u = 1 and V ar U = 2 G U ( u ) 2 u | u = 1 + E U E U 2 .
Consider a two-dimensional Moran model with two components Y n ( 1 ) , Y n ( 2 ) ; it starts at time 0 with two components having an age 0 (i.e., ( Y 0 ( 1 ) , Y 0 ( 2 ) ) = ( 0 , 0 ) ), and it is parameterized by four probabilities, denoted by p 0 , p 1 , p 2 , and p. The Moran model with two dimensions is given by the following system:
( Y n + 1 ( 1 ) , Y n + 1 ( 2 ) ) = ( Y n ( 1 ) + 1 , Y n ( 2 ) + 1 ) , with probability p , ( 0 , Y n ( 2 ) + 1 ) , with probability p 1 , ( Y n ( 1 ) + 1 , 0 ) , with probability p 2 , ( 0 , 0 ) , with probability p 0 ,
where p 0 , p 1 , p 2 , and p are non-negative given values such that p + p 0 + p 1 + p 2 = 1 .
Additionally, we define the final altitude Y n and height H n statistics by:
Y n = max ( Y n ( 1 ) , Y n ( 2 ) ) , H n = max ( Y 0 , Y 1 , Y 2 , , Y n ) .
Applying the four first definitions, we can consider Y n and H n as a random walk (discrete stochastic process with one dimension) in the state space S = { 0 , , n } , which is started from the initial state Y 0 = 0 and H 0 = 0 , respectively. However, ( Y n ( 1 ) , Y n ( 2 ) ) are considered a stochastic process with two dimensions in the state space S 2 = { 0 , , n } 2 , which is started from the initial state ( Y 0 ( 1 ) , Y 0 ( 2 ) ) = ( 0 , 0 ) . Moreover, we can consider Y n ( 1 ) , Y n ( 2 ) , Y n , and H n as a random walk in N 2 , where the x-axis represents the time n (length of Moran walk) and the y-axis represents the age of each component Y n ( 1 ) and Y n ( 2 ) or the maximum age or the height. In the next section, we denote by:
E ( Y n ( i ) ) , V ar ( Y n ( i ) ) , and E ( Y n ) for the means and variances of Y n ( i ) and Y n for i = 1 , 2 and n 0 , respectively.

2.2. Example 1: Two-Dimensional Moran Walk of Length 5 for Initial Probabilities p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3

Using R software, Figure 1 shows that the final altitudes of the two components are equal to two and one, respectively; thus, the final maximum age of the two components is equal to two. In addition, the age height is fixed at two when p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 for the Moran walk of length 5.

2.3. Example 2: Two-Dimensional Moran Walk of Length 10 with Initial Probabilities p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3

Figure 2 shows that the final altitudes of the two components are equal to one and two, respectively; thus, the final maximum age of the two components is equal to two. In addition, the age height is 4 at time 7, when p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 for the Moran walk of length 10.

3. Main Result

In this section, we study the final altitude of the maximum age, Y n = max ( Y n 1 , Y n 2 ) , between the two components at time n. Precisely, we want to find the probability generating function of Y n , denoted by Ψ n ( x ) , and determine the mean and variance of the Moran walk Y n with length n. For this reason, we define the probability generating function Ψ n ( x ) as
x R , Ψ n ( x ) = G Y n ( x ) = k = 0 n x k P ( Y n = k ) = k = 0 n x k P max ( Y n ( 1 ) , Y n ( 2 ) ) = k .
The first result concerns the probability generating function Ψ n ( x ) , which is presented in the next theorem.
Theorem 1. 
The probability generating function, Ψ n ( x ) , of the final altitude between the two components Y n = max ( Y n ( 1 ) , Y n ( 2 ) ) is given by the following: for all n N * , x R ,
Ψ n ( x ) = 2 p 0 p 1 + p 2 p x ( p x ) n 1 p x + r 1 n + r 2 n x n + ( 1 r 1 ) r 1 x ( r 1 x ) n 1 r 1 x + ( 1 r 2 ) r 2 x ( r 2 x ) n 1 r 2 x p 0 1 ( p x ) n 1 p x ( p x ) n ,
such that Ψ 0 ( x ) = 1 , where r 1 = p 2 + p , r 2 = p 1 + p , | r 1 x 1 | < 1 , | r 2 x 2 | < 1 and | p x | < 1 .
In the next theorem, using the probability generating function and their derivatives of order one and two, respectively, we can calculate the mean and variance of Y n . The second main result is presented in the following theorem.
Theorem 2. 
The mean and variance of the final altitude Y n are given by
E Y n = r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p ,
and
V ar ( Y n ) = r 1 ( 1 r 1 ) 2 1 r 1 n r 1 n + 1 + ( 1 + 2 n ) ( 1 r 1 ) + r 2 ( 1 r 2 ) 2 1 r 2 n r 2 n + 1 + ( 1 + 2 n ) ( 1 r 2 ) p ( 1 p ) 2 3 p + p n p n + 1 2 n ( 1 p ) 4 p + 2 r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p 2 r 1 r 1 n + 1 1 r 1 ( r 2 r 2 n + 1 1 r 2 ) ,
where r 1 = p 2 + p and r 2 = p 1 + p .
Remark 1. 
Because Y n = max ( Y n ( 1 ) , Y 2 ( 2 ) ) , then Y n Y n ( i ) , for i = 1 , 2 . So, we deduce from Theorem 2 and Corollary 1 that E Y n E Y n ( i ) .

4. Simulation

In this section, using R software, we simulate the final altitude of each component Y ( 1 ) , Y ( 2 ) , the maximum Y, and the height H for two cases. In the first case, we take p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 in the system (6); in the second case, we take p 0 = p = 1 / 8 and p 1 = p 2 = 3 / 8 for four different lengths of the TDMW, which are equal to 10, 100, 500, and 1000, respectively.
Figure 3 shows that the final altitudes of Y ( 1 ) , Y ( 2 ) , Y, and H are equal to 0, 2, 2, and 3 for the Moran walk of length 10. When the length is increased from 10 to 100, we observe that the final altitudes Y ( 1 ) , Y ( 2 ) , Y, and H are equal to 2, 0, 2, and 8. It is clear that the height is bounded by eight. Then the maximum age of the two components over the full process goes from 6 (for length 10) to 8 (for length 100). This increase is very small, this can be explained by a logarithmic increase of the height over the time, similarly to what is proven in [15].
Figure 4 shows that if the length is equal to 500, the final altitudes of Y ( 1 ) , Y ( 2 ) , and Y are equal to at most 4, and the final altitude of the height H is equal to 10. If we increase the length of the walk from 500 to 1000, then we remark that the final altitudes of Y ( 1 ) , Y ( 2 ) , and Y are not changed and stay at a maximum of 4; however, the final altitude of the height H changes a little from 10 to 11. Then, the maximum height between the two lengths is equal to at most 11.
Figure 5 shows that if the length of the random walk is equal to 10, the final altitudes of Y ( 1 ) and Y ( 2 ) are equal to 3 and 0; then, the maximum altitude Y is 3, and the height H is 5. When the length of the walk is increased from 10 to 100, we remark that the final altitudes of Y ( 1 ) , Y ( 2 ) , and Y change a little and are equal to at most 2, and the final altitude of the height H is equal to 3. Then, the difference in the final altitude of the height is very small, ranging from three to five between the two lengths of the walk and being bounded by 5.
Figure 6 shows that the final altitude of Y ( 1 ) , Y ( 2 ) , and Y is equal to at most 4, and the final altitude of the height H is fixed at 10 for the Moran walk of length 500. When the length is increased from 500 to 1000, we observe that the final altitudes of Y ( 1 ) , Y ( 2 ) , and Y are moved a little and at most are equal to 6, and the final altitude of the height H is fixed at 11. It is clear that the height is bounded by eight by eleven. The maximum age of the two components is at most equal to 11.

5. Recursive Equations

In this section, we first calculate the probability of the process defined in (6), denoted by P n + 1 ( k 1 , k 2 ) at time n + 1 given that we know it at time n. Secondly, we study the sequence of multivariate polynomials, denoted by f n ( x 1 , x 2 ) , and find the recursive equation related to this sequence between two consecutive times n and n + 1 . Next, we investigate the probability generating function, denoted by F ( t , x 1 , x 2 ) , for our model. Finally, we prove that the limiting distribution of each component converges to the shifted geometric distribution and calculate their means and variances. Applying the fifth definition in Section 2, we then have the following definition: for all k 1 , k 2 { 0 , , n + 1 } ,
P n + 1 ( k 1 , k 2 ) = P Y n + 1 ( 1 ) = k 1 , Y n + 1 ( 2 ) = k 2 = P Y n + 1 ( 1 ) = k 1 , Y n + 1 ( 2 ) = k 2 | Y n ( 1 ) = l , Y n ( 2 ) = q ,
where l , q { 0 , , n } . We start this section with a technical lemma. It leads to the involvement of a recursive equation between the probability of our model for two consecutive times n and n + 1 , and it will be used in Theorem 3. It is based on the following conditional probability: consider two events A and B, where the probability of event A occurs given that event B occurred in the past, given by
P ( A | B ) = P ( A B ) P ( B ) ,
where the probability of B is strictly positive. It is presented by the following lemma:
Lemma 2. 
We have the following recursion of probabilities:
P n + 1 ( k 1 , k 2 ) = p P n ( k 1 1 , k 2 1 ) , if k 1 1 , k 2 1 , p 1 l = 0 n P n ( l , k 2 1 ) , if k 1 = 0 , k 2 1 , p 2 q = 0 n P n ( k 1 1 , q ) , if k 1 1 , k 2 = 0 , p 0 k 1 = 0 n k 2 = 0 n P n ( k 1 , k 2 ) , if k 1 = 0 , k 2 = 0 .
Proof. 
This proof is based on the utility of the conditional probability that the Moran walks Y ( 1 ) and Y ( 2 ) are aged k 1 and k 2 at time n + 1 given that they are aged l and q at time n; thus, For k 1 1 and k 2 1 , we have
P n + 1 ( k 1 , k 2 ) = P ( Y n + 1 ( 1 ) = k 1 , Y n + 1 ( 2 ) = k 2 ) = P ( Y n ( 1 ) + 1 = k 1 , Y n ( 2 ) + 1 = k 2 , Y n ( 1 ) = k 1 1 , Y n ( 2 ) = k 2 1 ) = P ( Y n ( 1 ) + 1 = k 1 , Y n ( 2 ) + 1 = k 2 | Y n ( 1 ) = k 1 1 , Y n ( 2 ) = k 2 1 ) × P ( Y n ( 1 ) = k 1 1 , Y n ( 2 ) = k 2 1 ) = p P n ( k 1 1 , k 2 1 ) .
For k 1 = 0 and k 2 1 , we have:
P n + 1 ( k 1 , k 2 ) = l = 0 n P ( Y n ( 1 ) + 1 = 0 , Y n ( 2 ) + 1 = k 2 , Y n ( 1 ) = l , Y n ( 2 ) = k 2 1 ) = p 1 l = 0 n P n ( l , k 2 1 ) .
For k 1 1 and k 2 = 0 , we have by symmetry:
P n + 1 ( k 1 , k 2 ) = q = 0 n P ( Y n ( 1 ) + 1 = k 1 , Y n ( 2 ) + 1 = 0 , Y n ( 1 ) = k 1 1 , Y n ( 2 ) = q ) = q = 0 n P ( Y n ( 1 ) + 1 = k 1 , Y n ( 2 ) + 1 = 0 | Y n ( 1 ) = k 1 1 , Y n ( 2 ) = q ) × P ( Y n ( 1 ) = k 1 1 , Y n ( 2 ) = q ) = p 2 q = 0 n P n ( k 1 1 , q ) .
For k 1 = k 2 = 0 , we have:
P n + 1 ( 0 , 0 ) = P ( Y n ( 1 ) + 1 = 0 , Y n ( 2 ) + 1 = 0 , Y n ( 1 ) = 0 , Y n ( 2 ) = 0 ) + l = 1 n P ( Y n ( 1 ) + 1 = 0 , Y n ( 2 ) + 1 = 0 , Y n ( 1 ) = l , Y n ( 2 ) = 0 ) + q = 1 n P ( Y n ( 1 ) + 1 = 0 , Y n ( 2 ) + 1 = 0 , Y n ( 1 ) = 0 , Y n ( 2 ) = q ) + l , q = 1 n P ( Y n ( 1 ) + 1 = 0 , Y n ( 2 ) + 1 = 0 , Y n ( 1 ) = l , Y n ( 2 ) = q ) = p 0 P n ( k 1 , k 2 ) + q = 1 n P n ( 0 , q ) + l = 1 n P n ( l , 0 ) + l , q = 1 n P n ( l , q ) = p 0 k 1 = 0 n k 2 = 0 n P n ( k 1 , k 2 ) = p 0 .
Remark 2. 
We offer some comments on the different cases of the age of the two components Y n + 1 ( 1 ) and Y n + 1 ( 2 ) :
1. 
In the first case, if the two final ages of the two components are equal to at least one, then the probability that the two components are aged k 1 and k 2 at time n + 1 is equal to p multiplied by the probability of two components that are aged k 1 1 and k 2 1 at time n.
2. 
In the second case, if the ages of the two components one and two are equal to zero and at least one at time n + 1 , then the probability that the two components are aged k 1 = 0 and k 2 at time n + 1 is equal to p 1 multiplied by the sum of probabilities such that the two components are aged l, which is started from 0 to n and k 2 1 at time n, respectively.
3. 
In the third case, if the ages of two components two and one are equal to zero and at least one at time n + 1 , then the probability that the two components are aged k 2 = 0 and k 1 at time n + 1 is equal to p 2 multiplied by the sum of probabilities such that the two components are aged q, which is started from 0 to n and k 1 1 at time n, respectively.
4. 
In the fourth case, if the ages of two components one and two are equal to zero, then the probability that the two components are aged k 2 = 0 and k 1 = 0 at time n + 1 is equal to p 0 multiplied by the double sum of the probabilities such that the two components are aged l and q starting from zero to n at time n, respectively.
In the next section, we define the sequence of multivariate polynomials f n ( x 1 , x 2 ) (for n N ) by the fact that the coefficient of x 1 k 1 x 2 k 2 in f n ( x 1 , x 2 ) is the probability that, at time n, the i-th individual has age k i (for i = 1 , 2 ); that is,
f n ( x 1 , x 2 ) = E x 1 Y n ( 1 ) x 2 Y n ( 2 ) = k 1 = 0 n k 2 = 0 n x 1 k 1 x 2 k 2 P n ( k 1 , k 2 ) .
Starting from Equation (10) and using the preceding lemma, we can find a recursive equation that is related between f n + 1 ( x 1 , x 2 ) , f n ( x 1 , 1 ) , f n ( 1 , x 2 ) , and f n ( x 1 , x 2 ) ; it is introduced in the next proposition.
Proposition 1. 
For all x 1 , x 2 R , the sequence of multivariate polynomials f n ( x 1 , x 2 ) satisfies the recursive equation:
f n + 1 ( x 1 , x 2 ) = p 0 f n ( 1 , 1 ) + p 1 x 2 f n ( 1 , x 2 ) + p 2 x 1 f n ( x 1 , 1 ) + p x 1 x 2 f n ( x 1 , x 2 ) ,
with initial condition f 0 ( x 1 , x 2 ) = P 0 ( 0 , 0 ) = 1 .
Proof. 
Using Equation (10) and Lemma 2, we have
f n + 1 ( x 1 , x 2 ) = P n + 1 ( 0 , 0 ) + k 1 = 1 n + 1 k 2 = 1 n + 1 x 1 k 1 x 2 k 2 P n + 1 ( k 1 , k 2 ) A + k 1 = 1 n + 1 x 1 k 1 P n + 1 ( k 1 , 0 ) B + k 2 = 1 n + 1 x 2 k 2 P n + 1 ( 0 , k 2 ) C .
By a simple calculation, we have
A = p k 1 = 1 n + 1 k 2 = 1 n + 1 x 1 k 1 x 2 k 2 P n ( k 1 1 , k 2 1 ) = p x 1 x 2 k 1 = 0 n k 2 = 0 n x 1 k 1 x 2 k 2 P n ( k 1 , k 2 ) = p x 1 x 2 f n ( x 1 , x 2 ) ,
C = k 2 = 1 n + 1 x 2 k 2 l = 0 n P n ( l , k 2 1 ) = p 1 x 2 k 2 = 0 n x 2 k 2 l = 0 n P n ( l , k 2 ) = p 1 x 2 f n ( 1 , x 2 ) .
By symmetry, we have
B = k 1 = 1 n + 1 x 1 k 1 p 2 q = 0 n P n ( k 1 1 , q ) = p 2 x 1 k 1 = 0 n x 1 k 1 q = 0 n P n ( k 1 , q ) = p 2 x 1 f n ( x 1 , 1 ) ,
and
P n + 1 ( 0 , 0 ) = p 0 k 1 = 0 n k 2 = 0 n P ( Y n ( 1 ) = k 1 , Y n ( 2 ) = k 2 ) = p 0 k 1 = 0 n k 2 = 0 n 1 1 k 1 1 2 k 2 P ( Y n ( 1 ) = k 1 , Y n ( 2 ) = k 2 ) = p 0 f n ( 1 , 1 ) .
We conclude the proof through the combination of Equations (12)–(16). □

6. Distributions of Y n ( 1 ) and Y n ( 2 )

In this section, we study some statistical characteristics such as the probability generating function, the asymptotic distribution, the mean, and the variance of the final altitude of each component Y n ( 1 ) and Y n ( 2 ) at time n. Precisely, we start to find the probability generating function of each component as denoted by f n ( x 1 , 1 ) and f n ( 1 , x 2 ) . In the next theorem, we prove that the final altitude of each component converges to the shifted geometric distribution in the law. This partially generalizes the results of Itoh and Mahmoud [10] (which was restricted to the uniform case, i.e., p 1 = p 2 , but in any dimension). Finally, we finish this section by calculating the mean and variance of each component. The following theorem that is presented is the probability generating function and asymptotic limit of Y n ( 1 ) and Y n ( 2 ) .
Theorem 3. 
The distributions of Y n ( 1 ) and Y n ( 2 ) follow the same discrete distribution, with probability generating functions being given by: ( x 1 , x 2 ) R 2
f 0 ( x 1 , x 2 ) = 1 , f n ( x 1 , 1 ) = r 1 x 1 n + ( 1 r 1 ) 1 r 1 x 1 n 1 r 1 x 1 , f n ( 1 , x 2 ) = r 2 x 2 n + ( 1 r 2 ) 1 r 2 x 2 n 1 r 2 x 2 .
Asymptotically, Y n ( 1 ) and Y n ( 2 ) converge, in law, to some shifted geometric (Geo) random variables:
Y n ( 1 ) G e o ( 1 r 1 ) 1 a n d Y n ( 2 ) G e o ( 1 r 2 ) 1 ,
where r 1 = p 2 + p , r 2 = p 1 + p , | r 1 x 1 | < 1 and | r 2 x 2 | < 1 .
Proof. 
Using the recursive equation defined in (11) for x 2 = 1 , we get
f n + 1 ( x 1 , 1 ) = ( p 0 + p 1 ) f n ( 1 , 1 ) + ( p + p 2 ) x 1 f n ( x 1 , 1 ) = ( 1 r 1 ) k = 0 n x 1 r 1 k f n k ( 1 , 1 ) + r 1 x 1 n + 1 = r 1 x 1 n + 1 + ( 1 r 1 ) 1 r 1 x 1 n + 1 1 r 1 x 1 .
By symmetry, we have
f n + 1 ( 1 , x 2 ) = r 2 x 2 n + 1 + ( 1 r 2 ) 1 r 2 x 2 n + 1 1 r 2 x 2 .
Passing to the limit of f n + 1 ( x 1 , 1 ) , we then have
lim n f n ( x 1 , 1 ) = lim n ( r 1 x 1 ) n + ( 1 r 1 ) 1 ( r 1 x 1 ) n 1 r 1 x 1 = 1 r 1 1 r 1 x 1 .
Suppose that Z follows the geometric distribution with parameter 1 r 1 . Then, its probability is given by
P ( Z = k ) = ( 1 r 1 ) r 1 k 1 , k 0
and the probability generating function of Z is equal to
G Z ( x 1 ) = ( 1 r 1 ) k = 0 r 1 k x 1 k = 1 r 1 1 r 1 x 1 , i f | r 1 x 1 | < 1 .
The relations (17) and (18) prove that Y ( 1 ) converges to g e o ( 1 r 1 ) 1 in the distribution. □
From Theorem 3, we directly get the following corollary concerning the explicit expressions of the means and variances of Y n ( 1 ) and Y n ( 2 ) , which depend on probability generating functions.
Corollary 1. 
The mean and variance of the final altitude Y n of each component of Moran walk ( Y n ( i ) ) i = 1 , 2 are given by:
E Y n ( i ) = r i r i n + 1 1 r i a n d V ar Y n ( i ) = r i ( 1 r i ) 2 1 r i n r i n + 1 + ( 1 + 2 n ) ( 1 r i ) ,
where r 1 = p 2 + p , r 2 = p 1 + p , | r 1 x 1 | < 1 and | r 2 x 2 | < 1 .
Proof. 
Calculating the first derivative of f n ( x 1 , 1 ) with respect to x 1 ,
f n ( x 1 , 1 ) x 1 = x 1 n r 1 n x 1 n 1 + ( 1 r 1 ) n r 1 n x 1 n 1 1 r 1 x 1 + r 1 1 r 1 n x 1 n 1 r 1 x 1 2 ,
evaluating at x 1 = 1 ,
f n ( x 1 , 1 ) x 1 | x 1 = 1 = n r 1 n + ( 1 r 1 ) n r 1 n ( 1 r 1 ) + r 1 ( 1 r 1 n ) ( 1 r 1 ) 2 = r 1 r 1 n + 1 1 r 1 ,
and using Equation (5) we get
E Y n ( 1 ) = f n ( x 1 , 1 ) x 1 | x 1 = 1 = r 1 r 1 n + 1 1 r 1 .
By symmetry, the mean of Y n ( 2 ) is given by
E Y n ( 2 ) = f n ( 1 , x 2 ) x 2 | x 2 = 1 = r 2 r 2 n + 1 1 r 2 .
Computing the second derivative of f n ( x 1 , 1 ) with respect to x 1 , one has
2 f n ( x 1 , 1 ) 2 x 1 = x 1 n r 1 n x 1 n 1 + ( 1 r 1 ) n r 1 n x 1 n 1 1 r 1 x 1 + r 1 1 r 1 n x 1 n 1 r 1 x 1 2 = n ( n 1 ) r 1 n x 1 n 2 n ( 1 r 1 ) r 1 n ( n 1 ) x 1 n 2 1 r 1 x 1 + r 1 x 1 n 1 ( 1 r 1 x 1 ) 2 + r 1 ( 1 r 1 ) n r 1 n x 1 n 1 ( 1 r 1 x 1 ) 2 + 2 r 1 1 r 1 n x 1 n ( 1 r 1 x 1 ) 3 ,
and evaluating at x 1 = 1 , one has
2 f n ( x 1 , 1 ) 2 x 1 | x 1 = 1 = n ( n 1 ) r 1 n n ( 1 r 1 ) r 1 n ( n 1 ) 1 r 1 + r 1 1 ( 1 r 1 ) 2 + r 1 ( 1 r 1 ) n r 1 n ( 1 r 1 ) 2 + 2 r 1 1 r 1 n ( 1 r 1 ) 3 = 2 n r 1 n + 1 ( 1 r 1 ) + 2 r 1 2 ( 1 r 1 n ) ( 1 r 1 ) 2 .
Using Equations (5), (19) and (20), the variance is then given by
V ar ( Y n ( 1 ) ) = 2 n r 1 n + 1 ( 1 r 1 ) + 2 r 1 2 ( 1 r 1 n ) ( 1 r 1 ) 2 + r 1 r 1 n + 1 ( 1 r 1 ) r 1 r 1 n + 1 1 r 1 2 = r 1 ( 1 r 1 ) 2 1 r 1 n r 1 n + 1 + ( 1 + 2 n ) ( 1 r 1 ) .
By symmetry, we have
V ar ( Y n ( 2 ) ) = r 2 ( 1 r 2 ) 2 1 r 2 n r 2 n + 1 + ( 1 + 2 n ) ( 1 r 2 ) .

7. Proofs

In this section, we prove Theorems 1 and 2 of our main result. For this purpose, consider the three sequences of polynomials, R n ( x ) , A n ( x ) , and S n ( x ) , which are defined by
R n ( x ) = k = 1 n l = 1 k x k P Y n ( 1 ) = k , Y n ( 2 ) = l , R 0 ( x ) = 0
A n ( x ) = k = 0 n l = k + 1 n x k P Y n ( 1 ) = k , Y n ( 2 ) = l , A 0 ( x ) = 0 ,
S n ( x ) = k = 0 n l = 0 k x k P Y n ( 1 ) = k , Y n ( 2 ) = l , S 0 ( x ) = 1 .
The following lemmas are technical and will be used for the proof of Theorem 1. The first lemma is used to calculate A n ( x ) when the final age of the first component can be equal to zero but the final age of the second component two is increased by one unit more than the first. It is presented by:
Lemma 3. 
The sequence A n ( x ) satisfies the recursive equation
A n ( x ) = p 1 + p x A n 1 ( x ) = p 1 1 ( p x ) n 1 p x ,
for all n N * and for all x R , such that | p x | < 1 .
Proof. 
We can write A n ( x ) by
A n ( x ) = l = 1 n P Y n ( 1 ) = 0 , Y n ( 2 ) = l + k = 1 n l = k + 1 n x k P Y n ( 1 ) = k , Y n ( 2 ) = l ,
and using Lemma 2, we obtain
A n ( x ) = p 1 f n 1 ( 1 , 1 ) + k = 1 n l = k + 1 n x k P Y n ( 1 ) = k , Y n ( 2 ) = l = p 1 + p x k = 0 n 1 l = k + 1 n 1 x k P Y n 1 ( 1 ) = k , Y n 1 ( 2 ) = l = p 1 + p x A n 1 ( x ) .
Iterating n times A n ( x ) , we get
A n ( x ) = p 1 r = 0 n 1 ( p x ) r = p 1 1 ( p x ) n 1 p x .
We use the previous lemma to find a recursive equation for the sequence R n ( x ) . It depends on the probability generating function of the walk Y n ( 1 ) and sequence A n . The second lemma in this section is used to calculate R n ( x ) when the final ages of the two components can be equal and can be started from ( 1 , 1 ) but the age of the first component does not pass the age of the first; it is presented by:
Lemma 4. 
The sequence R n ( x ) satisfies the recursive equation
R n ( x ) = p x f n 1 ( x , 1 ) p x A n 1 ( x ) ,
for all n N * and for all x R , such that | p x | < 1 .
Proof. 
Using Lemma 2, we get
R n ( x ) = p k = 1 n l = 1 k x k P Y n 1 ( 1 ) = k 1 , Y n 1 ( 2 ) = l 1 = p x f n 1 ( x , 1 ) p x k = 0 n 1 l = k + 1 n 1 x k P Y n 1 ( 1 ) = k , Y n 1 ( 2 ) = l = p x f n 1 ( x , 1 ) p x A n 1 ( x ) .
We will use Lemma 4 to find a recursive equation for the sequence S n ( x ) ; it depends on the probability generating function of the walk Y n ( 1 ) and sequence A n . The following lemma is used to calculate S n ( x ) when the final ages of the two components can be equal and can be started from ( 0 , 0 ) but the age of the second component does not pass the age of the first; it is presented in the following lemma.
Lemma 5. 
The sequence S n ( x ) satisfies the recursive equation:
S n ( x ) = p 0 + ( p + p 2 ) x f n 1 ( x , 1 ) p x A n 1 ( x ) ,
and S n ( x ) is given exactly by
S n ( x ) = p 0 p 1 p x ( p x ) n 1 p x + ( r 1 x ) n + ( 1 r 1 ) r 1 x ( r 1 x ) n 1 r 1 x .
for all x R such that | p x | < 1 and | r 1 x | < 1 , where r 1 is defined in Theorem 3.
Proof. 
Developing the sequence S n ( x )
S n ( x ) : = k = 0 n l = 0 k x k P Y n ( 1 ) = k , Y n ( 2 ) = l = P Y n ( 1 ) = 0 , Y n ( 2 ) = 0 + k = 1 n x k P Y n ( 1 ) = k , Y n ( 2 ) = 0 + l = 1 k x k P Y n ( 1 ) = 0 , Y n ( 2 ) = l + k = 1 n l = 1 k x k P Y n ( 1 ) = k , Y n ( 2 ) = l ,
and using Lemma 2 and Equation (10), we have
P Y n ( 1 ) = 0 , Y n ( 2 ) = 0 = P n ( 0 , 0 ) = p 0 ,
k = 1 n x k P Y n ( 1 ) = k , Y n ( 2 ) = 0 = p 2 x f n 1 ( x , 1 ) ,
and
l = 1 k = 0 x k P Y n ( 1 ) = 0 , Y n ( 2 ) = l = 0 .
Combining Equations (21) and (25)–(27), we get
S n ( x ) = p 0 + p 2 x f n 1 ( x , 1 ) + A n ( x ) = p 0 + p 2 + p x f n 1 ( x , 1 ) p 1 p x ( p x ) n 1 p x ,
and using the expression of f n 1 ( x , 1 ) defined in Theorem 3, we obtain the formula of S n ( x ) defined in (24). □
Remark 3. 
Consider the sequence L n ( x ) given by the following: for all n N ,
L 0 ( x ) = 1 , L n ( x ) : = k = 0 n l = 0 k x k P Y n ( 1 ) = l , Y n ( 2 ) = k ,
calculate the sum of x k P Y n ( 1 ) = l , Y n ( 2 ) = k under the condition that the final ages of the two components can be equal and can be started from ( 0 , 0 ) but the age of the first component does not pass the age of the second.
We observe that the two sequences L n ( x ) and S n ( x ) are symmetric; thus, the expression of L n ( x ) is directly given by the following: for all n N * , for all x R such that | p x | < 1 and | r 2 x | < 1 ,
L n ( x ) = p 0 + ( r 2 x ) n + ( 1 r 2 ) r 2 x ( r 2 x ) n 1 r 2 x p 2 p x ( p x ) n 1 p x ,
where r 2 is defined in Theorem 3.
Consider the sequence C n ( x ) given by: for all n N , for all x R ,
C n ( x ) = k = 0 n x k P Y n ( 1 ) = k , Y n ( 2 ) = k .
The following lemma is used to calculate C n ( x ) when the final ages of two components are equal and are started from ( 0 , 0 ) ; it is presented by the following:
Lemma 6. 
The sequence C n ( x ) satisfies the recursive equation: for all n N such that | p x | < 1 ,
C n ( x ) = p 0 + p x C n 1 ( x ) = p 0 1 ( p x ) n 1 p x + ( p x ) n .
Proof. 
Rewriting C n ( x )
C n ( x ) = k = 0 n x k P Y n ( 1 ) = k , Y n ( 2 ) = k = P n ( 0 , 0 ) + k = 1 n x k P Y n ( 1 ) = k , Y n ( 2 ) = k ,
and using Lemma 2 and Equation (10), we obtain
C n ( x ) = p 0 f n 1 ( 1 , 1 ) + p k = 1 n x k P Y n 1 ( 1 ) = k 1 , Y n 1 ( 2 ) = k 1 = p 0 + p x k = 0 n 1 x k P Y n 1 ( 1 ) = k , Y n 1 ( 2 ) = k = p 0 + p x C n 1 ( x ) .
Iterating n times C n ( x ) and getting n N , x R such that | p x | < 1 , one has
C n ( x ) = p 0 r = 0 n 1 ( p x ) r + ( p x ) n = p 0 1 ( p x ) n 1 p x + ( p x ) n .
The last lemma gives the probability that the two components have age k 0 .
Corollary 2. 
For all k N , we have
P ( Y n ( 1 ) = Y n ( 2 ) = k ) = p 0 p k , i f k { 0 , 1 , , n 1 } , p n , i f k = n .
Proof. 
Identifying Equations (29) and (30), we have
C n ( x ) = p 0 r = 0 n 1 ( p x ) r + ( p x ) n = k = 0 n 1 x k P Y n ( 1 ) = k , Y n ( 2 ) = k + P Y n ( 1 ) = Y n ( 2 ) = n x n ,
which finishes the proof. □
Here, we present the proof of Theorem 1, which is based on the four previous lemmas.

7.1. Proof of Theorem 1

We have
Ψ n ( x ) = k = 0 n x k P ( Y n = k ) = k = 0 n x k P max ( Y n ( 1 ) , Y n ( 2 ) ) = k = k = 0 n x k P ( Y n ( 1 ) = k , Y n ( 2 ) k ) + P ( Y n ( 1 ) < k , Y n ( 2 ) = k ) = k = 0 n l = 0 k x k P ( Y n ( 1 ) = k , Y ( 2 ) = l ) + k = 0 n l = 0 k x k P ( Y n ( 1 ) = l , Y n ( 2 ) = k ) k = 0 n x k P ( Y n ( 1 ) = k , Y n ( 2 ) = k ) = S n ( x ) + L n ( x ) C n ( x ) .
Using the results in Lemma 5, Remark 3, and Lemma 6, we get
Ψ n ( x ) = p 0 p 1 p x ( p x ) n 1 p x + ( r 1 x ) n + ( 1 r 1 ) r 1 x ( r 1 x ) n 1 r 1 x + p 0 p 2 p x ( p x ) n 1 p x + ( r 2 x ) n + ( 1 r 2 ) r 2 x ( r 2 x ) n 1 r 2 x p 0 1 ( p x ) n 1 p x ( p x ) n ,
Simplifying Ψ n ( x ) , we then get
Ψ n ( x ) = 2 p 0 p 1 + p 2 p x ( p x ) n 1 p x + r 1 n + r 2 n x n + ( 1 r 1 ) r 1 x ( r 1 x ) n 1 r 1 x + ( 1 r 2 ) r 2 x ( r 2 x ) n 1 r 2 x p 0 1 ( p x ) n 1 p x ( p x ) n .
In the next section, we prove Theorem 2; for this purpose, we will need to perform two derivatives of the probability generating function defined in the previous theorem and take x = 1 to obtain the mean and variance of Y n while using Equation (5).

7.2. Proof of Theorem 2

Calculating the first derivative of Ψ n ( x ) defined in Theorem 1,
Ψ n ( x ) x = x ( 2 p 0 p 1 + p 2 p x ( p x ) n 1 p x + r 1 n + r 2 n x n + ( 1 r 1 ) r 1 x ( r 1 x ) n 1 r 1 x + ( 1 r 2 ) r 2 x ( r 2 x ) n 1 r 2 x p 0 1 ( p x ) n 1 p x ( p x ) n ) .
Developing Ψ n ( x ) x , we have
Ψ n ( x ) x = ( p 1 + p 2 ) p n p ( p x ) n 1 1 p x + p ( p x ( p x ) n ) ( 1 p x ) 2 + n r 1 n + r 2 n x n 1 + ( 1 r 1 ) r 1 n r 1 ( r 1 x ) n 1 ( 1 r 1 x ) + r 1 ( r 1 x ( r 1 x ) n ) ( 1 r 1 x ) 2 + ( 1 r 2 ) r 2 n r 2 ( r 2 x ) n 1 ( 1 r 2 x ) + r 2 ( r 2 x ( r 2 x ) n ) ( 1 r 2 x ) 2 + p 0 p n ( p x ) n 1 ( 1 p x ) p ( 1 ( p x ) n ) ( 1 p x ) 2 n p ( p x ) n 1 ,
and evaluating at x = 1 ,
Ψ ( x ) x | x = 1 = ( p 1 + p 2 ) ( 1 p ) 2 ( p n p n ) ( 1 p ) + p ( p p n ) + r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 + p 0 ( 1 p ) 2 n p n ( 1 p ) p ( 1 p n ) n p n = r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 + I n ,
where I n is a sequence defined by
I n = p 0 ( 1 p ) 2 n p n ( 1 p ) p ( 1 p n ) p 1 + p 2 ( 1 p ) 2 ( p n p n ) ( 1 p ) + p ( p p n ) n p n .
By a simple calculation, we have
p 0 ( 1 p ) 2 n p n ( 1 p ) p ( 1 p n ) n p n = n p 0 p n ( 1 p ) p p 0 ( 1 p n ) ( 1 p ) 2 n p n ,
( p 1 + p 2 ) ( 1 p ) 2 ( p n p n ) ( 1 p ) + p ( p p n ) = p ( p 1 + p 2 ) ( 1 p ) + n ( p 1 + p 2 ) p n ( 1 p ) p ( p 1 + p 2 ) ( p p n ) ( 1 p ) 2 ,
and adding relation (33) and (34), we observe that
n p 0 p n ( 1 p ) + n ( p 1 + p 2 ) p n ( 1 p ) n p n = 0 ,
hence,
I n = p 0 ( 1 p ) 2 n p n ( 1 p ) p ( 1 p n ) p 1 + p 2 ( 1 p ) 2 ( p n p n ) ( 1 p ) + p ( p p n ) n p n = p p 0 ( 1 p n ) ( 1 p ) 2 p ( p 1 + p 2 ) ( p p n ) ( 1 p ) 2 p ( p 1 + p 2 ) ( 1 p ) .
Taking the 1 ( 1 p ) 2 common factor, one has
I n = 1 ( 1 p ) 2 ( p p 0 + p 0 p n + 1 p 2 ( p 1 + p 2 ) + ( p 1 + p 2 ) p n + 1 p ( p 1 + p 2 ) + p 2 ( p 1 + p 2 ) ) .
Simplifying I n , we then obtain
I n = 1 ( 1 p ) 2 p ( p 0 + p 1 + p 2 ) + p n + 1 ( p 0 + p 1 + p 2 ) = p + p n + 1 1 p .
Combining (32) and (35), we obtain
E Y n = Ψ n ( x ) x | x = 1 = r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p .
We define the following sequences of functions:
g n ( x ) = p n p ( p x ) n 1 1 p x + p ( p x ( p x ) n ) ( 1 p x ) 2 , if | p x | < 1 , h n , r 1 ( x ) = r 1 n r 1 ( r 1 x ) n 1 ( 1 r 1 x ) + r 1 ( r 1 x ( r 1 x ) n ) ( 1 r 1 x ) 2 + n r 1 n x n 1 , if | r 1 x | < 1 , k n ( x ) = p 0 p n ( p x ) n 1 ( 1 p x ) p ( 1 ( p x ) n ) ( 1 p x ) 2 n p ( p x ) n 1 , if | p x | < 1 .
We calculate the first derivative of the function g n ( x ) defined in (36), and we have
g n ( x ) x = n ( n 1 ) p n x n 2 ( 1 p x ) + p ( p n p ( p x ) n 1 ) ( 1 p x ) 2 + p ( p n p n x n 1 ) ( 1 p x ) 2 + 2 p 2 ( p x ( p x ) n ) ( 1 p x ) 3 ,
evaluating g n ( x ) x at x = 1 , we then get
g n ( x ) x | x = 1 = n ( n 1 ) p n 1 p + 2 p ( p n p n ) ( 1 p ) 2 + 2 p 2 ( p p n ) ( 1 p ) 3 .
The first derivative of h n , r 1 is given by
h n , r 1 ( x ) x = n ( n 1 ) r 1 n x n 2 ( 1 r 1 x ) + r 1 ( r 1 n r 1 n x n 1 ) ( 1 r 1 x ) 2 + r 1 ( r 1 n r 1 n x n 1 ) ( 1 r 1 x ) 2 + 2 r 1 2 ( r 1 x ( r 1 x ) n ) ( 1 r 1 x ) 3 + n ( n 1 ) r 1 n x n 2 ,
evaluating h n , r 1 ( x ) x at x = 1 ,
h n , r 1 ( x ) x | x = 1 = n ( n 1 ) r 1 n ( 1 r 1 ) + r 1 ( r 1 n r 1 n ) ( 1 r 1 ) 2 + r 1 ( r 1 n r 1 n ) ( 1 r 1 ) 2 + 2 r 1 2 ( r 1 r 1 n ) ( 1 r 1 ) 3 + n ( n 1 ) r 1 n ,
and we get
h n , r 1 ( x ) x | x = 1 = n ( n 1 ) r 1 n ( 1 r 1 ) + 2 r 1 ( r 1 n r 1 n ) ( 1 r 1 ) 2 + 2 r 1 2 ( r 1 r 1 n ) ( 1 r 1 ) 3 + n ( n 1 ) r 1 n .
By symmetry, we have
h n , r 2 ( x ) x | x = 1 = n ( n 1 ) r 2 n ( 1 r 2 ) + 2 r 2 ( r 2 n r 2 n ) ( 1 r 2 ) 2 + 2 r 2 2 ( r 2 r 2 n ) ( 1 r 2 ) 3 + n ( n 1 ) r 2 n .
Computing the first derivative of k n ( x ) , we have
k n ( x ) x | x = 1 = n ( n 1 ) p 0 p n x n 2 1 p x + n p 0 p n + 1 x n 1 ( 1 p x ) 2 + n p 0 p n + 1 x n 1 ( 1 p x ) 2 2 p 0 p 2 ( 1 ( p x ) n ) ( 1 p x ) 3 n ( n 1 ) p n x n 2 ,
taking x = 1 , we obtain
k n ( x ) x | x = 1 = n ( n 1 ) p 0 p n 1 p + 2 n p 0 p n + 1 ( 1 p ) 2 2 p 0 p 2 ( 1 p n ) ( 1 p ) 3 n ( n 1 ) p n .
Combining the results of (31) and (37)–(40) we obtain
2 Ψ n ( x ) 2 x | x = 1 = n ( n 1 ) ( p 1 + p 2 ) p n ( 1 p ) 2 p ( p 1 + p 2 ) ( p n p n ) ( 1 p ) 2 2 p 2 ( p 1 + p 2 ) ( p p n ) ( 1 p ) 3 + 2 r 1 ( r 1 n r 1 n ) ( 1 r 1 ) + 2 r 1 2 ( r 1 r 1 n ) ( 1 r 1 ) 2 + 2 r 2 ( r 2 n r 2 n ) ( 1 r 2 ) + 2 r 2 2 ( r 2 r 2 n ) ( 1 r 2 ) 2 + n ( n 1 ) p 0 p n 1 p + 2 n p 0 p n + 1 ( 1 p ) 2 2 p 0 p 2 ( 1 p n ) ( 1 p ) 3 n ( n 1 ) p n .
Defining J n as
J n :   = n ( n 1 ) ( p 1 + p 2 ) p n ( 1 p ) + n ( n 1 ) p 0 p n 1 p n ( n 1 ) p n + 2 n p 0 p n + 1 ( 1 p ) 2 2 p 2 ( p 1 + p 2 ) ( 1 p ) 2 + 2 n p n + 1 ( p 1 + p 2 ) ( 1 p ) 2 + 2 p 3 ( p 1 + p 2 ) ( 1 p ) 3 + 2 p n + 2 ( p 1 + p 2 ) ( 1 p ) 3 2 p 0 p 2 ( 1 p ) 3 + 2 p 0 p n + 2 ( 1 p ) 3 ,
and developing and simplifying J n thanks to the following equations
n ( n 1 ) ( p 0 + p 1 + p 2 ) p n ( 1 p ) n ( n 1 ) p n = 0 , 2 n p 0 p n + 1 ( 1 p ) 2 + 2 n ( p 1 + p 2 ) p n + 1 ( 1 p ) 2 = 2 n p n + 1 1 p , 2 p 0 p n + 2 ( 1 p ) 3 + 2 ( p 1 + p 2 ) p n + 2 ( 1 p ) 3 = 2 p n + 2 ( 1 p ) 2 ,
we have
J n = 2 n p n + 1 1 p 2 p 2 ( p 1 + p 2 ) ( 1 p ) 2 + 2 p n + 2 ( 1 p ) 2 2 p 3 ( p 1 + p 2 ) ( 1 p ) 3 2 p 0 p 2 ( 1 p ) 3 = 2 n p n + 1 1 p + 2 p n + 2 ( 1 p ) 2 2 p 2 ( 1 p ) 2 .
Replacing it in (41), we then get
2 Ψ n ( x ) 2 x | x = 1 = 2 r 1 ( r 1 n r 1 n ) ( 1 r 1 ) + 2 r 1 2 ( r 1 r 1 n ) ( 1 r 1 ) 2 + 2 r 2 ( r 2 n r 2 n ) ( 1 r 2 ) + 2 r 2 2 ( r 2 r 2 n ) ( 1 r 2 ) 2 + 2 n p n + 1 1 p + 2 p n + 2 ( 1 p ) 2 2 p 2 ( 1 p ) 2 .
We define the following sequences A r i and A p by
A r i = 2 r i ( r i n r i n ) ( 1 r i ) + 2 r i 2 ( r i r i n ) ( 1 r i ) 2 + r i r i n + 1 1 r i r i r i n + 1 1 r i 2 , i = 1 , 2 ,
and
A p = 2 n p n + 1 1 p + 2 p n + 2 ( 1 p ) 2 2 p 2 ( 1 p ) 2 p p n + 1 1 p 2 ,
developing, A r i for i = 1 , 2 , we have
A r i = 1 ( 1 r i ) 2 2 r i 2 2 n r i n + 1 2 r i 3 + 2 n r i n + 2 + 2 r i 3 2 r i n + 2 + 1 ( 1 r i ) 2 r i r i 2 r i n + 1 + r i n + 2 r i 2 + 2 r i n + 2 r i 2 ( n + 1 ) .
Simplifying A r i ,
A r i = 1 ( 1 r i ) 2 2 n r i n + 1 + 2 n r i n + 2 + r i r i n + 1 + r i n + 2 r i 2 ( n + 1 ) .
Using Corollary 1,
A r i = r i ( 1 r i ) 2 1 r i n r i n + 1 + ( 1 + 2 n ) ( 1 r i ) ,
and developing A p , we have
A p = 2 n p n + 1 1 p + 2 p n + 2 ( 1 p ) 2 2 p 2 ( 1 p ) 2 p p n + 1 1 p 2 = 2 n p n + 1 2 n p n + 2 + 2 p n + 2 2 p 2 p 2 + 2 p n + 2 p 2 ( n + 1 ) ( 1 p ) 2 .
By simplifying A p , we then get
A p = 2 n p n + 1 2 n p n + 2 + 4 p n + 2 3 p 2 p 2 ( n + 1 ) ( 1 p ) 2 = p ( 1 p ) 2 3 p + p n p n + 1 2 n ( 1 p ) 4 p .
Applying Equation (5), we have
V ar ( Y n ) = 2 Ψ n ( x ) 2 x | x = 1 + E Y n E Y n 2 ,
combining Equations (8) and (43), we obtain
V ar ( Y n ) = 2 r 1 ( r 1 n r 1 n ) ( 1 r 1 ) + 2 r 1 2 ( r 1 r 1 n ) ( 1 r 1 ) 2 + 2 r 2 ( r 2 n r 2 n ) ( 1 r 2 ) + 2 r 2 2 ( r 2 r 2 n ) ( 1 r 2 ) 2 + 2 n p n + 1 1 p + 2 p n + 2 ( 1 p ) 2 2 p 2 ( 1 p ) 2 + r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p 2 .
Applying Equations (43) and (44), we then have
V ar ( Y n ) = A r 1 + A r 2 + A p + 2 r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p , 2 r 1 r 1 n + 1 1 r 1 ( r 2 r 2 n + 1 1 r 2 ) ,
which concludes our proof.

8. Another Method for Obtaining the Distribution of the Final Altitude Y n

In this section, we present another method from [15] to study the final altitude Y n . This method is based on the generating function of f n ( x 1 , x 2 ) = E x 1 Y n ( 1 ) , x 2 Y n ( 2 ) . More precisely, we define the former as follows: for all ( t , x 1 , x 2 ) R 3 , the function F is
F ( t , x 1 , x 2 ) = n 0 f n ( x 1 , x 2 ) t n ,
where the time is encoded by the exponent of t.
Our goal in the first setup is to obtain an explicit form of F; in a second setup, we intend to expand F in series with respect to time t and extract the coefficient of t n from F. This coefficient represents the explicit form of f n ( x 1 , x 2 ) . Finally, we obtain from f n ( x 1 , x 2 ) all information about the disjoint distribution of ( Y n ( 1 ) , Y n ( 2 ) ) and thus, the distribution of Y n .
Starting from the functional equation defined in (11) and using the kernel method (for more details, see [1,16]), we obtain the probability generating function. We obtain the explicit form of F in the following theorem:
Theorem 4. 
An explicit form of the moment generating function of the final altitude Y n of the Moran walk is given by the following: for all ( x 1 , x 2 ) R 2 such that | t x 1 x 2 | < 1 / p , | t x 1 | < 1 / ( p + p 2 ) and | t x 2 | < 1 / ( p + p 1 ) ,
F ( t , x 1 , x 2 ) = 1 + Q ( x 1 , x 2 ) t 1 t 1 t p x 1 x 2 , | t | < 1 ,
where
Q ( x 1 , x 2 ) = p 0 + p 2 x 1 1 t ( p + p 2 ) 1 t x 1 ( p + p 2 ) + p 1 x 2 1 t ( p + p 1 ) 1 t x 2 ( p + p 1 ) .
Proof. 
For all n 1 , using Equation (11), we have
f n ( x 1 , x 2 ) = p 0 f n 1 ( 1 , 1 ) + p 1 x 2 f n 1 ( 1 , x 2 ) + p 2 x 1 f n 1 ( x 1 , 1 ) + p x 1 x 2 f n 1 ( x 1 , x 2 ) .
By applying the sum for the previous equation and through a simple calculation, we have
n 1 f n 1 ( 1 , x 2 ) t n = t n 0 f n ( 1 , x 2 ) t n = t F ( t , 1 , x 2 ) , n 1 f n 1 ( x 1 , 1 ) t n = t n 0 f n ( x 1 , 1 ) t n = t F ( t , x 1 , 1 ) , n 1 f n 1 ( x 1 , x 2 ) t n = t n 0 f n ( x 1 , x 2 ) t n = t F ( t , x 1 , x 2 ) .
Then, the probability generating function satisfies the recursive equation
( 1 p x 1 x 2 t ) F ( t , x 1 , x 2 ) = 1 + t p 0 F ( t , 1 , 1 ) + p 1 x 2 t F ( t , 1 , x 2 ) + p 2 x 1 t F ( t , x 1 , 1 ) .
Using Equation (47) and obtaining the expressions of F ( t , x 1 , 1 ) and F ( t , 1 , x 2 ) ,
F ( t , x 1 , 1 ) = 1 1 t x 1 ( p + p 2 ) + t ( p 0 + p 1 ) 1 t x 1 ( p + p 2 ) F ( t , 1 , 1 ) ,
F ( t , 1 , x 2 ) = 1 1 t x 2 ( p + p 1 ) + t ( p 0 + p 2 ) 1 t x 2 ( p + p 1 ) F ( t , 1 , 1 ) .
Replacing Equations (48) and (49) in (47), we obtain
1 p x 1 x 2 t F ( t , x 1 , x 2 ) = 1 + t p 2 x 1 1 t x 1 ( p + p 2 ) + p 1 x 2 1 t x 2 ( p + p 1 ) + t p 0 + p 1 x 2 t ( p 0 + p 2 ) 1 t x 2 ( p + p 1 ) + p 2 x 1 t ( p 0 + p 1 ) 1 t x 1 ( p + p 2 ) F ( t , 1 , 1 ) = 1 + p 0 + p 2 x 1 1 t ( p + p 2 ) 1 t x 1 ( p + p 2 ) t 1 t + p 1 x 2 1 t ( p + p 1 ) 1 t x 2 ( p + p 1 ) t 1 t ,
where F ( t , 1 , 1 ) is an infinite sum of geometric sequence that goes to 1 / ( 1 t ) for | t | < 1 . □
Remark 4. 
The term 1 t p x 1 x 2 in Equation (46) is called the kernel factor.
Remark 5. 
The function F defined in (46) can be factorized by:
F ( t , x 1 , x 2 ) = 1 1 Q ( x 1 , x 2 ) t 1 1 t 1 Q ( x 1 , x 2 ) 1 1 t p x 1 x 2 ,
where Q ( 1 , 1 ) = 1 p .
From Theorem 4, we can obtain the generating function of ( Y n ( 1 ) ) n and ( Y n ( 2 ) ) n .
Corollary 3. 
The probability generating functions, denoted by F ( t , x 1 , 1 ) and F ( t , 1 , x 2 ) , of ( Y n ( 1 ) ) n and ( Y n ( 2 ) ) n are given by:
F ( t , x 1 , 1 ) = 1 1 t p x 1 1 + p 0 + p 1 + p 2 x 1 [ 1 t ( p + p 2 ) ] 1 t x 1 ( p + p 2 ) t 1 t ,
and
F ( t , 1 , x 2 ) = 1 1 t p x 2 1 + p 0 + p 2 + p 1 x 2 [ 1 t ( p + p 1 ) ] 1 t x 2 ( p + p 1 ) t 1 t ,
for all x 1 R , for all x 2 R , such that | t x 1 | < 1 / p , | t x 2 | < 1 / p , where | t | < 1 .
Proof. 
The proof is a direct consequence of the previous theorem if we take x 1 = 1 in (46); we obtain the expression of the probability generating function of Y n ( 1 ) . This occurs similarly for Y n ( 2 ) . □
The previous result is very important. It allows us to know the probability generating function of each walk Y n ( 1 ) and Y n ( 2 ) . Combining Equations (45), (51) and (52) and evaluating at t = 1 , we then have the expressions of f n ( x 1 , 1 ) and f n ( 1 , x 2 ) .

9. Conclusions and Perspectives

This current paper proposes the finding of the limiting distribution of the age of two components, denoted by Y n ( 1 ) and Y n ( 2 ) ; they converge to a geometric distribution with parameters 1 r 1 and 1 r 2 based on the probability generating function method for a TDMW with length n. In addition, the means and variances of Y n ( 1 ) and Y n ( 2 ) explicitly depend on the initial probability distribution p 0 , p 1 , p 2 , and p. Furthermore, we determine the probability generating function of the maximum age Y n of two components, which is denoted by Ψ n ; it depends on the probability generating of each component and is controlled by the following term given by 2 p 0 p 1 + p 2 p x ( p x ) n 1 p x p 0 1 ( p x ) n 1 p x ( p x ) n . Furthermore, the function Ψ n depends on the four probabilities p 0 , p 1 , p 2 , and p. Finally, the mean and variance of Y n can be expressed from the means and variances of Y n ( 1 ) and Y n ( 2 ) and are controlled by the two terms given by p p n 1 p n and
2 r 1 r 1 n + 1 1 r 1 ( r 2 r 2 n + 1 1 r 2 ) p ( 1 p ) 2 3 p + p n p n + 1 2 n ( 1 p ) 4 p + 2 r 1 r 1 n + 1 1 r 1 + r 2 r 2 n + 1 1 r 2 p p n + 1 1 p ,
respectively.
In the next work, we will study the statistical properties of the height statistics, denoted by H n , starting from H 0 = 0 to H n = max ( Y 0 , , Y n ) for a TDMW with length n. We will use the generating function of H n that is defined by
F h ( x , t ) = n 0 f n h ( x ) t n = n 0 E x 1 Y n ( 1 ) x 2 Y n ( 2 ) 1 { Y 1 h , , Y n = k h } t n = k = 0 h F k h ( t ) x k ,
where f n h ( x ) encodes the possible values of Y n (bounded by h) and is given by
f n h ( x ) = E x 1 Y n ( 1 ) x 2 Y n ( 2 ) 1 { Y 1 h , , Y n = k h } = E x 1 Y n ( 1 ) x 2 Y n ( 2 ) 1 { H n h , Y n = k h } ,
and
F k h ( t ) = n 0 f n , k h t n = n 0 P ( Y 1 h , , Y n 1 h , Y n = k h ) t n ,
where the probability generating function of a bounded Moran walk with dimension two ends at altitude k (for more details, see [15]).
In Section 3, for two cases of four probabilities, we observe that the height statistics is bounded by 3, 5, 10, and 11 for walks with lengths of 10, 100, 500, and 1000 when p 0 = p = 1 / 8 and p 1 = p 2 = 3 / 8 , respectively. Moreover, it is bounded by 3, 5, 10, and 11 for the same lengths when p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 , respectively.
In the future, we plan to work on the following challenging questions:
1.
Can we find the probability generating function of the height?
2.
Can we explicitly calculate the mean and variance of the height?
3.
What is the limiting distribution of the height?

Author Contributions

Conceptualization, A.A. and M.A.; writing—–original draft preparation, A.A. and M.A.; writing—–review and editing, A.A. and M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Generation of random samples using R program.

Acknowledgments

The research of Mohamed Abdelkader has been supported in part by King Saud University, the Deanship of Scientific Research, and the College of Science Research Center.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviation is used in this manuscript:
TDMWtwo-dimensional Moran walk

References

  1. Banderier, C.; Flajolet, P. Basic analytic combinatorics of directed lattice paths. Theor. Sci. 2002, 281, 37–80. [Google Scholar] [CrossRef]
  2. Flajolet, P.; Sedgewick, R. Analytic Combinatorics; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
  3. Flajolet, P.; Guillemin, F. The formal theory of birth-and-death processes, lattice path combinatorics and continued fractions. Adv. Appl. Probab. 2000, 32, 750–778. [Google Scholar] [CrossRef]
  4. Drmota, M. Random Trees. In An Interplay between Combinatorics and Probability; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
  5. Banderier, C.; Nicodème, P. Bounded discrete walks. Discret. Math. Theor. Comput. Sci. 2010, 35–48. [Google Scholar] [CrossRef]
  6. de Bruijn, N.G.; Knuth, D.E.; Rice, S.O. The average height of planted plane trees. Graph. Theory Comput. 1972, 15–22. [Google Scholar] [CrossRef]
  7. Gafni, A. Longest run of equal parts in a random integer composition. Discret. Math. 2015, 338, 236–247. [Google Scholar] [CrossRef]
  8. Banderier, C.; Wallner, M. Lattice paths with catastrophes. Discret. Math. Theor. Comput. Sci. 2017, 19, 32. [Google Scholar] [CrossRef]
  9. Itoh, Y.; Mahmoud, H.M.; Takahashi, D. A stochastic model for solitons. Random Struct. Algorithms 2004, 24, 51–64. [Google Scholar] [CrossRef]
  10. Itoh, Y.; Mahmoud, H.M. Age statistics in the Moran population model. Stat. Probab. Lett. 2005, 74, 21–30. [Google Scholar] [CrossRef]
  11. Huillet, T.; Möhle, M. Duality and asymptotics for a class of nonneutral discrete Moran models. J. Appl. Probab. 2009, 46, 866–893. [Google Scholar] [CrossRef]
  12. Cordero, F.; Véchambre, G. Moran models and Wright-Fisher diffusions with selection and mutation in a one-sided random environment. Adv. Appl. Probab. 2023, 1–67. [Google Scholar] [CrossRef]
  13. Huillet, T.E. Random walk Green kernels in the neutral Moran model conditioned on survivors at a random time to origin. Math. Popul. Stud. 2016, 23, 164–200. [Google Scholar] [CrossRef]
  14. Faragó, A. Approximating general discrete stochastic processes by Markov chains. J. Stat. Comput. Sci. 2022, 1, 135–145. [Google Scholar]
  15. Althagafi, A.; Aguech, R.; Banderier, C. Height of Walks with Resets and the Moran Model. 2023. submitted. [Google Scholar]
  16. Banderier, C. Combinatoire Analytique des Chemins et des Cartes. Ph.D. Thesis, Université Paris VI, Paris, France, 2001. [Google Scholar]
Figure 1. The evolution of the age of each component, the final altitude, and the height of two-dimensional Moran walk of length 5 starting from ( 0 , 0 ) .
Figure 1. The evolution of the age of each component, the final altitude, and the height of two-dimensional Moran walk of length 5 starting from ( 0 , 0 ) .
Symmetry 15 01046 g001
Figure 2. The evolution of the age of each component, the final altitude, and the height of the two-dimensional Moran walk of length 10 starting from ( 0 , 0 ) .
Figure 2. The evolution of the age of each component, the final altitude, and the height of the two-dimensional Moran walk of length 10 starting from ( 0 , 0 ) .
Symmetry 15 01046 g002
Figure 3. Final altitude of Y ( 1 ) (in red), Y ( 2 ) (in green), their maximum ( Y : = max ( Y ( 1 ) , Y ( 2 ) ) , in pick) and height H (in orange) of a TDMW of lengths 10 and 100 for p 0 = p = 1 / 8 and p 1 = p 2 = 3 / 8 .
Figure 3. Final altitude of Y ( 1 ) (in red), Y ( 2 ) (in green), their maximum ( Y : = max ( Y ( 1 ) , Y ( 2 ) ) , in pick) and height H (in orange) of a TDMW of lengths 10 and 100 for p 0 = p = 1 / 8 and p 1 = p 2 = 3 / 8 .
Symmetry 15 01046 g003
Figure 4. Final altitude Y ( 1 ) , Y ( 2 ) , and Y and height H of a TDMW with lengths 500 and 1000 for p 0 = p = 1 / 8 and p 1 = p 2 = 3 / 8 .
Figure 4. Final altitude Y ( 1 ) , Y ( 2 ) , and Y and height H of a TDMW with lengths 500 and 1000 for p 0 = p = 1 / 8 and p 1 = p 2 = 3 / 8 .
Symmetry 15 01046 g004
Figure 5. Final altitude Y ( 1 ) , Y ( 2 ) , and Y and height H of the Moran walk with lengths 10 and 100 for p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 .
Figure 5. Final altitude Y ( 1 ) , Y ( 2 ) , and Y and height H of the Moran walk with lengths 10 and 100 for p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 .
Symmetry 15 01046 g005
Figure 6. Final altitude Y ( 1 ) , Y ( 2 ) , and Y and height H of a TDMW with lengths 500 and 1000 for p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 .
Figure 6. Final altitude Y ( 1 ) , Y ( 2 ) , and Y and height H of a TDMW with lengths 500 and 1000 for p 0 = p = 1 / 6 and p 1 = p 2 = 1 / 3 .
Symmetry 15 01046 g006
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Althagafi, A.; Abdelkader, M. Two-Dimensional Moran Model. Symmetry 2023, 15, 1046. https://doi.org/10.3390/sym15051046

AMA Style

Althagafi A, Abdelkader M. Two-Dimensional Moran Model. Symmetry. 2023; 15(5):1046. https://doi.org/10.3390/sym15051046

Chicago/Turabian Style

Althagafi, Asma, and Mohamed Abdelkader. 2023. "Two-Dimensional Moran Model" Symmetry 15, no. 5: 1046. https://doi.org/10.3390/sym15051046

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop