Markov models for duration-dependent transitions: selecting the states using duration values or duration intervals?

Carette, Philippe; Guerry, Marie-Anne

doi:10.1007/s10260-022-00637-2

Markov models for duration-dependent transitions: selecting the states using duration values or duration intervals?

Original Paper
Published: 15 April 2022

Volume 31, pages 1203–1223, (2022)
Cite this article

Statistical Methods & Applications Aims and scope Submit manuscript

179 Accesses
1 Citation
Explore all metrics

Abstract

In a Markov model the transition probabilities between states do not depend on the time spent in the current state. The present paper explores two ways of selecting the states of a discrete-time Markov model for a system partitioned into categories where the duration of stay in a category affects the probability of transition to another category. For a set of panel data, we compare the likelihood fits of the Markov models with states based on duration intervals and with states defined by duration values. For hierarchical systems, we show that the model with states based on duration values has a better maximum likelihood fit than the baseline Markov model where the states are the categories. We also prove that this is not the case for the duration-interval model, under conditions on the data that seem realistic in practice. Furthermore, we use the Akaike and Bayesian information criteria to compare these alternative Markov models. The theoretical findings are illustrated by an analysis of a real-world personnel data set.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

Article 04 June 2018

Recognize the Value of the Sum Score, Psychometrics’ Greatest Accomplishment

Article Open access 17 April 2024

Fixed and random effects models: making an informed choice

Article Open access 07 August 2018

References

Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: 2nd intr. symp. on information theory, Budapest, 1973, Akademiai Kiado
Anderson D, Burnham K (2004) Model selection and multi-model inference. Springer, New York, p 10
Google Scholar
Bacci S, Pandolfi S, Pennoni F (2014) A comparison of some criteria for states selection in the latent Markov model for longitudinal data. Adv Data Anal Classif 8(2):125–145
Article MathSciNet MATH Google Scholar
Bányai T, Landschützer C, Bányai Á (2018) Markov-chain simulation-based analysis of human resource structure: How staff deployment and staffing affect sustainable human resource strategy. Sustainability 10(10):3692
Article Google Scholar
Barbu V, Limnios N (2008) Semi-Markov chains and hidden semi-Markov models toward applications: their use in reliability and DNA analysis. Springer, New York
MATH Google Scholar
Bartholomew D, Forbes A, McClean S (1991) Statistical techniques for manpower planning. Wiley, Chichester
Google Scholar
Bartolucci F, Farcomeni A, Pennoni F (2012) Latent Markov models for longitudinal data. Chapman and Hall/CRC, Boca Raton
Book MATH Google Scholar
Cartella F, Lemeire J, Dimiccoli L, Sahli H (2015) Hidden semi-Markov models for predictive maintenance. Math Probl Eng 2015
D’Amico G, Janssen J, Manca R (2006) Homogeneous semi-Markov reliability models for credit risk management. Decis Econ Finance 28(2):79–93
Article MathSciNet MATH Google Scholar
De Feyter T (2006) Modelling heterogeneity in manpower planning: dividing the personnel system into more homogeneous subgroups. Appl Stoch Models Bus Ind 22(4):321–334
Article MathSciNet MATH Google Scholar
Dewar M, Wiggins C, Wood F (2012) Inference in hidden Markov models with explicit state duration distributions. IEEE Signal Process Lett 19(4):235–238
Article Google Scholar
Duffy S, Day N, Tabár L, Chen H, Smith T (1997) Markov models of breast tumor progression: some age-specific results. J Natl Cancer Inst Monogr 22:93–97
Article Google Scholar
Durland JM, McCurdy TH (1994) Duration-dependent transitions in a Markov model of US GNP growth. J Bus Econ Stat 12(3):279–288
Google Scholar
Forbes J (1987) Early intraorganizational mobility: patterns and influences. Acad Manag J 30(1):110–125
Article Google Scholar
Frühwirth-Schnatter S, Pittner S, Weber A, Winter-Ebmer R (2018) Analysing plant closure effects using time-varying mixture-of-experts Markov chain clustering. Ann Appl Stat 12(3):1796–1830
Article MathSciNet MATH Google Scholar
Jiang Y, Sinha K (1989) Bridge service life prediction model using the Markov chain. Transp Res Rec 1223:24–30
Google Scholar
Katz RW (1981) On some criteria for estimating the order of a Markov chain. Technometrics 23(3):243–249
Article MathSciNet MATH Google Scholar
Langeheine R, Van de Pol F (1994) Discrete-time mixed Markov latent class models. A casebook of methods, Analyzing social and political change, pp 170–197
Longini I, Clark W, Gardner L, Brundage J (1991) The dynamics of CD4+ T-lymphocyte decline in HIV-infected individuals: a Markov modeling approach. J Acquir Immune Defici Syndr 4(11):1141–1147
Google Scholar
Lyness K, Thompson D (2000) Climbing the corporate ladder: do female and male executives follow the same route? J Appl Psychol 85(1):86
Article Google Scholar
McFarland D (1970) Intragenerational social mobility as a Markov process: including a time-stationary Markovian model that explains observed declines in mobility rates over time. Am Sociol Rev 35(3):463–476
Article Google Scholar
McGinnis R (1968) A stochastic model of social mobility. Am Sociol Rev 33(5):712–722
Article Google Scholar
Parker B, Caine D (1996) Holonic modelling: human resource planning and the two faces of Janus. Int J Manpow 17(8):30–45
Article Google Scholar
Patten S (2005) Markov models of major depression for linking psychiatric epidemiology to clinical practice. Clin Pract Epidemiol Ment Health 1(2):11
Google Scholar
Rombaut E, Guerry MA (2015) Decision trees as a classification technique in manpower planning. In: The 16th conference of the applied stochastic models and data analysis international society, pp 863–877
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6(2):461–464
Article MathSciNet MATH Google Scholar
Sonnenberg FA, Beck JR (1993) Markov models in medical decision making: a practical guide. Med Decis Mak 13(4):322–338
Article Google Scholar
Uche P (1990) Non-homogeneity and transition probabilities of a Markov chain. Int J Math Educ Sci Technol 21(2):295–301
Article MATH Google Scholar
Ugwuowo F, McClean S (2000) Modelling heterogeneity in a manpower system: a review. Appl Stoch Models Bus Ind 16(2):99–110
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Economics, Ghent University, Ghent, Belgium
Philippe Carette
Department of Business Technology and Operations, Vrije Universiteit Brussel, Brussels, Belgium
Marie-Anne Guerry

Authors

Philippe Carette
View author publications
You can also search for this author in PubMed Google Scholar
Marie-Anne Guerry
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marie-Anne Guerry.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Proofs of theorems and lemmas

Let us recall the definition of the following functions:

$$\begin{aligned} \varphi (t)&= {\left\{ \begin{array}{ll} t\ln {t} &{} \text {if } t>0 \\ 0 &{} \text {if } t=0, \end{array}\right. } \end{aligned}$$

(13)

$$\begin{aligned} \phi (t_1,t_2)&=\varphi (t_1)+\varphi (t_2)-\varphi (t_1+t_2)\text {.} \end{aligned}$$

(12)

and

$$\begin{aligned} \psi (x,y) = \phi (x,y)+\phi (x+y,n_{AB}-y)+\phi (n_{AA}-x-y,y)-\phi (n_{AA},n_{AB})\text {.} \end{aligned}$$

(24)

Lemma 1

$\ln {\hat{L}_{\mathcal {S}}} = \phi (n_{11},n_{12})+\phi (n_{11}+n_{12},n_{13})+\phi (n_{22},n_{23})$

Proof

Using (4), (5) and the fact that $n_2-n_{23}=n_{22}$, we obtain

$$\begin{aligned} \ln {\hat{L}_{\mathcal {S}}}&= \varphi (n_{11})+\varphi (n_{12})+\varphi (n_{13})+\varphi (n_{22})+\varphi (n_{23})-\varphi (n_{1})-\varphi (n_{2}) \\&\begin{aligned}&=\varphi (n_{11})+\varphi (n_{12})-\varphi (n_{11}+n_{12}) \\&\quad + \varphi (n_{11}+n_{12}) +\varphi (n_{13})-\varphi (n_{1}) \\&\quad +\varphi (n_{22})+\varphi (n_{23})-\varphi (n_{2}) \end{aligned} \\&= \phi (n_{11},n_{12})+\phi (n_{11}+n_{12},n_{13})+\phi (n_{22},n_{23}) \text {.} \end{aligned}$$

$\square$

Lemma 2

$\ln {\hat{L}_{\tilde{\mathcal {S}}}} = \phi (n_{11}+n_{12},n_{13})+\phi (n_{22},n_{23})$

Proof

It follows from (10) that

$$\begin{aligned} \hat{L}_{\tilde{\mathcal {S}}}=\frac{ {(n_{11}+n_{12})}^{n_{11}+n_{12}} \, {n_{13}}^{n_{13}} \, {n_{22}}^{n_{22}} \, {n_{23}}^{n_{23}} }{ {n_{1}}^{n_{1}} \, {n_{2}}^{n_{2}}\text {,} } \end{aligned}$$

using $n_{1}=n_{11}+n_{12}+n_{13}$ and $n_{2}=n_{22}+n_{23}$. Hence,

$$\begin{aligned} \ln {\hat{L}_{\tilde{\mathcal {S}}}}&= \varphi (n_{11}+n_{12})+\varphi (n_{13})+\varphi (n_{22})+\varphi (n_{23})-\varphi (n_{1})-\varphi (n_{2}) \\&= \bigl (\varphi (n_{11}+n_{12})+\varphi (n_{13})-\varphi (n_{1}) \bigr ) + \bigl (\varphi (n_{22})+\varphi (n_{23})-\varphi (n_{2}) \bigr ) \\&= \phi (n_{11}+n_{12},n_{13})+\phi (n_{22},n_{23}) \text {.} \end{aligned}$$

$\square$

Lemma 3

The function $\phi$, defined by (12), is homogeneous of the first degree, i.e.

$$\begin{aligned} \phi (tx,ty)=t\,\phi (x,y)\quad \text {for all } t,x,y\ge 0 \end{aligned}$$

Proof

By (13), we have that $\varphi (tu)=t\,\varphi (u)+u\,\varphi (t)$ for all $t,u\ge 0$. The result then follows from (12). $\square$

Lemma 4

The function $\phi$, defined by (12), is convex.

Proof

Let $u,v>0$. We prove that the Hessian matrix of $\phi$ at (u, v), denoted H, is positive semi-definite. Using standard calculus,

$$\begin{aligned} H= \begin{bmatrix} \frac{v}{u(u+v)} &{} -\frac{1}{u+v} \\ -\frac{1}{u+v} &{} \frac{u}{v(u+v)} \end{bmatrix}= \frac{1}{uv(u+v)} \begin{bmatrix} v^2 &{} -uv \\ -uv &{} u^2 \end{bmatrix}\text {.} \end{aligned}$$

(23)

The eigenvalues of H are 0 and $\frac{u^2+v^2}{uv(u+v)}$. They are both non-negative, hence H is positive semi-definite. $\square$

Lemma 5

For the function $\phi$, defined by (12), holds

$$\begin{aligned} \phi (a,b)+\phi (c,d)\ge \phi (a+c,b+d)\quad \text {for all } a,b,c,d\ge 0 \end{aligned}$$

with equality if and only if $ad=bc$.

Proof

The inequality is an immediate consequence of the convexity and homogeneity properties of the function $\phi$ (Lemmas 3, 4):

$$\begin{aligned} \phi (a,b)+\phi (c,d)\ge 2\,\phi (\tfrac{a+c}{2},\tfrac{b+d}{2})=\phi (a+c,b+d)\text {.} \end{aligned}$$

Suppose $ad=bc$. If $ad=0=bc$, then equality holds surely because $\phi (u,v)=0$ if $u=0$ or $v=0$. If $ad=bc\ne 0$, then $c=ta$ and $d=tb$ for some number $t\ne 0$. Hence, by the homogeneity of $\phi$,

$$\begin{aligned} \phi (a,b)+\phi (c,d)&= \phi (a,b)+\phi (ta,tb) \\&= (1+t)\phi (a,b) \\&= \phi ((1+t)a,(1+t)b) =\phi (a+c,b+d)\text {.} \end{aligned}$$

Now suppose equality holds. Then, by homogeneity of $\phi$,

$$\begin{aligned} \frac{\phi (a,b)+\phi (c,d)}{2}=\phi (\tfrac{a+c}{2},\tfrac{b+d}{2})\text {.} \end{aligned}$$

Consequently, the function $f(t)=\phi (u,v)$ with $u=a+t(c-a)$ and $v=b+t(d-b)$ is linear on [0, 1], because $\phi$ is convex. So, $f''(t)=0$ for all $t\in (0,1)$. Using the Hessian matrix of $\phi$ in (23), we obtain

$$\begin{aligned} f''(t)&= (c-a)^2H_{11}+2(c-a)(d-b)H_{12}+(d-b)^2H_{22} \\&= \frac{\left[ (b-d)u-(c-a)v\right] ^2}{uv(u+v)}=\frac{(ad-bc)^2}{uv(u+v)}\text {,} \end{aligned}$$

and the result $ad=bc$ thus follows from $f''(t)=0$. $\square$

Lemma 6

For the function $\phi$, as defined in (12), holds that $\phi _y:t \mapsto \phi (t,y)$ is strictly decreasing and strictly convex for all $y>0$.

Proof

Using standard calculus, ${\phi _y}'(t)=\ln {t}-\ln {(t+y)}<0$ and ${\phi _y}''(t)=\frac{y}{t(t+y)}>0$, if $t>0$. $\square$

Lemma 7

If $n_{AA}>2n_{AB}$, it holds that $\psi (n_{AA}-\frac{3}{2}n_{AB},n_{AB})<0$.

Proof

Let $\alpha =n_{AA}/n_{AB}$. By (24) and Lemma 3,

$$\begin{aligned} \psi (n_{AA}-\tfrac{3}{2}n_{AB},n_{AB})=h(\alpha )n_{AB} \end{aligned}$$

where

$$\begin{aligned} h(t)=\phi _1(t-\tfrac{3}{2})+\phi _1(\tfrac{1}{2})-\phi _1(t) \end{aligned}$$

and $\phi _1$ is the function $u \mapsto \phi (u,1)$. Since $\phi _1$ is strictly convex (Lemma 6), its first derivative is strictly increasing and therefore h is strictly decreasing. Furthermore,

$$\begin{aligned} h(2)=\phi (\tfrac{1}{2},1)+\phi (\tfrac{1}{2},1)-\phi (2,1)=2\phi (\tfrac{1}{2},1)-\phi (2,1)=0 \end{aligned}$$

by Lemma 3. Consequently, $h(\alpha )<0$ since $\alpha >2$. $\square$

Lemma 8

If $n_{AA}>2n_{AB}$, it holds that $\psi (n_{AA}-\tfrac{3}{4}n_{AB},\tfrac{3}{4}n_{AB})<0$.

Proof

Let $\alpha =n_{AA}/n_{AB}$ and $\beta =3/4$. By (24) and Lemma 3,

$$\begin{aligned} \psi (n_{AA}-\beta n_{AB},\beta n_{AB})=h(\alpha )n_{AB} \end{aligned}$$

where

$$\begin{aligned} h(t)=\phi (t-\beta ,\beta )+\phi (t,1-\beta )-\phi (t,1)\text {.} \end{aligned}$$

With the use of the function $\phi _1:u \mapsto \phi (u,1)$ and Eqs. (13) and (12) we can rewrite h(t) as

$$\begin{aligned} h(t)=\phi _1(t-\beta )-\phi _1(t)+\phi (\beta ,1-\beta )\text {.} \end{aligned}$$

Since $\phi _1$ is strictly convex (Lemma 6), its first derivative is strictly increasing and therefore h is strictly decreasing. The result now follows from the fact that $h(2)<0$. $\square$

Lemma 9

If $n_{AA}\ge 3n_{AB}$, it holds that $\psi (n_{AB},n_{AB})<0$.

Proof

Let $\alpha =n_{AA}/n_{AB}$. By (24) and Lemma 3,

$$\begin{aligned} \psi (n_{AB},n_{AB})=h(\alpha )n_{AB} \end{aligned}$$

where

$$\begin{aligned} h(t)=\phi (1,1)+\phi _1(t-2)-\phi _1(t) \end{aligned}$$

and $\phi _1$ is the function $u \mapsto \phi (u,1)$. Since $\phi _1$ is strictly convex (Lemma 6), its first derivative is strictly increasing and therefore h is strictly decreasing. Furthermore, $h(3)=\ln {\frac{16}{27}}<0$. Hence, since $\alpha \ge 3$ the monotonicity of h yields $h(\alpha )<0$ and the result follows. $\square$

Lemma 10

If $n_{AA}> 2n_{AB}$, it holds that $\psi (\tfrac{n_{AA}-n_{AB}}{2},n_{AB})<0$.

Proof

Let $\alpha =n_{AA}/n_{AB}$. By (24) and Lemmas 3 and 6,

$$\begin{aligned} \psi (\tfrac{n_{AA}-n_{AB}}{2},n_{AB})=h(\alpha )n_{AB}\text {,} \end{aligned}$$

where

$$\begin{aligned} h(t)=\phi (t-1,2)-\phi (t,1)\text {.} \end{aligned}$$

Using standard calculus,

$$\begin{aligned} h'(t)=\ln {(t-1)}-\ln {t}<0\text {,} \end{aligned}$$

so that the function h is strictly decreasing. Furthermore, $h(2)=0$ which can be verified by straightforward computation. Hence, since $\alpha >2$ the monotonicity of h yields $h(\alpha )<0$ and the result follows. $\square$

Lemma 11

$\psi (0,\frac{n_{AA}n_{AB}}{n_{AA}+n_{AB}})=0$.

Proof

Denote $\rho =\frac{n_{AA}n_{AB}}{n_{AA}+n_{AB}}$. Then, $n_{AB}-\rho = \tfrac{n_{AB}}{n_{AA}}\rho$ and $n_{AA}-\rho =\tfrac{n_{AA}}{n_{AB}}\rho$, so that, using (24) and Lemma 3,

$$\begin{aligned} \psi (0,\rho )&= \phi (\theta ,n_{AB}-\rho )+\phi (n_{AA}-\rho ,\rho )-\phi (n_{AA},n_{AB}) \\&= \phi (\rho ,\tfrac{n_{AB}}{n_{AA}}\rho )+\phi (\tfrac{n_{AA}}{n_{AB}}\rho ,\rho )-\phi (n_{AA},n_{AB}) \\&=\tfrac{\rho }{n_{AA}}\phi (n_{AA},n_{AB})+\tfrac{\rho }{n_{AB}}\phi (n_{AA},n_{AB})-\phi (n_{AA},n_{AB}) \\&= \left( \tfrac{\rho }{n_{AA}}+\tfrac{\rho }{n_{AB}}-1\right) \,\phi (n_{AA},n_{AB})=0\text {.} \end{aligned}$$

$\square$

Lemma 12

$\psi (0,n_{AB})=\psi (n_{AA}-n_{AB},n_{AB})>0$.

Proof

By (24), $\psi (0,n_{AB})$ and $\psi (n_{AA}-n_{AB},n_{AB})$ are both equal to $\phi _{n_{AB}}(n_{AA}-n_{AB})-\phi _{n_{AB}}(n_{AA})$, where $\phi _{n_{AB}}$ is the function $u \mapsto \phi (u,n_{AB})$. According to Lemma 6, $\phi _{n_{AB}}$ is strictly decreasing and the result follows. $\square$

Lemma 13

For an AB-complete data set, we have that $\hat{\tau }_{23}>\hat{p}_{AB}$ is equivalent to $g(n_{11},n_{12})>0$, where

$$\begin{aligned} g(x,y)=n_{AB}x+(n_{AA}+n_{AB})y-n_{AA}n_{AB}\text {.} \end{aligned}$$

Hence, if $\hat{\tau }_{23}>\hat{p}_{AB}$, the point $(n_{11},n_{12})$ in xy-plane lies above the line through the points $(0,\tfrac{n_{AA}n_{AB}}{n_{AA}+n_{AB}})$ and $(n_{AA},0)$.

Proof

Because of AB-completeness, it holds that $n_{23}=n_{12}$. Furthermore, $n_{2}=n_{22}+n_{23}=n_{22}+n_{12}=n_{AA}-n_{11}$. Consequently, since $\hat{\tau }_{23}=n_{23}/n_{2}$ and $\hat{p}_{AB}=n_{AB}/(n_{AA}+n_{AB})$, we have

$$\begin{aligned} \hat{\tau }_{23}>\hat{p}_{AB}&\Leftrightarrow \; \frac{n_{12}}{n_{AA}-n_{11}}> \frac{n_{AB}}{n_{AA}+n_{AB}} \\&\Leftrightarrow \; n_{12}(n_{AA}+n_{AB})> (n_{AA}-n_{11})n_{AB} \\&\Leftrightarrow \; g(n_{11},n_{12})>0\text {.} \end{aligned}$$

$\square$

Proof of theorem 1

Proof

In expression (17), we can eliminate the variables $n_{13}$, $n_{22}$ and $n_{23}$, since $n_{AA}=n_{11}+n_{12}+n_{22}$, and, by the AB-completeness assumption, $n_{AB}=n_{13}+n_{23}$ and $n_{12}=n_{23}$. Hence, $\varDelta \ell \ell _{\mathrm{M}(k)\text{- }\mathrm{M}}=\psi (n_{11},n_{12})$, where

$$\begin{aligned} \psi (x,y) = \phi (x,y)+\phi (x+y,n_{AB}-y)+\phi (n_{AA}-x-y,y)-\phi (n_{AA},n_{AB}){\text {.}} \end{aligned}$$

(24)

The function $\psi$ is convex, since each of the first three terms in (24) is a convex function of (x, y), as a composition of the convex function $\phi$ (Lemma 4) and an affine function from $\mathbb {R}^2$ to $\mathbb {R}^2$.

Let $\alpha =n_{AA}-n_{AB}$. By assumption, $\alpha >0$. Let $\beta =\min \{n_{AB},\frac{\alpha }{2}\}$ and denote the following points in xy-plane: $\mathbf {a}(0,\frac{n_{AA}n_{AB}}{n_{AA}+n_{AB}})$, $\mathbf {b}(\beta ,n_{AB})$, $\mathbf {c}(\alpha ,n_{AB})$, $\mathbf {d}(0,n_{AA})$, $\mathbf {e}(n_{AA}-\tfrac{3}{2}n_{AB},n_{AB})$ and $\mathbf {f}(n_{AA}-\tfrac{3}{4}n_{AB},\tfrac{3}{4}n_{AB})$. Because $n_{AA}>2n_{AB}$, the 3-simplex $\mathbf {ecf}$ is a subset of the 4-simplex $\mathbf {abcd}$, see Fig. 3.

Take $k\ge 1$ sufficiently large, so that $\hat{\tau }_{23}>\hat{p}_{AB}$. Then, using Lemma 13 and the fact that $n_{11}\ge n_{12}$ whenever $k\ge 1$, the point $\mathbf {p}(n_{11},n_{12})$ in xy-plane must be contained in the 4-simplex $\mathbf {abcd}$.

Suppose $\varDelta \ell \ell _{\mathrm{M}(k)\text{- }\mathrm{M}}\ge 0$. We prove that $\mathbf {p}$ must then belong to the 3-simplex $\mathbf {ecf}$. First, we observe that $\psi$ is non-positive on the 5-simplex $\mathbf {abefd}$, because $\psi$ is convex and $\psi$ is non-positive in all vertices of $\mathbf {abefd}$ (Lemmas 7, 8, 9, 10, 11). Hence, because $\mathbf {p}\in \mathbf {abcd}$ and $\psi (\mathbf {p})=\varDelta \ell \ell _{\mathrm{M}(k)\text{- }\mathrm{M}}\ge 0$, we have $\mathbf {p}\in \mathbf {ecf}$. Consequently, $n_{11}>n_{AA}-\tfrac{3}{2}n_{AB}$ and $n_{12}>\tfrac{3}{4}n_{AB}$. But then, (20) cannot be satisfied, since $n_{AA}-n_{11}=n_{22}+n_{12}=n_{22}+n_{23}=n_{2}$. $\square$

Proof of theorem 2

Proof

Denote $n_{11}+n_{12}=a$, $n_{13}=b$, $n_{22}=c$ and $n_{23}=d$. Then, $n_{AA}=a+c$ and $n_{AB}=b+d$. Hence, using (18), we have

$$\begin{aligned} \varDelta \ell \ell _{\widetilde{\mathrm{M}}(k)\text{- }\mathrm{M}}=\phi (a,b)+\phi (c,d)-\phi (a+c,b+d)\text {,} \end{aligned}$$

which is non-negative by Lemma 5.

Furthermore, $\varDelta \ell \ell _{\widetilde{\mathrm{M}}(k)\text{- }\mathrm{M}}=0$ is equivalent to $\phi (a,b)+\phi (c,d)=\phi (a+c,b+d)$, which in turn is equivalent to $ad=bc$ by Lemma 5. Finally, $\hat{\tau }_{13}=\hat{\tau }_{23}=\hat{p}_{AB}$ if and only if $ad=bc$, since $\hat{\tau }_{13}=\tfrac{b}{a+b}$, $\hat{\tau }_{23}=\tfrac{d}{c+d}$ and $\hat{p}_{AB}=\tfrac{b+d}{a+c+b+d}$. $\square$

Proof of theorem 3

Proof

If $n_{11}>0$ and $n_{12}>0$, we have by the binomial theorem that $n_{11}^{n_{11}}n_{12}^{n_{12}}< (n_{11}+n_{12})^{n_{11}+n_{12}}$, hence $\phi (n_{11},n_{12})< 0$ using (12). The theorem now follows from (19) and the fact that $\phi (n_{11},n_{12})=0$ if $n_{11}=0$ or $n_{12}=0$. $\square$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Carette, P., Guerry, MA. Markov models for duration-dependent transitions: selecting the states using duration values or duration intervals?. Stat Methods Appl 31, 1203–1223 (2022). https://doi.org/10.1007/s10260-022-00637-2

Download citation

Accepted: 29 March 2022
Published: 15 April 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s10260-022-00637-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Markov models for duration-dependent transitions: selecting the states using duration values or duration intervals?

Abstract

Access this article

Similar content being viewed by others

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

Recognize the Value of the Sum Score, Psychometrics’ Greatest Accomplishment

Fixed and random effects models: making an informed choice

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Proofs of theorems and lemmas

Appendix: Proofs of theorems and lemmas

Lemma 1

Proof

Lemma 2

Proof

Lemma 3

Proof

Lemma 4

Proof

Lemma 5

Proof

Lemma 6

Proof

Lemma 7

Proof

Lemma 8

Proof

Lemma 9

Proof

Lemma 10

Proof

Lemma 11

Proof

Lemma 12

Proof

Lemma 13

Proof

Proof

Proof

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation