Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Guo, Xianping; Huang, Yonghui; Zhang, Yi

doi:10.1007/s00245-016-9352-6

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Published: 15 April 2016

Volume 75, pages 317–341, (2017)
Cite this article

Applied Mathematics & Optimization Submit manuscript

Xianping Guo¹,
Yonghui Huang¹ &
Yi Zhang²

446 Accesses
4 Citations
Explore all metrics

Abstract

This paper studies the constrained (nonhomogeneous) continuous-time Markov decision processes on the finite horizon. The performance criterion to be optimized is the expected total reward on the finite horizon, while N constraints are imposed on similar expected costs. Introducing the appropriate notion of the occupation measures for the concerned optimal control problem, we establish the following under some suitable conditions: (a) the class of Markov policies is sufficient; (b) every extreme point of the space of performance vectors is generated by a deterministic Markov policy; and (c) there exists an optimal Markov policy, which is a mixture of no more than \(N+1\) deterministic Markov policies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

Article 19 October 2019

Finite horizon continuous-time Markov decision processes with mean and variance criteria

Article 29 September 2018

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

Article 27 November 2014

References

Altman, E.: Constrained Markov Decision Processes. Chapman & Hall, Boca Raton (1999)
MATH Google Scholar
Avrachenkov, K., Habachi, O., Piunovskiy, A., Zhang, Y.: Infinite horizon impulsive optimal control with applications to Internet congestion control. Int. J. Control 88, 703–716 (2015)
Article MATH Google Scholar
Baüerle, N., Rieder, U.: Markov Decision Processes with Applications to Finance. Springer, Heidelberg (2011)
Book MATH Google Scholar
Bertsekas, D., Nedíc, A., Ozdaglar, A.: Convex Analysis and Optimization. Athena Scientific, Belmont (2003)
MATH Google Scholar
Feinberg, E.: Continuous time discounted jump Markov decision processes: a discrete-event approach. Math. Oper. Res. 29, 492–524 (2004)
Article MathSciNet MATH Google Scholar
Feinberg, E., Mandava, M., Shiryayev, A.: On solutions of Kolmogorovs equations for nonhomogeneous jump Markov processes. J. Math. Anal. Appl. 411(1), 261–270 (2014)
Article MathSciNet MATH Google Scholar
Feinberg, E., Rothblum, U.: Splitting randomized stationary policies in total-reward Markov decision processes. Math. Oper. Res. 37, 129–153 (2012)
Article MathSciNet MATH Google Scholar
Ghosh, M.K., Saha, S.: Continuous-time controlled jump Markov processes on the finite horizon. In: Optimization, Control, and Applications of Stochastic Systems, pp. 99–109. Birkhäuser, New York (2012)
Guo, X.P., Hernández-Lerma, O.: Continuous-Time Markov Decision Processes. Springer, Berlin (2009)
Book MATH Google Scholar
Guo, X.P., Hernández-Lerma, O.: Constrained continuous-time Markov controlled processes with discounted criteria. Stoch. Anal. Appl. 21, 379–399 (2003)
Article MATH Google Scholar
Guo, X.P., Huang, X.X., Huang, Y.H.: Finite horizon optimality for continuous-time Markov decision processes with unbounded transition rates. Adv. Appl. Probab. 47, 1–24 (2015)
Article MathSciNet MATH Google Scholar
Guo, X.P., Huang, Y.H., Song, X.Y.: Linear programming and constrained average optimality for general continuous-time Markov decision processes in history-dependent policies. SIAM J. Control Optim. 50, 23–47 (2012)
Article MathSciNet MATH Google Scholar
Guo, X.P., Song, X.Y.: Discounted continuous-time constrained Markov decision processes in Polish spaces. Ann. Appl. Probab. 21, 2016–2049 (2011)
Article MathSciNet MATH Google Scholar
Guo, X.P., Piunovskiy, A.: Discounted continuous-time Markov decision processes with constraints: unbounded transition and loss rates. Math. Oper. Res. 36, 105–132 (2011)
Article MathSciNet MATH Google Scholar
Guo, X.P., Vykertas, M., Zhang, Y.: Absorbing continuous-time Markov decision processes with total cost criteria. Adv. Appl. Probab. 45, 490–519 (2013)
Article MathSciNet MATH Google Scholar
Hernández-Lerma, O., Lasserre, J.B.: Discrete-Time Markov Control Processes: Basic Optimality Criteria. Springer, New York (1996)
Book MATH Google Scholar
Hernández-Lerma, O., Lasserre, J.B.: Further Topics on Discrete-Time Markov Control Processes. Springer, New York (1999)
Book MATH Google Scholar
Huang, Y.H.: Finite horizon continuous-time Markov decision processes with mean and variance criteria. Submitted (2015)
Jacod, J.: Multivariate point processes: predictable projection, Radon–Nicodym derivatives, representation of martingales. Z. Wahrscheinlichkeitstheorie und verwandte Gebiete 31, 235–253 (1975)
Article MATH Google Scholar
Kitaev, M.Y., Rykov, V.V.: Controlled Queueing Systems. CRC Press, New York (1995)
MATH Google Scholar
Miller, B.L.: Finite state continuous time Markov decision processes with a finite planning horizon. SIAM J. Control 6, 266–280 (1968)
Article MathSciNet MATH Google Scholar
Miller, B., Miller, G., Siemenikhin, K.: Towards the optimal control of Markov chains with constraints. Automatica 46, 1495–1502 (2010)
Article MathSciNet MATH Google Scholar
Piunovskiy, A.B.: Optimal Control of Random Sequences in Problems with Constraints. Kluwer Academic, Dordrecht (1997)
Book MATH Google Scholar
Piunovskiy, A.: A controlled jump discounted model with constraints. Theory Probab. Appl. 42, 51–71 (1998)
Article Google Scholar
Piunovskiy, A., Zhang, Y.: Discounted continuous-time Markov decision processes with unbounded rates: the convex analytic approach. SIAM J. Control Optim. 49, 2032–2061 (2011)
Article MathSciNet MATH Google Scholar
Pliska, S.R.: Controlled jump processes. Stoch. Process. Appl. 3, 259–282 (1975)
Article MathSciNet MATH Google Scholar
Prieto-Rumeau, T., Hernández-Lerma, O.: Selected Topics in Continuous-Time Controlled Markov Chains and Markov Games. Imperial College Press, London (2012)
Book MATH Google Scholar
Yushkevich, A.A.: Controlled Markov models with countable state and continuous time. Theory Probab. Appl. 22, 215–235 (1977)
Article MathSciNet MATH Google Scholar
Zhang, L.L., Guo, X.P.: Constrained continuous-time Markov decision processes with average criteria. Math. Methods Oper. Res. 67, 323–340 (2008)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

School of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou, 510275, China
Xianping Guo & Yonghui Huang
Department of Mathematical Sciences, University of Liverpool, Liverpool, L69 7ZL, UK
Yi Zhang

Authors

Xianping Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yonghui Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yonghui Huang.

Additional information

Research supported by NSFC and Guangdong Province Key Laboratory of Computational Science at Sun Yat-Sen University. Y. Zhang’s work was carried out with a financial grant from the Research Fund for Coal and Steel of the European Commission, within the INDUSE-2-SAFETY project (Grant No. RFSR-CT-2014-00025).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Guo, X., Huang, Y. & Zhang, Y. Constrained Continuous-Time Markov Decision Processes on the Finite Horizon. Appl Math Optim 75, 317–341 (2017). https://doi.org/10.1007/s00245-016-9352-6

Download citation

Published: 15 April 2016
Issue Date: April 2017
DOI: https://doi.org/10.1007/s00245-016-9352-6

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Abstract

Access this article

Similar content being viewed by others

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

Finite horizon continuous-time Markov decision processes with mean and variance criteria

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Abstract

Access this article

Similar content being viewed by others

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

Finite horizon continuous-time Markov decision processes with mean and variance criteria

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation