Average Optimality of Markov Decision Processes with Unbounded Costs

Hernández-Lerma, Onésimo

doi:10.1007/978-3-642-48417-9_72

Onésimo Hernández-Lerma²

192 Accesses
1 Citations

Abstract

This paper considers Markov decision processes (MDPs) with Borel state space, not necessarily compact control constraint sets, and unbounded cost functions. The objective is to present some recent results on the existence of stationary optimal policies for MDPs with an average cost (AC) criterion. These results include extensions of recent works [7, 8, 9] based on the “vanishing discount factor” approach, as well as existence results for MDPs with strictly unbounded costs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Constrained Markov decision processes in Borel spaces: from discounted to average optimality

Article 20 June 2016

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Article 15 April 2016

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

Article 27 November 2014

References

A. Arapostathis, V. Borkar, E. Fernández-Gaucherand, M. K. Ghosh, and S. I. Marcus. Controlled Markov processes with an average cost criterion: a survey. SIAM J. Control Optim. 29 (1991). In press.
Google Scholar
E. Fernández-Gaucherand, Controlled Markov Processes on the Infinite Planning Horizon: Optimal and Adaptive Control Ph. D. Thesis, Univ. of Texas at Austin, 1991.
Google Scholar
O. Hernández-Lerma, Lecture Notes on Discretc-Time Markov Control Processes. IV CLAPEM, Mexico City, 1990.
Google Scholar
O. Hernández-Lerma, Average optimality in dynamic programming on Borel spaces-unbounded costs and controls. Syst. Control Lett. 16 (1991).
Google Scholar
O. Hernández-Lerma, Average optimality of controlled Markov chains with strictly unbounded costs. Preprint, 1991.
Google Scholar
O. Hernández-Lerma, R. Montes-de-Oca and R. Cavazos-Cadena, Recurrence conditions for Markov decision processes with Borel state space:a survey. Ann. O.R. 29 (1991).
Google Scholar
O. Hernández-Lerma and J.B. Lasserre, Average cost optimal policies for Markov control processes with Borel state space and unbounded costs. Syst. Control Lett. 15 (1990), 349–356.
Article Google Scholar
M. Schal, Average optimality in dynamic programming with general state space. Preprint, 1990.
Google Scholar
L.I. Sennott, average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs. Oper. Res. 37 (1989), 626–633.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Matemáticas, CINVESTAV-IPN, A. Postal 14-740, 07000, México, D.F., México
Onésimo Hernández-Lerma

Authors

Onésimo Hernández-Lerma
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

FB IV, Mathematik, Universität Trier, Postfach 38 25, D-5500, Trier, Germany
Peter Gritzmann , Rainer Hettich , Reiner Horst & Ekkehard Sachs , , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hernández-Lerma, O. (1992). Average Optimality of Markov Decision Processes with Unbounded Costs. In: Gritzmann, P., Hettich, R., Horst, R., Sachs, E. (eds) Operations Research ’91. Physica-Verlag HD. https://doi.org/10.1007/978-3-642-48417-9_72

Download citation

DOI: https://doi.org/10.1007/978-3-642-48417-9_72
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-0608-3
Online ISBN: 978-3-642-48417-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Average Optimality of Markov Decision Processes with Unbounded Costs

Abstract

Access this chapter

Preview

Similar content being viewed by others

Constrained Markov decision processes in Borel spaces: from discounted to average optimality

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Average Optimality of Markov Decision Processes with Unbounded Costs

Abstract

Access this chapter

Preview

Similar content being viewed by others

Constrained Markov decision processes in Borel spaces: from discounted to average optimality

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation