Abstract
This paper considers Markov decision processes (MDPs) with Borel state space, possibly noncompact control constraint sets, and unbounded cost functions. The objective is to present recent results on the existence of stationary optimal policies for MDPs under the average cost (AC) criterion. These results include extensions of the recent works [7, 8, 9], which are based on the “vanishing discount factor” approach, as well as existence results for MDPs with strictly unbounded costs.
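For orientation, the average cost criterion and the vanishing discount factor approach named in the abstract are usually formalized along the following lines. This is a sketch in standard notation and is not taken from the chapter itself: the symbols c (one-stage cost), x_t and a_t (state and control at time t), π (policy), Q (transition kernel), A(x) (control constraint set), V_α (optimal α-discounted cost), z (a fixed reference state), h_α and ρ_α are all assumptions introduced here for illustration, and the minimum is assumed to be attained.

```latex
% Average cost of a policy \pi from initial state x (assumed notation)
J(\pi, x) := \limsup_{n \to \infty} \frac{1}{n}\,
  E_x^{\pi}\!\left[ \sum_{t=0}^{n-1} c(x_t, a_t) \right]

% Optimal \alpha-discounted cost, 0 < \alpha < 1
V_\alpha(x) := \inf_{\pi} E_x^{\pi}\!\left[ \sum_{t=0}^{\infty} \alpha^{t} c(x_t, a_t) \right]

% Discounted optimality equation
V_\alpha(x) = \min_{a \in A(x)} \left[ c(x,a) + \alpha \int V_\alpha(y)\, Q(dy \mid x, a) \right]

% Normalize at a fixed reference state z
h_\alpha(x) := V_\alpha(x) - V_\alpha(z), \qquad \rho_\alpha := (1-\alpha)\, V_\alpha(z)

% Subtracting V_\alpha(z) from both sides of the discounted equation gives
\rho_\alpha + h_\alpha(x) = \min_{a \in A(x)} \left[ c(x,a) + \alpha \int h_\alpha(y)\, Q(dy \mid x, a) \right]
```

The "vanishing discount" step lets α increase to 1 in the last relation. Under suitable conditions on the model, limits of ρ_α and h_α yield an average cost optimality equation or inequality, from which a stationary AC-optimal policy can be obtained.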
References
[1] A. Arapostathis, V. Borkar, E. Fernández-Gaucherand, M. K. Ghosh, and S. I. Marcus, Controlled Markov processes with an average cost criterion: a survey. SIAM J. Control Optim. 29 (1991). In press.
[2] E. Fernández-Gaucherand, Controlled Markov Processes on the Infinite Planning Horizon: Optimal and Adaptive Control. Ph.D. Thesis, Univ. of Texas at Austin, 1991.
[3] O. Hernández-Lerma, Lecture Notes on Discrete-Time Markov Control Processes. IV CLAPEM, Mexico City, 1990.
[4] O. Hernández-Lerma, Average optimality in dynamic programming on Borel spaces - unbounded costs and controls. Syst. Control Lett. 16 (1991).
[5] O. Hernández-Lerma, Average optimality of controlled Markov chains with strictly unbounded costs. Preprint, 1991.
[6] O. Hernández-Lerma, R. Montes-de-Oca, and R. Cavazos-Cadena, Recurrence conditions for Markov decision processes with Borel state space: a survey. Ann. Oper. Res. 29 (1991).
[7] O. Hernández-Lerma and J. B. Lasserre, Average cost optimal policies for Markov control processes with Borel state space and unbounded costs. Syst. Control Lett. 15 (1990), 349–356.
[8] M. Schäl, Average optimality in dynamic programming with general state space. Preprint, 1990.
[9] L. I. Sennott, Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs. Oper. Res. 37 (1989), 626–633.
© 1992 Physica-Verlag Heidelberg
Cite this paper
Hernández-Lerma, O. (1992). Average Optimality of Markov Decision Processes with Unbounded Costs. In: Gritzmann, P., Hettich, R., Horst, R., Sachs, E. (eds) Operations Research ’91. Physica-Verlag HD. https://doi.org/10.1007/978-3-642-48417-9_72
DOI: https://doi.org/10.1007/978-3-642-48417-9_72
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-0608-3
Online ISBN: 978-3-642-48417-9
eBook Packages: Springer Book Archive