
Value iteration in countable state average cost Markov decision processes with unbounded costs

  • Research Contributions
  • Published in: Annals of Operations Research

Abstract

We deal with countable state Markov decision processes with finite action sets and (possibly) unbounded costs. Assuming the existence of an expected average cost optimal stationary policy $f$, with expected average cost $g$, when can $f$ and $g$ be found using undiscounted value iteration? We give assumptions guaranteeing the convergence of a quantity related to $ng - v_n(i)$, where $v_n(i)$ is the minimum expected $n$-stage cost when the process starts in state $i$. The theory is applied to a queueing system with variable service rates and to a queueing system with variable arrival parameter.
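As a rough illustration of the quantity in the abstract, the sketch below runs undiscounted value iteration on a small truncated single-server queue with two selectable service rates. The model and all of its parameters (N, ARRIVAL_P, SERVICE_PS, SERVICE_COSTS, HOLDING_COST) are illustrative assumptions, not the paper's actual examples; the differences $v_n(i) - v_{n-1}(i)$ are tracked as a proxy for the convergence behaviour of $ng - v_n(i)$.

```python
# Minimal sketch: undiscounted value iteration for an average-cost MDP.
# The model is a hypothetical discrete-time single-server queue, truncated
# at N, with two selectable service rates -- an illustrative stand-in for
# the paper's queueing examples, not their actual specification.
import numpy as np

N = 50                       # truncation level for the countable state space (assumed)
ARRIVAL_P = 0.4              # per-slot arrival probability (assumed)
SERVICE_PS = [0.3, 0.6]      # selectable service-completion probabilities (assumed)
SERVICE_COSTS = [0.0, 2.0]   # per-stage cost of each service rate (assumed)
HOLDING_COST = 1.0           # per-customer holding cost, so costs grow with i (assumed)

def stage_cost(i, a):
    """One-stage cost: holding cost for i customers plus the service-rate cost."""
    return HOLDING_COST * i + SERVICE_COSTS[a]

def transitions(i, a):
    """Yield (next_state, probability) pairs for state i under action a."""
    p = ARRIVAL_P
    q = SERVICE_PS[a] if i > 0 else 0.0   # no service completion in an empty queue
    for step, pr in ((1, p * (1 - q)), (-1, (1 - p) * q),
                     (0, p * q + (1 - p) * (1 - q))):
        if pr > 0.0:
            yield min(max(i + step, 0), N), pr

v = np.zeros(N + 1)          # v_0(i) = 0
for n in range(1, 1001):
    # Dynamic programming step: v_n(i) = min_a [ c(i,a) + sum_j p(j|i,a) v_{n-1}(j) ]
    v_new = np.array([
        min(stage_cost(i, a) + sum(pr * v[j] for j, pr in transitions(i, a))
            for a in range(len(SERVICE_PS)))
        for i in range(N + 1)
    ])
    v, diff = v_new, v_new - v
    # v_n(i) - v_{n-1}(i) should flatten out to the optimal average cost g.
    if diff.max() - diff.min() < 1e-9:
        break

g_estimate = diff[0]
policy = [min(range(len(SERVICE_PS)),
              key=lambda a, i=i: stage_cost(i, a)
              + sum(pr * v[j] for j, pr in transitions(i, a)))
          for i in range(N + 1)]
print(f"after n = {n} iterations, g ~ {g_estimate:.6f}")
```

Once the span max(diff) - min(diff) is small, the difference at any fixed state approximates $g$, and the minimizing actions give a candidate stationary policy $f$; the paper's contribution is the set of assumptions under which this procedure is actually guaranteed to converge for countable state spaces and unbounded costs.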

Cite this article

Sennott, L.I. Value iteration in countable state average cost Markov decision processes with unbounded costs. Ann Oper Res 28, 261–271 (1991). https://doi.org/10.1007/BF02055585
