Markov decision processes under observability constraints

Serin, Yasemin; Kulkarni, Vidyadhar

doi:10.1007/s001860400402

Published: June 2005

Volume 61, pages 311–328, (2005)

Mathematical Methods of Operations Research

Yasemin Serin¹ &
Vidyadhar Kulkarni²

Abstract

We develop an algorithm to compute optimal policies for Markov decision processes subject to constraints that result from observability restrictions on the process. We assume that the state of the Markov process is unobservable, but that there is an observable process related to the unobservable state. We therefore seek a decision rule that depends only on this observable process. The objective is to minimize the expected average cost over an infinite horizon. We also analyze the possibility of making more detailed observations to obtain improved policies.
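
To make the problem setting concrete, here is a minimal Python sketch. It is not the algorithm of the paper, which is not reproduced on this page. It assumes a hypothetical toy model in which every hidden state emits one fixed observable signal, considers only deterministic decision rules for simplicity, and finds the best such rule by brute-force enumeration. All numbers and the names `P`, `C`, `OBS`, `stationary`, `average_cost` and `best_observable_rule` are illustrative assumptions.

```python
from itertools import product

# Hypothetical toy model (not taken from the paper): 3 hidden states, 2 actions.
# P[a][i][j] = probability of moving from state i to state j under action a.
P = [
    [[0.9, 0.1, 0.0], [0.0, 0.8, 0.2], [0.1, 0.0, 0.9]],  # action 0
    [[0.5, 0.5, 0.0], [0.6, 0.2, 0.2], [0.7, 0.2, 0.1]],  # action 1
]
# C[a][i] = one-step cost incurred in state i under action a.
C = [[0.0, 1.0, 5.0], [2.0, 2.0, 2.0]]
# OBS[i] = signal observed when the hidden state is i.
# States 0 and 1 emit the same signal, so a rule cannot tell them apart.
OBS = [0, 0, 1]


def stationary(T, iters=5000):
    """Approximate the stationary distribution of an ergodic chain T
    by repeated multiplication (power iteration)."""
    n = len(T)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * T[i][j] for i in range(n)) for j in range(n)]
    return pi


def average_cost(rule):
    """Long-run expected average cost of a rule, where rule[o] is the
    action taken whenever signal o is observed."""
    n = len(OBS)
    T = [P[rule[OBS[i]]][i] for i in range(n)]  # induced transition matrix
    c = [C[rule[OBS[i]]][i] for i in range(n)]  # induced cost vector
    pi = stationary(T)
    return sum(pi[i] * c[i] for i in range(n))


def best_observable_rule():
    """Enumerate every deterministic signal-to-action rule and return
    the one with the lowest average cost."""
    n_obs = max(OBS) + 1
    n_act = len(P)
    return min(product(range(n_act), repeat=n_obs), key=average_cost)
```

Calling `best_observable_rule()` returns a tuple with one action per signal. Enumeration grows exponentially with the number of signals, so it only serves to illustrate the constraint that states sharing a signal must share an action.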

Author information

Authors and Affiliations

Industrial Engineering Department, Middle East Technical University, 06531 Ankara, Turkey
Yasemin Serin
Operations Research Department, University of North Carolina at Chapel Hill, USA
Vidyadhar Kulkarni

Corresponding author

Correspondence to Yasemin Serin.

Additional information

Manuscript received: March 2004/Final version received: June 2004

About this article

Cite this article

Serin, Y., Kulkarni, V. Markov decision processes under observability constraints. Math Meth Oper Res 61, 311–328 (2005). https://doi.org/10.1007/s001860400402

Issue Date: June 2005
DOI: https://doi.org/10.1007/s001860400402
