Markov Decision Processes in Semi-Markov Environments

doi:10.1007/978-0-387-36951-8_6

Part of the book series: Advances in Mechanics and Mathematics (AMMA, volume 14)

In this chapter, we study Markov decision processes (MDPs) in semi-Markov environments under the discounted criterion. The model describes a system that can itself be modeled by a Markov decision process but is influenced by an environment modeled by a semi-Markov process. The environment affects the system whenever the environment state changes, in three ways: (1) an instantaneous transition of the system state, (2) an instantaneous reward, and (3) a change in the parameters of the Markov decision process. We first study continuous-time MDPs (CTMDPs) and then semi-Markov decision processes (SMDPs) in semi-Markov environments. Building on these, we study mixed MDPs in a semi-Markov environment, where the underlying MDP model is either a CTMDP or an SMDP depending on which environment state is entered. The standard results are obtained for all of these models.
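The three kinds of environmental influence listed above can be made concrete with a minimal simulation sketch. It is not the chapter's construction: it assumes a two-state system under a fixed policy (so that within each environment state the CTMDP reduces to a continuous-time Markov chain), a two-state environment with uniformly distributed sojourn times, and made-up numbers throughout. All names (`PARAMS`, `env_sojourn`, `env_change`, `simulate`) are hypothetical.

```python
import math
import random

# Hypothetical parameters (illustrative only, not taken from the chapter).
ALPHA = 0.1  # discount rate

# For each environment state k: the rate at which the system leaves state s
# under a fixed policy, and the reward rate earned while in s. The system has
# two states, 0 and 1, and a system jump from s always goes to 1 - s.
PARAMS = {
    0: {"rate": [1.0, 2.0], "reward": [1.0, 0.0]},
    1: {"rate": [0.5, 0.5], "reward": [0.0, 3.0]},
}

def env_sojourn(k, rng):
    """Semi-Markov environment: non-exponential sojourn time in env state k."""
    return rng.uniform(1.0, 3.0) if k == 0 else rng.uniform(0.5, 1.0)

def env_change(k, s):
    """Effect of the environment leaving k while the system is in s.

    Returns (new env state, new system state, instantaneous reward).
    """
    new_k = 1 - k
    new_s = 0                      # (1) instantaneous system transition
    lump = 2.0 if s == 1 else 0.0  # (2) instantaneous reward
    return new_k, new_s, lump      # (3) parameters switch to PARAMS[new_k]

def discounted_integral(rate, t0, t1):
    """Integral of rate * exp(-ALPHA * t) over [t0, t1]."""
    return rate * (math.exp(-ALPHA * t0) - math.exp(-ALPHA * t1)) / ALPHA

def simulate(horizon, seed=0):
    """Discounted reward of one sample path, truncated at `horizon`."""
    rng = random.Random(seed)
    t, k, s, total = 0.0, 0, 0, 0.0
    next_env = env_sojourn(k, rng)
    while t < horizon:
        p = PARAMS[k]
        # Exponential holding time of the system; redrawing it after every
        # event is valid because the exponential law is memoryless.
        next_sys = t + rng.expovariate(p["rate"][s])
        t_next = min(next_sys, next_env, horizon)
        total += discounted_integral(p["reward"][s], t, t_next)
        t = t_next
        if t >= horizon:
            break
        if next_env <= next_sys:
            # Environment state changes: apply all three effects at time t.
            k, s, lump = env_change(k, s)
            total += math.exp(-ALPHA * t) * lump
            next_env = t + env_sojourn(k, rng)
        else:
            # Ordinary system jump within the current environment state.
            s = 1 - s
    return total
```

Averaging `simulate(200.0, seed=i)` over many seeds would give a Monte Carlo estimate of the expected discounted reward of this one fixed policy; optimizing over policies is not attempted in this sketch.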

Cite this chapter

(2008). Markov Decision Processes in Semi-Markov Environments. In: Markov Decision Processes With Their Applications. Advances in Mechanics and Mathematics, vol 14. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-36951-8_6

DOI: https://doi.org/10.1007/978-0-387-36951-8_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-36950-1
Online ISBN: 978-0-387-36951-8
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
