In this chapter, we deal with Markov decision processes in semi-Markov environments with the discounted criterion. The model can describe such a system that itself can be modeled by a Markov decision process, but the system is influenced by its environment which is modeled by a semi-Markov process. The influence of the environment on the system occurs when the environment state changes, and consists of the following three things: (1) an instantaneous state (of the system) transition, (2) an instantaneous reward, and (3) the parameters of the Markov decision process change. We study CTMDPs and then SMDPs in semi-Markov environments. Based on them, we study mixed MDPs in a semi-Markov environment, where the underlying MDP model can be either CTMDPs or SMDPs according to which environment states are entered. The criterion considered is the discounted criterion here. The standard results for all the models are obtained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
Rights and permissions
Copyright information
© 2008 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
(2008). Markovdecisionprocessesinsemi-Markov Environments. In: Markov Decision Processes With Their Applications. Advances in Mechanics and Mathematics, vol 14. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-36951-8_6
Download citation
DOI: https://doi.org/10.1007/978-0-387-36951-8_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-36950-1
Online ISBN: 978-0-387-36951-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)