Abstract
Two-person zero-sum stochastic games with finite state and action spaces are considered. The expected average payoff criterion is introduced. In the special case of single controller games it is shown that the optimal stationary policies and the value of the game can be obtained from the optimal solutions to a pair of dual programs. For multichain structures, a decomposition algorithm is given which produces such optimal stationary policies for both players. In the case of both players controlling the transitions, a generalized game is obtained, the solution of which gives the optimal policies.
Similar content being viewed by others
References
J. Bather, Optimal decision procedures in finite Markov chains. Part III: general convex systems, Adv. in Appl. Prob. 5 (1973) 541–553.
T. Bewley and E. Kohlberg, The asymptotic theory of stochastic games, Math. Oper. Res. 1 (1976) 197–208.
T. Bewley and E. Kohlberg, On stochastic games with stationary optimal strategies, Math. Oper. Res. 3 (1978) 104–125.
D. Blackwell and T.S. Ferguson, The big match, Ann. Math. Stat. 39 (1968) 159–163.
K.C. Border,Fixed Point Theorems with Applications to Economics and Game Theory (Cambridge Univ. Press, 1985).
A. Federgruen, Successive approximation methods in undiscounted stochastic games, Oper. Res. 28 (1980) 794–809.
J.A. Filar, Ordered field property for stochastic games when the player who controls transitions changes from state to state, J. Optim. Theory Appl. 34 (1981) 503–515.
J.A. Filar, On stochastic equilibria of a single-controller stochastic game, Math. Programming 30 (1984) 313–325.
J.A. Filar, Quadratic programming and the single-controller stochastic game, J. Math. Anal. Appl. 113 (1986) 136–147.
J.A. Filar and T.A. Schultz, Nonlinear programming and stationary strategies in stochastic games, Math. Programming 32 (1986) 243–247.
J.A. Filar, T.A. Schultz, F. Thuijsman and D.J. Vrieze, Nonlinear programming and stationary equilibria in stochastic games, Technical report, UMBC (August 1987).
A.J. Hoffman and R.M. Karp, On nonterminating stochastic games, Manag. Sci. 12(5) (1966) 359–370.
L.C.M. Kallenberg,Linear Programming and Finite Markovian Control Problems, vol. 148 (Mathematical Centre Tracts, Amsterdam, 1983).
M. Loeve,Probability Theory, vol. 2 (Springer, New York, 1978).
J.F. Mertens and A. Newman, Stochastic games, Int. J. Game Theory 10(2) (1981) 53–66.
T. Parthasarathy and T.E.S. Raghavan, An orderfield property for stochastic games when one player controls transition probabilities, J. Optim. Theory Appl. 33(3) (1981) 375–392.
K.W. Ross and R. Varadarajan, Multichain Markov decision processes with a sample path constraint: A decomposition approach, to appear in Math. Oper. Res.
L.S. Shapley, Stochastic games, Proc. Nat. Acad. Sci. USA 39 (1953) 1095–1100.
O.J. Vrieze, Linear programming and undiscounted stochastic games in which one player controls transitions, OR Spektrum 3 (1981) 29–35.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Baykal-Gürsoy, M. Two-person zero-sum stochastic games. Ann Oper Res 28, 135–152 (1991). https://doi.org/10.1007/BF02055578
Issue Date:
DOI: https://doi.org/10.1007/BF02055578