Stochastic game Information & Stochastic game Links at HealthHaven.com
advertise
add site
services
publishers
database
health videos
Bookmark and Share

search wiki for    ?
web dir firms image gallery news pdf wiki shop video 
about
toolbar
stats
live show
health store
more stuff
JOIN/LOGIN
Featured Results:
Caritas Home Care - Get Active—and Back in the Game —with Video Game s
Caritas Home Care - Get Active—and Back in the Game—with Video Games
caritashomecare.org
 Health Game s at WebMD: Sodoku, Puzzles, Mah Jongg, Word Game s
Health Games at WebMD: Sodoku, Puzzles, Mah Jongg, Word Games
webmd.com
 St. Elizabeth's Medical Center - Get Active—and Back in the...
St. Elizabeth's Medical Center - Get Active—and Back in the...
caritasstelizabeths.org
 Science: Explanatory/Predictive Stochastic Dynamical Network Models...
Science: Explanatory/Predictive Stochastic Dynamical Network Models...
cis.jhu.edu
 

In game theory, a stochastic game, introduced by Lloyd Shapley in the early 1950s, is a dynamic game with probabilistic transitions played by one or more players. The game is played in a sequence of stages. At the beginning of each stage the game is in some state. The players select actions and each player receives a payoff that depends on the current state and the chosen actions. The game then moves to a new random state whose distribution depends on the previous state and the actions chosen by the players. The procedure is repeated at the new state and play continues for a finite or infinite number of stages. The total payoff to a player is often taken to be the discounted sum of the stage payoffs or the limit inferior of the averages of the stage payoffs.

The ingredients of a stochastic game are: a finite set of players I; a state space M (either a finite set or a measurable space (M,{\mathcal A})); for each player i\in I, an action set Si (either a finite set or a measurable space (S^i,{\mathcal S}^i)); a transition probability P from M\times S, where S=\times_{i\in I}S^i is the action profiles, to M, where P(A \mid m, s) is the probability that the next state is in A given the current state m and the current action profile s; and a payoff function g from M\times S to RI, where the i-th coordinate of g, gi, is the payoff to player i as a function of the state m and the action profile s.

The game starts at some initial state m1. At stage t, players first observe mt, then simultaneously choose actions s^i_t\in S^i, then observe the action profile s_t=(s^i_t)_i, and then nature selects mt + 1 according to the probability P(\cdot\mid m_t,s_t). A play of the stochastic game, m_1,s_1,\ldots,m_t,s_t,\ldots, defines a stream of payoffs g_1,g_2,\ldots, where gt = g(mt,st).

The discounted game Γλ with discount factor λ (0<\lambda \leq 1) is the game where the payoff to player i is \lambda \sum_{t=1}^{\infty}(1-\lambda)^{t-1}g^i_t. The n-stage game is the game where the payoff to player i is \bar{g}^i_n:=\frac1n\sum_{t=1}^ng^i_t.

The value vn(m1), respectively vλ(m1), of a two-person zero-sum stochastic game Γn, respectively Γλ, with finitely many states and actions exists, and Truman Bewley and Elon Kohlberg (1976) proved that vn(m1) converges to a limit as n goes to infinity and that vλ(m1) converges to the same limit as λ goes to 0.

The "undiscounted" game \Gamma_\infty is the game where the payoff to player i is the "limit" of the averages of the stage payoffs. Some precautions are needed in defining the value of a two-person zero-sum \Gamma_{\infty} and in defining equilibium payoffs of a non-zero-sum \Gamma_{\infty}. The uniform value v_{\infty} of a two-person zero-sum stochastic game \Gamma_\infty exists if for every \varepsilon>0 there is a positive integer N and a strategy pair \sigma_{\varepsilon} of player 1 and \tau_{\varepsilon} of player 2 such that for every σ and τ and every n\geq N the expectation of \bar{g}^i_n with respect to the probability on plays defined by \sigma_{\varepsilon} and τ is at least v_{\infty} -\varepsilon , and the expectation of \bar{g}^i_n with respect to the probability on plays defined by σ and \tau_{\varepsilon} is at most v_{\infty} +\varepsilon . Jean Francois Mertens and Abraham Neyman (1981) proved that every two-person zero-sum stochastic game with finitely many states and actions has a uniform value.

If there is a finite number of players and the action sets and the set of states are finite, then a stochastic game with a finite number of stages always has a Nash equilibrium. The same is true for a game with infinitely many stages if the total payoff is the discounted sum. Nicolas Vieille has shown that all two-person stochastic games with finite state and action spaces have approximate Nash equilibria when the total payoff is the limit inferior of the averages of the stage payoffs. Whether such equilibria exist when there are more than two players is a challenging open question.

Stochastic games have applications in economics and evolutionary biology. They are generalizations of repeated games which correspond to the special case where there is only one state.

The most complete reference is the book of articles edited by Neyman and Sorin. The more elementary book of Filar and Vrieze provides a unified rigorous treatment of the theories of Markov Decision Processes and two-person stochastic games. They coin the term Competitive MDPs to encompass both one- and two-player stochastic games.

[edit] References

  • L.S. Shapley, Stochastic games, Proc. Nat. Acad. Sciences, 39:1095-1100, 1953.
  • J. F. Mertens and A. Neyman, Stochastic Games, International Journal of Game Theory, 10:53-66, 1981.
  • N. Vieille, Stochastic games: Recent results. In Handbook of Game Theory, 1833–1850. Elsevier Science, 2002.
  • A. Neyman and S. Sorin, Stochastic Games and Applications, Kluwer Academic Press, 2003.
  • J. Filar and K. Vrieze, Competitive Markov Decision Processes, Springer-Verlag, 1997.
  • A. Condon, The complexity of stochastic games, Information and Computation, 96:203–224, 1992.



Product Results (view all...)

search wiki for    ?
web dir firms image gallery news pdf wiki shop video 



↑ top of page ↑about thumbshots