×

Nash \(\epsilon\)-equilibria for stochastic games with total reward functions: an approach through Markov decision processes. (English) Zbl 1449.91018

Summary: The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an \(\epsilon\)-equilibrium. To reach this goal, the results of Markov decision processes are used to find \(\epsilon\)-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani’s fixed point theorem to obtain the \(\epsilon\)-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.

MSC:

91A15 Stochastic games, stochastic differential games
91A05 2-person games
90C40 Markov and semi-Markov decision processes
PDF BibTeX XML Cite
Full Text: DOI

References:

[1] Aliprantis, C. D.; Border, K. C., Infinite Dimensional Analysis., Springer 2006 · Zbl 1156.46001
[2] Ash, R. B., Real Analysis and Probability., Academic Press, New York 1972 · Zbl 1381.28001
[3] Bartle, R., The Elements of Real Analysis., John Wiley and Sons, Inc. 1964 · Zbl 0116.32302
[4] Cavazos-Cadena, R.; Montes-de-Oca, R., Optimal and nearly optimal policies in Markov decision chains with nonnegative rewards and risk-sensitive expected total-reward criterion., In: Markov Processes and Controlled Markov Chains 2002 (Z. Hou, J. A. Filar and A. Chen, eds.), Kluwer Academic Publishers, pp. 189-221 · Zbl 1068.90606
[5] Filar, J.; Vrieze, K., Competitive Markov Decision Processes., Springer-Verlag, New York 1997 · Zbl 0934.91002
[6] Habil, E. D., Double sequences and double series., The Islamic Univ. J., Series of Natural Studies and Engineering 14 (2006), 1-32. (This reference is available at the Islamic University Journal’s site: http://journal.iugaza.edu.ps/index.php/IUGNS/article/view/1594/1525.)
[7] Hernández-Lerma, O.; Lasserre, J. B., Discrete-Time Markov Control Processes: Basic Optimality Criteria., Springer-Verlag, New York 1996 · Zbl 0840.93001
[8] Hordijk, A., Dynamic Programming and Markov Potential Theory., Mathematical Centre Tracts 51, Amsterdam 1974 · Zbl 0284.49012
[9] Jaśkiewicz, A.; Nowak, A. S., Stochastic games with unbounded payoffs: Applications to robust control in Economics., Dyn. Games Appl. 1 (2011), 2, 253-279 · Zbl 1263.91008
[10] Kakutani, S., A generalization of Brouwer’s fixed point theorem., Duke Math. J. 8 (1942), 457-459 · JFM 67.0742.03
[11] Kelley, J. L., General Topology., Springer, New York 1955 · Zbl 0066.16604
[12] Köthe, G., Topological Vector Spaces I., Springer-Verlag, 1969
[13] Puterman, M., Markov Decision Processes., John Wiley and Sons, Inc. New Jersey 1994 · Zbl 1184.90170
[14] Shapley, L. S., Stochastic games., Proc. Nat. Acad. Sci. U. S. A. 39 (1953), 1095-1100 · Zbl 1180.91042
[15] Thuijsman, F., Optimality and Equilibria in Stochastic Games., CW1 Tract-82, Amsterdam 1992 · Zbl 0765.90097
[16] Zeidler, E., Nonlinear Functional Analysis and its Applications., Springer-Verlag, New York Inc. 1988
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.