Search
2014 Volume 29
Article Contents
RESEARCH ARTICLE   Open Access    

Repeated games for multiagent systems: a survey

More Information
  • Abstract: Repeated games are an important mathematical formalism to model and study long-term economic interactions between multiple self-interested parties (individuals or groups of individuals). They open attractive perspectives in modeling long-term multiagent interactions. This overview paper discusses the most important results that actually exist for repeated games. These results arise from both economics and computer science. Contrary to a number of existing surveys of repeated games, most of which originated from the economic research community, we are first to pay a special attention to a number of important distinctive features proper to artificial agents. More precisely, artificial agents, as opposed to the human agents mainly aimed by the economic research, are usually bounded whether in terms of memory or performance. Therefore, their decisions have to be based on the strategies defined using finite representations. Furthermore, these strategies have to be efficiently computed or approximated using a limited computational resource usually available to artificial agents.
  • 加载中
  • Abreu D.1986. Extremal equilibria of oligopolistic supergames. Journal of Economic Theory39(1), 191–225.

    Google Scholar

    Abreu D.1988. On the theory of infinitely repeated games with discounting. Econometrica56, 383–396.

    Google Scholar

    Abreu D., Pearce D., Stacchetti E.1990. Toward a theory of discounted repeated games with imperfect monitoring. Econometrica58(5), 1041–1063.

    Google Scholar

    Abreu D., Rubinstein A.1988. The structure of Nash equilibrium in repeated games with finite automata. Econometrica56(6), 1259–1281.

    Google Scholar

    Aumann R.1981. Survey of repeated games. Essays in Game Theory and Mathematical Economics in Honor of Oskar Morgenstern, 11–42.

    Google Scholar

    Aumann R., Maschler M., Stearns R.1995. Repeated Games With Incomplete Information. The MIT press.

    Google Scholar

    Aumann R., Shapley L.1994. Long term competition: a game theoretic analysis. Essays in Game Theory in Honor of Michael Maschler, 1–15.

    Google Scholar

    Banerjee B., Peng J.2003. Adaptive policy gradient in multiagent learning. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS'03). ACM Press, 686–692.

    Google Scholar

    Banks J., Sundaram R.1990. Repeated games, finite automata, and complexity. Games and Economic Behavior2(2), 97–117.

    Google Scholar

    Ben-Porath E.1990. The complexity of computing a best response automaton in repeated games with mixed strategies. Games and Economic Behavior2(1), 1–12.

    Google Scholar

    Ben-Porath E.1993. Repeated games with finite automata. Journal of Economic Theory59, 17–39.

    Google Scholar

    Ben-Porath E., Peleg B.1987. On the Folk Theorem and Finite Automata. Mimeo, Hebrew University of Jerusalim.

    Google Scholar

    Ben-Sasson E., Kalai A. T., Kalai E.2007. An approach to bounded rationality. In Advances in Neural Information Processing Systems 19, Schisölkopf, B., Platt J. & Hoffman, T. (eds). MIT Press, 145–152.

    Google Scholar

    Benoit J.-P., Krishna V.1985. Finitely repeated games. Econometrica53(4), 905–922.

    Google Scholar

    Benoit J.-P., Krishna V.1999. The Folk Theorems for Repeated Games: A Synthesis. Mimeo, Pennsylvania State University.

    Google Scholar

    Bernstein D., Givan R., Immerman N., Zilberstein S.2003. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research27(4), 819–840.

    Google Scholar

    Berry D., Fristedt B.1985. Bandit Problems. Chapman and Hall London.

    Google Scholar

    Bhaskar V., Obara I.2002. Belief-based equilibria in the repeated Prisoners’ Dilemma with private monitoring. Journal of Economic Theory102(1), 40–69.

    Google Scholar

    Borgs C., Chayes J., Immorlica N., Kalai A., Mirrokni V., Papadimitriou C.2008. The myth of the folk theorem. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC'08). ACM Press, 365–372.

    Google Scholar

    Bowling M., Veloso M.2002. Multiagent learning using a variable learning rate. Artificial Intelligence136(2), 215–250.

    Google Scholar

    Burkov A., Chaib-draa B.2009. Effective learning in the presence of adaptive counterparts. Journal of Algorithms64(4), 127–138.

    Google Scholar

    Burkov A., Chaib-draa B.2010. An approximate subgame-perfect equilibrium computation technique for repeated games. In Proceedings of Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI'10). AAAI Press, 729–736.

    Google Scholar

    Chen X., Deng X.2006. Settling the complexity of two-player Nash equilibrium. In Proceedings of the 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06). IEEE Computer Society, 261–272.

    Google Scholar

    Cheng S., Reeves D., Vorobeychik Y., Wellman M.2004. Notes on equilibria in symmetric games. In AAMAS-04 Workshop on Game-Theoretic and Decision-Theoretic Agents.

    Google Scholar

    Compte O.1998. Communication in repeated games with imperfect private monitoring. Econometrica66(3), 597–626.

    Google Scholar

    Conitzer V., Sandholm T.2007. AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. Machine Learning67(1), 23–43.

    Google Scholar

    Cronshaw M.1997. Algorithms for finding repeated game equilibria. Computational Economics10(2), 139–168.

    Google Scholar

    Cronshaw M., Luenberger D.1994. Strongly symmetric subgame perfect equilibria in infinitely repeated games with perfect monitoring and discounting. Games and Economic Behavior6(2), 220–237.

    Google Scholar

    Daskalakis C., Goldberg P., Papadimitriou C.2006. The complexity of computing a Nash equilibrium. In Proceedings of the Thirty-Eighth Annual ACM Symposium on Theory of Computing (STOC'06). ACM Press, 71–78.

    Google Scholar

    Ely J., Hörner J., Olszewski W.2005. Belief-free equilibria in repeated games. Econometrica73(2), 377–415.

    Google Scholar

    Ely J., Valimaki J.2002. A robust folk theorem for the prisoner's dilemma. Journal of Economic Theory102(1), 84–105.

    Google Scholar

    Friedman J.1971. A non-cooperative equilibrium for supergames. The Review of Economic Studies, 1–12.

    Google Scholar

    Fudenberg D., Kreps D., Maskin E.1990. Repeated games with long-run and short-run players. The Review of Economic Studies57(4), 555–573.

    Google Scholar

    Fudenberg D., Levine D.1991. An approximate folk theorem with imperfect private information. Journal of Economic Theory54(1), 26–47.

    Google Scholar

    Fudenberg D., Levine D., Maskin E.1994. The folk theorem with imperfect public information. Econometrica62(5), 997–1039.

    Google Scholar

    Fudenberg D., Levine D., Takahashi S.2007. Perfect public equilibrium when players are patient. Games and Economic Behavior61(1), 27–49.

    Google Scholar

    Fudenberg D., Tirole J.1991. Game Theory. MIT Press.

    Google Scholar

    Gilboa I.1988. The complexity of computing best-response automata in repeated games. Journal of economic theory45(2), 342–352.

    Google Scholar

    Gossner O., Tomala T.2009. Repeated games. Encyclopedia of Complexity and Systems Science, forthcoming.

    Google Scholar

    Hart S., Mansour Y.2007. The communication complexity of uncoupled Nash equilibrium procedures. In Proceedings of the Thirty-Ninth Annual ACM Symposium on Theory of Computing (STOC'07). ACM Press, 345–353.

    Google Scholar

    Hörner J., Lovo S.2009. Belief-free equilibria in games with incomplete information. Econometrica77(2), 453–487.

    Google Scholar

    Hörner J., Olszewski W.2006. The folk theorem for games with private almost-perfect monitoring. Econometrica74(6), 1499–1544.

    Google Scholar

    Hörner J., Olszewski W.2007. How robust is the folk theorem with imperfect public monitoring. Northwestern University.

    Google Scholar

    Jong S., Tuyls K., Verbeeck K.2008. Fairness in multi-agent systems. The Knowledge Engineering Review23(2), 153–180.

    Google Scholar

    Judd K., Yeltekin S., Conklin J.2003. Computing supergame equilibria. Econometrica71(4), 1239–1254.

    Google Scholar

    Kaelbling L., Littman M., Cassandra A.1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence101(1–2), 99–134.

    Google Scholar

    Kalai E., Stanford W.1988. Finite rationality and interpersonal complexity in repeated games. Econometrica56(2), 397–410.

    Google Scholar

    Kandori M., Obara I.2006. Efficiency in repeated games revisited: the role of private strategies. Econometrica74(2), 499–519.

    Google Scholar

    Kreps D., Wilson R.1982. Sequential equilibria. Econometrica: Journal of the Econometric Society, 863–894.

    Google Scholar

    Kushilevitz E., Nisan N.1997. Communication Complexity. Cambridge University Press.

    Google Scholar

    Laraki R.2002. Repeated games with lack of information on one side: the dual differential approach. Mathematics of Operations Research27(2), 419–440.

    Google Scholar

    Lehrer E., Pauzner A.1999. Repeated games with differential time preferences. Econmetrica67(2), 393–412.

    Google Scholar

    Lehrer E., Yariv L.1999. Repeated games with incomplete information on one side: the case of different discount factors. Mathematics of Operations Research24(1), 204–218.

    Google Scholar

    Lipman B., Wang R.2000. Switching costs in frequently repeated games. Journal of Economic Theory93(2), 149–190.

    Google Scholar

    Lipman B., Wang R.2009. Switching costs in infinitely repeated games. Games and Economic Behavior66(1), 292–314.

    Google Scholar

    Littman M., Stone P.2005. A polynomial-time Nash equilibrium algorithm for repeated games. Decision Support Systems39(1), 55–66.

    Google Scholar

    Mailath G., Matthews S., Sekiguchi T.2002. Private strategies in finitely repeated games with imperfect public monitoring. Contributions to Theoretical Economics2(1), 1046.

    Google Scholar

    Mailath G., Morris S.2002. Repeated games with almost-public monitoring. Journal of Economic Theory102(1), 189–228.

    Google Scholar

    Mailath G., Samuelson L.2006. Repeated Games and Reputations: Long-run Relationships. Oxford University Press.

    Google Scholar

    Matsushima H.2004. Repeated games with private monitoring: two players. Econometrica72(3), 823–852.

    Google Scholar

    Mertens J., Sorin S., Zamir S.1994. Repeated games, Part A: background material. CORE Discussion Papers, 9420.

    Google Scholar

    Myerson R.1991. Game Theory: Analysis of Conflict. Harvard University Press.

    Google Scholar

    Nash J.1950. Equilibrium points in n-person games. Proceedings of the National Academy of Sciences of the United States of America36(1), 48–49.

    Google Scholar

    Neme A., Quintas L.1995. Subgame perfect equilibrium of repeated games with implementation costs. Journal of Economic Theory66(2), 599–608.

    Google Scholar

    Neyman A.1985. Bounded complexity justifies cooperation in the finitely repeated Prisoner's Dilemma. Economics Letters19(3), 227–229.

    Google Scholar

    Neyman A.1995. Cooperation, repetition, and automata. In Cooperation: Game Theoretic Approaches, volume 155 of NATO ASI Series F. Springer-Verlag, 233–255.

    Google Scholar

    Neyman A.1998. Finitely repeated games with finite automata. Mathematics of Operations Research23(3), 513–552.

    Google Scholar

    Obara I.2009. Folk theorem with communication. Journal of Economic Theory144(1), 120–134.

    Google Scholar

    Osborne M., Rubinstein A.1994. A Course in Game Theory. MIT Press.

    Google Scholar

    Papadimitriou C.1992. On players with a bounded number of states. Games and Economic Behavior4(1), 122–131.

    Google Scholar

    Papadimitriou C., Yannakakis M.1988. Optimization, approximation, and complexity classes. In Proceedings of the Twentieth Annual ACM Symposium on Theory of Computing. ACM Press, 229–234.

    Google Scholar

    Papadimitriou C., Yannakakis M.1994. On complexity as bounded rationality (extended abstract). In Proceedings of the Twenty-Sixth Annual ACM Symposium on Theory of Computing. ACM Press, 726–733.

    Google Scholar

    Pearce D.1992. Repeated games: cooperation and rationality. In Advances in Economic Theory: Sixth World Congress, vol. 1. Cambridge University Press, 132–174.

    Google Scholar

    Piccione M.2002. The repeated prisoner's dilemma with imperfect private monitoring. Journal of Economic Theory102(1), 70–83.

    Google Scholar

    Radner R.1986. Repeated partnership games with imperfect monitoring and no discounting. The Review of Economic Studies53(1), 43–57.

    Google Scholar

    Ramchurn S., Huynh D., Jennings N.2004. Trust in multi-agent systems. The Knowledge Engineering Review19(1), 1–25.

    Google Scholar

    Rasmusen E.1994. Games and Information. Blackwell Cambridge.

    Google Scholar

    Renault J., Scarlatti S., Scarsini M.2005. A folk theorem for minority games. Games and Economic Behavior53(2), 208–230.

    Google Scholar

    Renault J., Scarlatti S., Scarsini M.2008. Discounted and finitely repeated minority games with public signals. Mathematical Social Sciences56(1), 44–74.

    Google Scholar

    Rubinstein A.1979. Equilibrium in supergames with the overtaking criterion. Journal of Economic Theory21(1), 1–9.

    Google Scholar

    Russell S., Norvig P.2009. Artificial Intelligence: A Modern Approach, 3rd edn.Prentice Hall.

    Google Scholar

    Sekiguchi T.1997. Efficiency in repeated Prisoner's Dilemma with private monitoring. Journal of Economic Theory76(2), 345–361.

    Google Scholar

    Sorin S.1986. On repeated games with complete information. Mathematics of Operations Research11(1), 147–160.

    Google Scholar

    Sutton R. S., Barto A. G.1998. Reinforcement Learning: An Introduction. The MIT Press.

    Google Scholar

    Zemel E.1989. Small talk and cooperation: a note on bounded rationality. Journal of Economic Theory49(1), 1–9.

    Google Scholar

  • Cite this article

    Andriy Burkov, Brahim Chaib-Draa. 2014. Repeated games for multiagent systems: a survey. The Knowledge Engineering Review 29(1)1−30, doi: 10.1017/S026988891300009X
    Andriy Burkov, Brahim Chaib-Draa. 2014. Repeated games for multiagent systems: a survey. The Knowledge Engineering Review 29(1)1−30, doi: 10.1017/S026988891300009X

Article Metrics

Article views(16) PDF downloads(197)

Other Articles By Authors

RESEARCH ARTICLE   Open Access    

Repeated games for multiagent systems: a survey

The Knowledge Engineering Review  29 2014, 29(1): 1−30  |  Cite this article

Abstract: Abstract: Repeated games are an important mathematical formalism to model and study long-term economic interactions between multiple self-interested parties (individuals or groups of individuals). They open attractive perspectives in modeling long-term multiagent interactions. This overview paper discusses the most important results that actually exist for repeated games. These results arise from both economics and computer science. Contrary to a number of existing surveys of repeated games, most of which originated from the economic research community, we are first to pay a special attention to a number of important distinctive features proper to artificial agents. More precisely, artificial agents, as opposed to the human agents mainly aimed by the economic research, are usually bounded whether in terms of memory or performance. Therefore, their decisions have to be based on the strategies defined using finite representations. Furthermore, these strategies have to be efficiently computed or approximated using a limited computational resource usually available to artificial agents.

    • At this point, let the word ‘rational’ simply stand for ‘the one whose goal is to maximize its own payoff’.

    • This is true for certain choices of the players’ long-term payoff criteria. Different criteria will be described further.

    • Another definition of a mixed strategy is also possible. If we denote by $$\[--><$>{{\rSigma }_i} <$><!--$$ the set of all pure strategies available to player $$\[--><$> i <$><!--$$, then player $$\[--><$> i <$><!--$$'s mixed strategy can be defined as a mapping $$\[--><$>{{\rho }_i}:H\, \mapsto \,{{{\rm{\rSigma }}}_i} <$><!--$$. It can be verified that these two definitions are equivalent (Mertens et al., 1994).

    • In the notation γt, t is the power of γ and not a superscript.

    • The support of a mixed action $$\[--><$>{{\alpha }_i} \in {\rm{\rDelta }}({{A}_i}) <$><!--$$ are those pure actions $$\[--><$>{{ a}_i} \in {{A}_i} <$><!--$$ that have a non-zero probability in $$\[--><$>{{\alpha }_i} <$><!--$$.

    • Recall that an outcome path is defined as $$\[--><$>{{{\bf a}}}\, \equiv \,({{a}^0} ,{{a}^1} ,\, \ldots ) <$><!--$$ with $$\[--><$>{{ a}^t} \in A <$><!--$$.

    • The assumption of the subsequent punishment on the part of Player 1 is a corollary of the one-shot deviation principle. We have already mentioned in the proof of Theorem 3 that we can assume that the deviator (say Player 1) deviates only once and then it follows the prescribed strategy. In practice, this means that it punishes Player 2 for not being punished by Player 2.

    • More precisely, the problem of computing a Nash equilibrium profile for an n-player repeated game with n >2 is at least as complex as the problem of computing a Nash equilibrium in an (n − 1)-player stage-game. The latter problem does not have efficient algorithms solving it. Furthermore, this problem has recently been shown to be PPAD-complete (see Papadimitriou & Yannakakis, 1988; Chen & Deng, 2006; Daskalakis et al., 2006 for more details on this subject).

    • Copyright © Cambridge University Press 2013 2013Cambridge University Press
References (85)
  • About this article
    Cite this article
    Andriy Burkov, Brahim Chaib-Draa. 2014. Repeated games for multiagent systems: a survey. The Knowledge Engineering Review 29(1)1−30, doi: 10.1017/S026988891300009X
    Andriy Burkov, Brahim Chaib-Draa. 2014. Repeated games for multiagent systems: a survey. The Knowledge Engineering Review 29(1)1−30, doi: 10.1017/S026988891300009X
  • Catalog

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return