Search
2011 Volume 26
Article Contents
RESEARCH ARTICLE   Open Access    

A unifying framework for iterative approximate best-response algorithms for distributed constraint optimization problems1

More Information
  • Corresponding authors: Archie C. Chapman ;  Alex Rogers ;  Nicholas R. Jennings ;  David S. Leslie

Article Metrics

Article views(16) PDF downloads(13)

RESEARCH ARTICLE   Open Access    

A unifying framework for iterative approximate best-response algorithms for distributed constraint optimization problems1

  • Corresponding authors: Archie C. Chapman ;  Alex Rogers ;  Nicholas R. Jennings ;  David S. Leslie
The Knowledge Engineering Review  26 Article number: 10.1017/S0269888911000178  (2011)  |  Cite this article

Abstract: Abstract: Distributed constraint optimization problems (DCOPs) are important in many areas of computer science and optimization. In a DCOP, each variable is controlled by one of many autonomous agents, who together have the joint goal of maximizing a global objective function. A wide variety of techniques have been explored to solve such problems, and here we focus on one of the main families, namely iterative approximate best-response algorithms used as local search algorithms for DCOPs. We define these algorithms as those in which, at each iteration, agents communicate only the states of the variables under their control to their neighbours on the constraint graph, and that reason about their next state based on the messages received from their neighbours. These algorithms include the distributed stochastic algorithm and stochastic coordination algorithms, the maximum-gain messaging algorithms, the families of fictitious play and adaptive play algorithms, and algorithms that use regret-based heuristics. This family of algorithms is commonly employed in real-world systems, as they can be used in domains where communication is difficult or costly, where it is appropriate to trade timeliness off against optimality, or where hardware limitations render complete or more computationally intensive algorithms unusable. However, until now, no overarching framework has existed for analyzing this broad family of algorithms, resulting in similar and overlapping work being published independently in several different literatures. The main contribution of this paper, then, is the development of a unified analytical framework for studying such algorithms. This framework is built on our insight that when formulated as non-cooperative games, DCOPs form a subset of the class of potential games. This result allows us to prove convergence properties of iterative approximate best-response algorithms developed in the computer science literature using game-theoretic methods (which also shows that such algorithms can also be applied to the more general problem of finding Nash equilibria in potential games), and, conversely, also allows us to show that many game-theoretic algorithms can be used to solve DCOPs. By so doing, our framework can assist system designers by making the pros and cons of, and the synergies between, the various iterative approximate best-response DCOP algorithm components clear.

    • Cf. partial monitoring, as in multi-armed Bandit problems. See Blum and Mansour (2007) for a discussion of the issues surrounding these two monitoring models.

    • Note that, when applied to DCOPs, many approaches to distributed optimization usually can be placed into one of the three categories defined above. For example, many negotiation models and local exchange markets are, in effect, local approximate best-response algorithms. Consider a negotiation model in which at each time step one agent in each neighbourhood announces a new configuration of the variables under its control to its neighbours, with the constraint that each new configuration weakly improves the agent's payoff (as is commonly employed). This type of negotiation model is captured by our framework, as messages are local (only neighbours receive the agent's update to its state) and the process is iterative. We also point out that other approaches to the general problem of distributed optimization that can be applied to DCOPs, such as token-passing (e.g. Xu et al., 2005) or auction protocols (e.g. Gerkey & Mataric, 2002), do not fall into any of these three categories. However, we consider an exhaustive classification and analysis of these distributed optimization techniques outside the scope of this paper, because we are primarily concerned with local approximate best-response algorithms in the specific case of DCOPs.

    • The complete algorithms are typically used in applications where their optimality is the key concern and timeliness is not a limiting factor, such as industrial scheduling and timetabling problems (Petcu & Faltings, 2005) or routing protocols for fixed environmental sensor networks (Kho et al., 2009).

    • One notable exception to this trend is Marden et al. (2009a), who illustrate the connections between potential games and consensus problems. In a consensus problem, a set of agents must reach consensus upon a given value (such as a meeting point). These problems may be modelled as a DCOP containing binary and unary constraints. Each binary constraint between two agents is satisfied when their variables are set to the same value, and violations are penalized in proportion to their distance between their variables’ values. However, an agent's strategy set may be limited by its unary constraints such that it is not possible for it to simultaneously satisfy all its binary constraints.

    • Generally, any optimization problem that forms a commutative semi-ring can be expressed as a bipartite factor graph—see Aji and McEliece (2000) for details.

    • Indeed, this result holds for any problem that can be represented as a factor graph and in which the variable domains are finite.

    • Tumer and Wolpert (2004) provide another method for showing that DCOP games are potential games. Generally, any global utility function whose variables are controlled by independent agents can be instantiated as a potential game by setting each agent's utility function equal to its marginal effect on the global utility (as in the alignment criterion above). Then, by definition, any change in an agent's utility is matched by an equivalent change in the global utility. In a DCOP, one way to achieve this is to set each agent's utility to the sum of the payoffs for constraints that it is involved in, as in Equation (2).

    • Although Nash equilibria correspond to 1-optima, note that strong equilibria (Aumann, 1959) do not correspond to n-optima. A strong equilibrium is robust to deviations from all coalitions of n players and less, where a coalition deviates if at least one member's individual payoff improves and none decrease. In a DCOP game, an n-optimum always exists, while a strong equilibrium may not. However, if a strong equilibrium does exist, it is n-optimal.

    • Maheswaran et al. (2005) show that MGM is anytime and converges to an element in the set of Nash equilibrium in DCOP games directly, without using a potential game characterization of the problem.

    • Copyright © Cambridge University Press 20112011Cambridge University Press
References (68)
  • About this article
    Cite this article
    Archie C. Chapman, Alex Rogers, Nicholas R. Jennings, David S. Leslie. 2011. A unifying framework for iterative approximate best-response algorithms for distributed constraint optimization problems1. The Knowledge Engineering Review. 26:178 doi: 10.1017/S0269888911000178
    Archie C. Chapman, Alex Rogers, Nicholas R. Jennings, David S. Leslie. 2011. A unifying framework for iterative approximate best-response algorithms for distributed constraint optimization problems1. The Knowledge Engineering Review. 26:178 doi: 10.1017/S0269888911000178
  • Catalog

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return