410 likes | 416 Views
Constraint Satisfaction Problems and Games. S Kameshwaran Oct 22, 2002. Outline. Introduction to CSP Introduction to N-Person Non-cooperative Games Nash Equilibrium revisited: Mixed Strategies Mapping CSP to a Game Tracing Procedure (Evolutionary Process) to find NE. CSP.
E N D
Constraint Satisfaction Problems and Games S Kameshwaran Oct 22, 2002
Outline • Introduction to CSP • Introduction to N-Person Non-cooperative Games • Nash Equilibrium revisited: Mixed Strategies • Mapping CSP to a Game • Tracing Procedure (Evolutionary Process) to find NE
CSP • Given a set of variables, find possible values to the variables which simultaneously satisfy a set of constraints • Applications: • Scheduling • Resource allocation • Computational Molecular Biology • Vision…
CSP • Given: • X: Variables ( X1, X2, …, Xn) • D: Domain Di for variable Xi • R: Set of constraints r (can be logical) • Find: • Solution: Assignment of value to Xi from its domain Di such that all constraints are satisfied
CSP • A constraint is called k-ary constraint if it connects k variables • k(r): arity of constraint r • (d1, d2, …, dn) r: The assignment di to Xi satisfies constraint r • Characteristic Function: r(d1, d2, …, dn) • 1 if (d1, d2, …, dn) r • 0 otherwise
CSP • Example: 8 Queens Problem • Place the 8 queens on the chess board such that no queen attacks the others • No two queens should be placed on the same row, or on the same column or on the same diagonal • 8 variables • Xi=j: Queen on ith row is placed on jth column • Constraint r: No two queens should be placed in the same column • Binary Constraint: k(r)=2 • Xi is not equal to Xj
CSP • Solution Approaches • Search algorithms • Backtracking • Forward checking • Graph based algorithms • Neural Networks
N-Person Non-cooperative Games • N players • Non-cooperative vs. Cooperative: : • Players cannot make binding commitments • Players join and split the gains out of cooperation • Solution concept: Nash Equilibrium
N-Person Non-cooperative Games • Normal Form Games • N players • Si=Strategy set of player i (Pure Strategy) • Single simultaneous move: each player i chooses a strategy siSi • Nobody observes others’ move • The strategy combination (s1, s2, …, sN) gives payoff (u1, u2, …, uN) to the N players • All the above information is known to all the players and it is common knowledge
Nash Equilibrium • Nash Equilibrium is a strategy combination s*= (s1*, s2*, …, sN*), such that si* is a best response to (s1*, …,si-1*,si+1*,…, sN*), for each i • (s1*, s2*, s3*) is a Nash Equilibrium (3 player game) iff • s1* is the best response of 1, if 2 chooses s2* and 3 chooses s3* • s2* is the best response of 2, if 1 chooses s1* and 3 chooses s3* • s3* is the best response of 3, if 1 chooses s1* and 2 chooses s2* • Note: It is a simultaneous game and nobody knows what exactly the choice of other agents • Nash Equilibrium assumes correct and consistent beliefs
Nash Equilibrium: Battle of the Sexes • (Prize Fight, Prize Fight) is a NE: Best responses to each other • (Ballet, Ballet) is a NE: Best responses to each other
The Welfare Game • Government wishes to aid a pauper if he searches for work but not otherwise • Pauper searches for work only if he cannot depend on government aid
The Welfare Game • (Aid, Try to Work) is not NE: Pauper prefers Be Idle
The Welfare Game • (Aid, Try to Work) is not NE: Pauper prefers Be Idle • (Aid, Be Idle) is not NE: Govt prefers No Aid
The Welfare Game • (Aid, Try to Work) is not NE: Pauper prefers Be Idle • (Aid, Be Idle) is not NE: Govt prefers No Aid • (No Aid, Be Idle) is not NE: Pauper prefers Try to Work
The Welfare Game • (Aid, Try to Work) is not NE: Pauper prefers Be Idle • (Aid, Be Idle) is not NE: Govt prefers No Aid • (No Aid, Be Idle) is not NE: Pauper prefers Try to Work • (No Aid, Try to Work) is not NE: Govt prefers Aid
Mixed Strategies • Pure Strategy: Player i chooses strategy sijfrom set Si • Mixed Strategy: Player i chooses strategy sij with probability qij (qij>=0, j qij=1) • Every pure strategy is also a mixed strategy • Payoff in mixed strategies is the expected payoff
Mixed Strategies: Advantages • Mathematical point of view: • Convexifies the set: Convex sets are nice to play around as the terrain is well understood
Mixed Strategies: Advantages • Mathematical point of view: • Convexifies the set: Convex sets are nice to play around as the terrain is well understood • Existence of Nash Equilibrium for finite games: Kakutani fixed point theorem
Mixed Strategies: Advantages • Mathematical point of view: • Convexifies the set: Convex sets are nice to play around as the terrain is well understood • Existence of Nash Equilibrium for finite games: Kakutani fixed point theorem • Practical point of view: • Yes and No (depends on the situation)
Mixed Strategies: Interpretation • Expected payoff: • Let payoff with strategy si1 be 1 and si2 be 4 • Mixed strategy (½, ½) gives the expected payoff ½+2=2.5
Mixed Strategies: Interpretation • Expected payoff: • Let payoff with strategy si1 be 1 and si2 be 4 • Mixed strategy (½, ½) gives the expected payoff ½+2=2.5 • It means a sure payoff of 2.5 is equivalent to a gamble where the payoffs are 1 and 4, each with probability ½
Mixed Strategies: Interpretation • Expected payoff: • Let payoff with strategy si1 be 1 and si2 be 4 • Mixed strategy (½, ½) gives the expected payoff ½+2=2.5 • It means a sure payoff of 2.5 is equivalent to a gamble where the payoffs are 1 and 4, each with probability ½ • The above interpretation will not make sense if the payoff is money • It is true only for utilities
Mixed Strategies: Interpretation • Games where multiple strategies can be simultaneously employed • Betting on more than one horse
Mixed Strategies: Interpretation • Games where multiple strategies can be simultaneously employed • Betting on more than one horse • Multiple instances of the same game • War Scenario: qij% of pilots use strategy sij
Mixed Strategies: Interpretation • Games where multiple strategies can be simultaneously employed • Betting on more than one horse • Multiple instances of the same game • War Scenario: qij% of pilots use strategy sij • Same game repeated infinitely
Mixed Strategies: Interpretation • Games where multiple strategies can be simultaneously employed • Betting on more than one horse • Multiple instances of the same game • War Scenario: qij% of pilots use strategy sij • Same game repeated infinitely • For a single game: The probability distribution is the opponents’ estimation of player i’s decision
CSP as Games • CSP C=< X, D, R > • Game induced by C: GC=(S1, …, Sn; U1, …,Un) • n = |X| • Si = Di • Ui(d1, …, dn) = rR[i]k(r)r(d1, d2, …, dn) • R[i] = Constraint set that includes variable i • The payoff function counts the number of satisfied constraints connecting that variable, taking every constraint along with its arity • Instead of arity one can use different weights
Equilibria and Solutions • Every solution of C is a Nash Equilibrium of GC • For a solution, all constraints are satisfied, so no agent can improve its payoff by assuming a different value • All Nash Equilibriums are not solutions • C may not have a solution but still GC will have a NE
Equilibria and Solutions • Not a Solution
Equilibria and Solutions • Not a Solution • Nash Equilibrium: A better solution is not possible by moving a single queen in one move – Non-cooperative
Trivia: Non-cooperative Games • A solution better to some agents may be available, but cannot be reached by a decision of single agent alone • Cooperation or non-cooperation depends on the game • Non-cooperation need not be due to conflict in goal, but may be due to communication costs
Trivia: Non-cooperative Games • Prisoner’s Dilemma in reality: Should IISc water its garden when there is drought in Mysore? • Consider the following situation: Drought in Mysore but not in Bangalore • Saved water from Bangalore can be transported to Mysore • Decision making of an agent in Bangalore: • If all saves water, his saving will not contribute much • If nobody saves water, his saving will not contribute much • So, better not to save
Equilibria and Solutions • The complexity of the CSP depends on its structure • Finding solution to C Finding NE to GC • Complexity of finding NE is not known • It is unlikely to be in P • It is also unlikely to be NP-hard as existence of solution is guaranteed
Equilibrium Selection • Tracing Procedure (Evolutionary process) • For agent i, there is a probability distribution (mixed strategy) pi, which the other agents expect that i will use • p=(p1, …, pn) • Assumption • Limited computational power • Agents are Bayesian decision makers • Each agent estimates its best strategy depending on p • Value of p is updated based on the previous outcome
Equilibrium Selection • BR(p)=Best response strategy to the distribution p • Synchronous Process • p: Initial distribution • p0= p • pk= BR(pk-1)+(1- )pk-1 • 0<<=1 • If pk converges then the limit point is NE
Equilibrium Selection • Computation of BR(p) for an agent is computationally taxing if it is connected with large number of variables • The process may converge to a NE that may not be a solution • Can be considered as the best possible quasi solution • No general proof that the process will always converge • Susceptible to initial probability distributions
Trivia: Solution to 8 Queens Problem • 10 distinct solutions • http://www.math.utah.edu/~alfeld/queens/queens.html
References • Equilibrium Theory and Constraint Networks, Francesco Ricci, 1992 • Games and Decisions, Luce and Raiffa, Dover Publications, 1957 • Games and Information: An Introduction to Game Theory, Eric Rasmusen, Basil Blackwell Publishers, 1989
Next.. • 25/10/02 Nash Equilibrium: P or NP?