02 Solution concepts: examples

Dominant strategy solution

i, best(si) unique, independent of si: \(u(s_{i}, s’{-i}) \geq u(s’{i}, s’{-i}) \forall s’\). Eg: Prisoner’s dilemma; not: Battle of sexes. So, players individually select strategies. May not lead to optimal payoff for any \(p{i}\).

Prisoner’s dilemma

Cost matrix C=[(4,4)(1,5)\(5,1)(2,2)]. s = (2, 2) best for both, but unstable: If p1 sets s1=2, p2 tempted to set s2=1. s=(1,1) stable. Optimal selfish strategy of p1 independent of p2’s actions. Can be extended to multi-player game.

Tragedy of commons

n players; 1 common bandwidth. Strategy about demanding a portion. So, strategies for each: si[0,1]. If si>1, payoff for all 0; As si increases, utility decreases for all; so utility ui(si)=si(1jisj). So, maximizing, best strategy: si=(1jisj)/2. Unique stable soln: si=(1+n)1i. ui=n(1+n)2n1; so tragedy. A cartel would be better!

2nd price auction

Aka Vickery. 1 shot Auction of an item: pi’s value for item is vi; bids si; if pi looses, ui = 0; wins if si=maxjsj. If pi wins, ui=visj, where j’s bid is next highest. Dominant strategy: si=vi! Item auctioned to pi who values it most: ‘socially optimal outcome’.

Revelation principle

All pi can give Game Designer (GD) \ valuation function, let GD play game. Maybe need exponential communication for value function. Also, security needed.

Iterated elimination of str dominated strategies

Take\ game matrix. For each player: Amongst the strategies left, eliminate all dominated strategies.

Sometimes left with many incomparable strategies.

Elimination for weakly dominated strategies could lead to elimination of Nash equilibrium strategies.

Nash equilibrium

Defn: D or {Di} where even if all pi know all Di, no treachery profitable. Maybe D not unique. So each pi can decide Di if he knows Di.

Pure strategy

s is Nash equilib if \(\forall i, s’: u_{i}(s_{i}, s_{-i}) \geq u_{i}(s’{i}, s{-i}) \). Eg: Battle of sexes. Includes dominant strategy solns.

(Anti) Coordination games

Battle of the sexes: p1,p2 gain when s1=s2. Players select strategies together. C=[(4,5)(1,1)\(2,2)(5,4)]. Multiple equilibria; Eg: Pr(s=(1,1))=1.

Randomized (mixed) strategies

Not Pure strategy s, but distr D. Risk neutral pi maximize ui(D)=EsD[ui(s)], with PrsD(s)=iPrsiDi(si).

D or {Di} is mixed strategy Nash equilib if \(\forall i, D’: u_{i}(D) \geq u_{i}(D’{i}, D{-i})\). (Check) Eg: Matching pennies. Generalizes pure strategy equilib.

Matching pennies C=[(1,1)(1,1)\(1,1)(1,1)]. A 0 sum game; so pi selects strategy independently. No stable s; so, p1 randomizes s1 to thwart 2.

Existance of Equilibria

(Nash) Any game with |P|,|Si| finite, mixed strategy Nash equilib. \why

Some games don’t have Nash equilibrium.

Properties

If Di part of Nash equilibrium,\ every tDi is a best response to Di: EDi[t,Di] is maximum: else there could’ve been 0 wt on t.

Time Complexity

Finding Nash equilibria is PPAD complete.

Games representable by digraphs with indegree, outdegree 1; problem is to find source or sink other than a ‘standard source’. Like Lemke - Howson polytope.

\tbc

eps Nash equilib

A special case: \(\forall i, D’: u_{i}(D) \geq u_{i}(D’{i}, D{-i}) - \eps\)

Correlated equilibrium D

(Aumann). Coordinator has distr D, samples s from D, tells each pi its si. pi not told sj, but knows it is correlated to si; so knows all Pr(si|si). D known to every pi. D is correlated equilib if it is not in any pi’s interest to deviate from s, assuming other pi follow instructions: \ \(E_{s_{-i} \distr D|s_{i}} [u_{i}(s_{i}, s_{-i})] \geq E_{s_{-i}\distr D|s_{i}} [u_{i}(s’{i}, s{-i})] \).

Mixed strategy Nash equilibrium is the special case where Di are independently randomized (with diff coins).

Not well motivated in some games.

Coordinator picks correlated equilibrium by optimizing some fn (Eg: total payoff or avg payoff).

Regret defn

fi:SiSi, regret ri(s,f)=ui(fi(si),si)ui(s):\ EsD[ri(s,fi)]0.

eps correlated equilibrium

EsD[ri(s,fi)]ϵ.

Traffic light/ Chicken C=[(100,100)(1,0)\(0,1)(0,0)]. s = (1, 2) and (2, 1) stable; so coordinator picks one randomly. This correlation increases payoff as the low expected utility mixed strategy Di=(1011,11011) is avoided.

Reduction to LP

Unknowns: concatenation of vectors {Di}. If n players, m strategies each, mn unknowns. Make inequalities: Ui(Di,Di)Ui(Di,Di); \(\norm{D_{i}}{1} = 1\); \(D{i} \geq 0\).

\tbc

Time Complexity

Correlated equilibria form a convex set; so if game specified explicitly, can find one in polynomial time if matrix U given. But finding optimal correlated equilibrium is hard. \tbc