/
6.853: Topics in Algorithmic Game Theory 6.853: Topics in Algorithmic Game Theory

6.853: Topics in Algorithmic Game Theory - PowerPoint Presentation

lindy-dunigan
lindy-dunigan . @lindy-dunigan
Follow
401 views
Uploaded On 2016-04-01

6.853: Topics in Algorithmic Game Theory - PPT Presentation

Fall 2011 Constantinos Daskalakis Lecture 11 Last Lecture 0 n Generic PPAD Embed PPAD graph in 01 3 3DSPERNER canonical p w linear BROUWER multiplayer NASH 4player ID: 272558

nash game equilibrium algorithms game nash algorithms equilibrium player lawyer support games dgp approximate polymatrix point strategies time equilibria

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "6.853: Topics in Algorithmic Game Theory" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

6.853: Topics in Algorithmic Game Theory

Fall 2011

Constantinos Daskalakis

Lecture 11Slide2

Last

Lecture

...

0

n

Generic PPAD

Embed PPAD graph in [0,1]

3

3D-SPERNER

canonical

p

.w

. linear

BROUWER

multi-player

NASH

4-player

NASH

3-player

NASH

2-player

NASH

[Pap ’94]

[DGP ’

05]

[DGP ’05]

[DGP ’

05]

[DGP ’

05]

[DGP ’

05]

[DP ’

05]

[CD’

05]

[CD’

06]

DGP = Daskalakis, Goldberg, Papadimitriou

CD = Chen, DengSlide3

This

Lecture

...

0

n

Generic PPAD

Embed PPAD graph in [0,1]

3

3D-SPERNER

multi-player

NASH

4-player

NASH

3-player

NASH

2-player

NASH

[Pap ’94]

[DGP ’

05]

[DGP ’05]

[DGP ’

05]

[DGP ’

05]

[DGP ’

05]

[DP ’

05]

[CD’

05]

[CD’

06]

DGP = Daskalakis, Goldberg, Papadimitriou

CD = Chen, Deng

canonical

p

.w

. linear

BROUWERSlide4
Slide5

Polymatrix

Games (Review)The PPAD-hard games we constructed were

separable multiplayer games, aka polymatrix

games

. These are multi-player games with edge-wise separable utility functions.

- player’s payoff is the sum of payoffs from all adjacent edges

- edges are

2-player

gamesSlide6

Reducing

Polymatrix to Bimatrix

Games

polymatrix

game

w.l.o.g

. can assume bipartite, by turning every gadget used in the reduction into a bipartite game (

inputs&output

are on one side and “middle player” is on the other side)Slide7

2-player

game

polymatrix

game

red lawyer

represents

red nodes

, while

blue lawyer

represents

blue nodes

w.l.o.g

. can assume bipartite, by turning every gadget used in the reduction into a bipartite game (

inputs&output

are on one side and “middle player” is on the other side)

Reducing

Polymatrix

to

Bimatrix

GamesSlide8

Payoffs of the Lawyer-Game

wishful thinking

:

if (

x

,

y

)

is a

Nash equilibrium

of the lawyer-game, then the marginal distributions that

x

assigns to the strategies of the

red nodes

and the

marginals

that

y

assigns to the

blue nodes

, comprise a

Nash

equilibrium.

But why would a lawyer play every node

he represents?

…Slide9

Enforcing Fairness

- The lawyers play on the side a high-stakes game.

-

W.l.o.g

. assume that each lawyer represents

n

clients. Name these clients 1,…,

n

.

- Payoffs of the high-stakes game:

Suppose the red lawyer plays any strategy of client

j

, and blue lawyer plays any strategy of client

k

, then

=

M

If , then red lawyer gets +M, while blue lawyer gets –M.

If , then both players get 0. Slide10

Enforcing Fairness

Claim:

The unique Nash equilibrium of the high-stakes lawyer game is for both lawyers to play uniformly over their clients.

Proof:

exerciseSlide11

Enforcing Fairness

+

M,-M

0,0

0,0

0,0

M,-M

0, 0

0, 0

0 , 0

M,-M

M =

high

stakes

game

payoff table addition

Choose:Slide12

Analyzing the Lawyer Game

- when it comes to distributing the total probability mass among the different nodes of

, essentially only

the high-stakes game is

relevant to the lawyers…

Lemma

1: if (

x

,

y

)

is an equilibrium of the lawyer

game, for all

u

,

v :

- when it comes to distributing the probability mass

x

u

among the different strategies of node u

,

only the payoffs of the game are

relevant…

The

payoff

difference for the red lawyer from strategies and

is

Lemma

2:

Proof:

exercise

total probability mass assigned by lawyers on nodes

u

,

v

respectively Slide13

Analyzing the Lawyer Game (cont.)

Lemma

2

if ,

then for

all

j

:

- if

M

is large, can correct it to an exact Nash equilibrium of the

polymatrix game, using similar technique as exercise of last time.

(

marginals

given by lawyers to different nodes)

-

define and

Observation:

if we had

x

u

=1/

n

, for all

u

, and

y

v =1/n, for all

v, then

would

be a Nash equilibrium.

-

the deviation from uniformity

results in an approximate Nash

equilibrium of the polymatrix

game. Slide14

lawyer construction

Exercise form Last time

Exercise from Last Time

obvious

through SPERNER,

BROUWERSlide15
Slide16

Algorithms for Nash

Equilibria

Simplicial

Approximation Algorithms

Support Enumeration Algorithms

Lipton-

Markakis

-Mehta

Algorithms for Symmetric

Games (next time)

The Lemke-

Howson

Algorithm (next time)Slide17

Algorithms for Nash

Equilibria

Simplicial

Approximation AlgorithmsSlide18

Simplicial

Approximation Algorithms

Given a continuous function , where

f

satisfies a

Lipschitz

condition and

S

is a compact convex subset of the Euclidean space, find

such that .

.

(or exhibit a pair of points violating the

Lipschitz

condition, or a point mapped by the function outside of

S)

suppose that S is described in some meaningful way in the input, e.g.

polytope

, or ellipsoid

Simplicial

Approximation Algorithms

comprise a family of algorithms computing an approximate fixed point of

f

by dividing

S

up into

simplices and defining a walk that pivots from simplex to simplex of the subdivision until it settles at a simplex containing an approximate fixed point.

(this is a re-iteration of the BROUWER problem that we defined in earlier lectures; for details on how to make the statement formal check previous lectures)Slide19

(our own)

Simplicial

Approximation Algorithm

(details in Lecture 6,7)

1. Embed

S

into a large enough hypercube.

2. Define an extension

f

of

f

to the points in the hypercube that lie outside of S

in a way that, given an approximate fixed point of f’

, an approximate fixed point of

f can be obtained in polynomial time.

3. Define the canonical subdivision of the hypercube (with small enough precision that depends on the

Lipschitz property of f

’ see previous lectures

).

4. Color the vertices of the subdivision with

n

+1 colors, where n

is the dimensionality of the hypercube. The color at a point x

corresponds to the angle of the displacement vector .

5. The colors define a legal

Sperner coloring.

6. Solve the

Sperner instance, by defining a directed walk starting at the “

starting simplex” (defined in lecture 6) and pivoting between

simplices through colorful facets.

7. One of the corners of the simplex where the walk settles is an approximate fixed point.

the non-constructive stepSlide20

Algorithms for Nash

Equilibria

Simplicial

Approximation Algorithms

Support Enumeration AlgorithmsSlide21

Support Enumeration Algorithms

How better would my life be if I knew the support of the Nash equilibrium?

… and the game is 2-player?

any feasible point (

x

,

y

) of the following linear program is an equilibrium!

Setting:

Let (

R

,

C

) be an

m by n

game, and suppose a friend revealed to us the supports and respectively of the Row and Column players’ mixed strategies at some equilibrium of the game.

s.t

.

andSlide22

Support Enumeration Algorithms

How better would my life be if I knew the support of the Nash equilibrium?

… and the game is 2-player?

for guessing the support

for solving the LP

Runtime:Slide23

Support Enumeration Algorithms

How better would my life be if I knew the support of the Nash equilibrium?

… and the game is

polymatrix

?

can do this with Linear Programming too!

input:

the support of every node at equilibrium

goal:

recover the Nash equilibrium with that support

the idea of why this is possible is similar to the 2-player case:

- the expected payoff of a node from a given pure strategy is linear in the mixed strategies of the other players;

- hence, once the support is known, the equilibrium conditions correspond to linear equations and inequalities.Slide24

Rationality of

Equilibria

Important Observation:

The correctness of the support enumeration algorithm implies that in 2-player games and in

polymatrix

games there always exists an equilibrium in rational numbers, and with description complexity polynomial in the description of the game!Slide25

Algorithms for Nash

Equilibria

Simplicial

Approximation Algorithms

Support Enumeration Algorithms

Lipton-

Markakis

-MehtaSlide26

Computation of Approximate

Equilibria

Theorem [Lipton,

Markakis

, Mehta ’03]:

For

all and any 2-player game with at most

n

strategies per player and payoff entries in [0,1],

there exists

an -

approximate

Nash equilibrium in which each player’s strategy is uniform on a

multiset

of their pure strategies of size

- By Nash’s theorem, there exists a Nash equilibrium (

x

,

y

).

- Suppose we take samples from

x

, viewing it as a distribution.

: uniform distribution over the sampled pure strategies

- Similarly, define by taking

t

samples from

y

.

Claim:

Proof idea:

(of a stronger claim)Slide27

Computation of Approximate

Equilibria

Lemma: With probability at least 1-4/n the following are satisfied:

Proof:

on the board using

Chernoff

bounds.

Suffices to show the following:Slide28

Computation of Approximate

Equilibria

set

:

every point is a pair of mixed strategies that are uniform on a

multiset

of size

Random sampling from takes expected time

Oblivious Algorithm

:

set does not depend on the game we are solving.

Theorem

[

Daskalakis-Papadimitriou

09]

:

Any oblivious algorithm for general games runs in expected time