• 沒有找到結果。

Game Theory at Grace Baptist Church 貳

N/A
N/A
Protected

Academic year: 2021

Share "Game Theory at Grace Baptist Church 貳"

Copied!
110
0
0

加載中.... (立即查看全文)

全文

(1)

Game Theory at Grace Baptist Church

Spring 2006

Professor Hsueh-I Lu (呂學一)

National Taiwan University

(2)

重要的規定(將會每週提醒)

重要的規定(將會每週提醒)

除了白開水之外,禁止在教室內飲食,

違者立刻喪失旁聽與修課的資格。

注意路上的安全,過馬路時不要趕時間

,我們上課不會點名,所以遲到,缺席

都沒有關係,就是千萬不要「用生命趕

路」。

盡可能將腳踏車停在校園內然後走過來

,以免造成懷恩堂周遭環境的紊亂。

(3)

Outline

Outline

Strategic games

Nash equilibrium

Strictly competitive games

Bayesian games

(4)
(5)

上官與左左木兩位同學涉嫌於期末考聯

手舞弊,遭隔離偵訊

An example

An example

上招

上不招

左招

各記大過一次

左小過、上退學

左不招 上小過、左退學

都沒事

(6)

Strategic games

Strategic games

Also known as

– Matrix games

– Games in normal form

(7)

Choices

Choices

In a strategic game,

– (1) each player chooses her plan of action

once and for all, and

– (2) all players make their choices

simultaneously.

An equivalent implementation for (2):

– When each player makes her choices, she is

not aware of other players’ choices.

(8)

Players

Players

上招

上不招

左招

各記大過一次

左小過、上退學

左不招 上小過、左退學

都沒事

F

There are always a ¯nite number n ¸ 2 of players in

a game.

F

Let N = f 1;2;:::;ng denote the set of players. For

example, thefollowing is a 2-player game. Sometime,

we call them the

row player

and the

column player

.

(9)

Actions

Actions

上招

上不招

左招

各記大過一次

左小過、上退學

左不招 上小過、左退學

都沒事

F

Each player may have an arbitrary number of actions. A

game is

¯nite

if the number of actions for each player is

¯nite.

F

Let A = A

1

£ A

2

£ ¢¢¢£ A

n

denote the set possible actions.

F

For example, in the following game, each player has two

actions. We have jA

1

j = jA

2

j = 2 and thus jAj = 2¢2 = 4.

(10)

Consequences

Consequences

上招

上不招

左招

各記大過一次

各記小過一次

左不招

各記小過一次

都沒事

F

Let C consist of all possible consequences of

the game.

F

Each action is associated with a consequence.

F

For example, thefollowing gamehas only three

possible consequences.

(11)

Preference on consequences

Preference on consequences

上招

上不招

左招

c

1

: 各記大過一次

c

2

: 各記小過一

左不招

c

2

: 各記小過一

c

3

: 都沒事

F

The preference of each player on C is a total order.

F

For two consequences c and d in C, let c %

i

d signify

that the i-th player does not prefer d to c.

F

For example c

3

%

i

c

2

%

i

c

1

holds for i = 1;2 in the

following game.

(12)

Preference on actions

Preference on actions

上 Confess

上 Deny

Confess

c

1

: 各記大過一次

c

2

: 各記小過一

左 Deny

c

2

: 各記小過一

c

3

: 都沒事

F

The preference of each player on A is a total order.

F

For two actions a and bin A, let a %

i

bsignify c(a) %

i

c(b), where c(a) and c(b) are the consequences of a

and b, respectively.

F

For example, (Deny;Deny) %

1

(Confess;Deny) %

1

(Confess;Confess).

(13)

Utility (or payoff)

Utility (or payoff)

F

Usually, the preference relation is represented by a

utility (or payo®) function.

上招

上不招

左招

各記大過一次

左小過、上退學

左不招 上小過、左退學

都沒事

上招

上不招

左招

(-5,-5)

(-2,-10)

左不招

(-10,-2)

(0,0)

F

(0;0) %

1

(¡ 2;¡ 10) %

1

(¡ 5;¡ 5) %

1

(¡ 10;¡ 2).

F

(0;0) %

2

(¡ 10;¡ 2) %

2

(¡ 5;¡ 5) %

2

(¡ 2;¡ 10).

(14)

Some classic games

Some classic games

(15)

Confess or Deny

Confess or Deny

上招

上不招

左招

(-5,-5)

(-2,-10)

左不招

(-10,-2)

(0,0)

(16)

Prisoner’s Dilemma

Prisoner’s Dilemma

上招

上不招

左招

(-5,-5)

(

2

,-10)

左不招

(-10,

2

)

(0,0)

(17)

Brokeback or Crash

Brokeback or Crash

Brokeback

Mountain

Crash

Brokeback

Mountain

2,1

0,0

Crash

0,0

1,2

Its classic

name: “battle

of sexes”.

(18)

Hawk-Dove

Hawk-Dove

Dove

Hawk

Dove

3,3

1,4

(19)

Matching pennies

Matching pennies

Front Back

Front

1,-1

-1,1

Back

-1,1

1,-1

Chemistry

(20)

Nash equilibrium

Nash equilibrium

Also known as saddle point,

steady state, and local optimum.

(21)

Origin

Origin

The concept of the Nash equilibrium

was originated by Nash in his

dissertation, Non-cooperative games

(1950).

Nash showed that the various

solutions for games that had been

(22)

Roughly speaking

Roughly speaking

An action of a game is a Nash equilibrium

if each player has no incentive to change

her action under the assumption that all

the other players do not change their

(23)

Formally,

Formally,

An action a

¤

= (a

¤

1

;a

¤

2

;:::;a

¤

n

) is a

Nash equilibrium

of the game if

a

¤

%

i

(a

¤

1

;a

¤

2

;:::;a

¤

i ¡ 1

;

a

i

;a

¤

i +1

;:::;a

¤

n

)

holds for each i 2 N and each action a

i

2 A

i

.

上招

上不招

左招

(-5,-5)

(-2,-10)

左不招

(24)

Prisoner’s Dilemma

Prisoner’s Dilemma

上招

上不招

左招

(-5,-5)

(

2

,-10)

左不招

(-10,

2

)

(0,0)

(25)

Brokeback or Crash

Brokeback or Crash

Brokeback

Mountain

Crash

Brokeback

Mountain

2,1

0,0

Crash

0,0

1,2

(26)

Hawk-Dove

Hawk-Dove

Dove

Hawk

Dove

3,3

1,4

(27)

Front Back

Front

1,-1

-1,1

Back

-1,1

1,-1

Matching pennies

Matching pennies

Chemistry

(28)

An infinite game

An infinite game

A first-price auction

畢卡索的油畫

畢卡索的油畫

《拿煙斗的男孩

拿煙斗的男孩

(Garcon al la Pipe)

(Garcon al la Pipe)

,以

,以

1

1

416

416

萬美元的天價售出,一舉刷新繪畫作品拍賣的世界記錄,成為

萬美元的天價售出,一舉刷新繪畫作品拍賣的世界記錄,成為

目前世界上最昂貴的畫。

(29)

Settings

Settings

F

Let v

i

be thevaluation of thepainting for thei-th player such that

v

1

> v

2

> ¢¢¢> v

n

> 0:

T he set of actions for each player is [0;1 ). T herefore, it is an

in¯nite game.

F

Suppose that b

i

is the bid of the i-th player. If

b

i

= max

1· j · n

b

j

and b

j

< b

i

holds for each j < i, then the payo®for the i-th player

is v

i

¡ b

i

. Otherwise, the payo®for the i-th player is 0.

(30)

Question

Question

What are the Nash equilibria of the

game?

(31)

Nash equilibria

Nash equilibria

The Nash equilibria consist of the actions b such that

F

C1: v

2

· b

1

· v

1

;

F

C2: b

j

· b

1

holds for all j > 1; and

F

C3: there is an index j > 1 with b

j

= b

1

.

We show that Conditions C1, C2, and C3 together imply that b

is a Nash equilibrium

F

By C2, player 1 always wins. Therefore, the payo®is

(v

1

¡ b

1

;0;0;:::;0):

F

Player 1 does not want to change her bid. Why?

(32)

If player k wins the bid

If player k wins the bid

The Nash equilibria consist of the actions b such that

F

C1: v

2

· b

1

· v

1

;

F

C2: b

j

· b

1

holds for all j > 1; and

F

C3: there is an index j > 1 with b

j

= b

1

.

The payo®of player k with a bid ^

b

k

> b

1

is

(33)

Nash equilibria

Nash equilibria

T he Nash equilibria consist of the actions b such that

F

C1: v

2

· b

1

· v

1

;

F

C2: b

j

· b

1

holds for all j > 1; and

F

C3: there is an index j > 1 with b

j

= b

1

.

T he direction that

b is a Nash equilibrium implies Conditions C1, C2, and C3

is left as an exercise.

(34)

Comments

Comments

The aforementioned game for first-price

auction describes the situation that all

players know the valuations of all players.

If some valuations are unknown to some

players, the first-price auction should be

modeled as a Bayesian game, to be

(35)

Second-price auction

Second-price auction

A second-price auction is also

known as Vickery auction, named

after William Vickery, who received

the Nobel prize in 1996.

– Vickery is recognized as the founder of

auction theory.

– Vickery is the first one discovered that

bidding the valuation is a dominant

(36)

Dominant strategy

Dominant strategy

上招

上不招

左招

(-5,-5)

(2,-10)

左不招

(-10,2)

(0,0)

The column player’s

dominant strategy

The row player’s

dominant strategy

(37)

The game

The game

F

Let v

i

be the valuation for the i-th player such that

v

1

> v

2

> ¢¢¢> v

n

> 0:

T he set of actions for each player is [0; 1 ). So, it is an in¯nite

game.

F

Suppose that, for each j with 1 · j · n, b

j

is the bid of the j -th

player. If i is the index such that (1) b

i

= max

1· j · n

b

j

, and (2)

b

j

< b

i

holds for each j with 1 · j < i, then the payo®for the i-th

player is

v

i

¡

max

1· j · n;j 6

=i

b

j

:

(38)

Vickery’s Finding

Vickery’s Finding

Letting b

i

= v

i

is a

(weakly) dominant strategy

for each player i.

That is, the payo®of the i-th player for the bid b with b

i

= v

i

is

no worse than the payo®of the i-th player for any bid ^

bsuch that

^

b

j

= b

j

holds for each j 6

= i.

(Why?)

An observation: If b

i

= v

i

, then the i-th player has non-negative

payo®in the second-price auction.

(39)

Case analysis – (1)

Case analysis – (1)

Case 1: max

j 6

=i

b

j

¸ v

i

.

F

If the i-th player loses the painting with ^

b, the

payo®of ^

b is zero, no better than that of b.

F

If thei-th player wins thepainting with ^

b, then

^

b

i

> v

i

. The second price is at least v

i

.

There-fore, the payo®of ^

b for the i-th player is at

most zero.

(40)

Case analysis – (2)

Case analysis – (2)

Case 2: max

j 6

=i

b

j

< v

i

.

With b, the i-th player wins the painting and pays

the price max

j 6

=i

b

j

.

F

If the i-th player loses the painting with ^

b, the

payo®of ^

b is zero, no better than that of b.

F

If thei-th player wins thepainting with ^

b, then

the i-th player still pays the price max

j 6

=i

b

j

.

Her payo®does not change.

(41)

Comments (again)

Comments (again)

The aforementioned game for

second-price auction describes the situation that

all players know the valuations of all

players.

If some valuations are unknown to some

players, the second-price auction should

be modeled as a Bayesian game, to be

discussed later.

(42)

Strictly competitive games

Strictly competitive games

(43)

Definition

Definition

A two-player game is

strictly competitive

if

a %

1

b if and only if b%

2

a

(44)

Questions

Questions

Let u be the payo®(i.e., utility) function of a two-player

strictly competitive game. The de¯nition says that

u

1

(a) ¸ u

1

(b) if and only if u

2

(b) ¸ u

2

(a)

holds for any two actions a and b of the game.

Do the following conditions hold as well?

F

u

1

(a) = u

1

(b) if and only if u

2

(a) = u

2

(b)?

(45)

A popular name

A popular name

(46)

Definition

Definition

Let u be the payo®(i.e., utility) function of a two-player game. The

game is

zero-sum

if

u

1

(a) + u

2

(a) = 0

holds for any action a 2 A of the game.

(47)

Question

Question

Are two definitions the same?

Otherwise, why a strictly competitive game

is known as a zero-sum game?

(48)

How they relate?

How they relate?

Any two-player strictly competitive game admits a

payo®function u to represent its preference relation

such that u

1

(a) +u

2

(a) = 0 holds for any action a of

the game.

(49)

Focus on zero-sum games

Focus on zero-sum games

The rest of the discussion for strictly

competitive games focuses on zero-sum

games.

It suffices to specify the payoff of one

player (e.g., the row player) in a zero-sum

game.

– The row player is the maximizer.

(50)

Maxi-min and Mini-max

Maxi-min and Mini-max

Consider a two-person zero-sum game G in strategic form,

where

h

is the payo®function for the

row player

. Let (i;j )

represent the action that the row player makes the i-th

choice and the column player makes the j -th choice.

Q: What are

max

i

min

j

h(i;j )

and

min

j

max

i

h(i;j )?

(51)

An easy observation

An easy observation

max

i

min

j

h(i;j ) · min

j

max

i

h(i;j )

P roof Suppose that (i

1

;j

1

) is

a maximinimizer of the LHS and

(i

2

;j

2

) is a minimaximizer of the

RHS. T herefore,

LHS = h(i

1

;j

1

)

·

h(i

1

;j

2

)

·

h(i

2

;j

2

)

= RHS:

i

1

i

2

j

1

j

2

(52)

For example,

For example,

Rock

Scissors

Paper

Rock

0

1

-1

Scissors

-1

0

1

Paper

1

-1

0

3

1

2

4

2

3

1

4

(53)

When does the equality

When does the equality

hold?

hold?

(54)

Saddle point

Saddle point

Consider a zero-sum gamein strategic form. Let h(i;j ) denote

the payo®of the row player when the row player selects her

i-th choice and the column player selects her j -th choice.

A pair of choices (i

¤

;j

¤

) is a

saddle point

(i.e., Nash

equilib-rium) if

h(i;j

¤

) · h(i

¤

;j

¤

) · h(i

¤

;j )

holds for all row indices i and column indices j .

If (i

¤

;j

¤

) is a saddle point, then we say that the game has a

(55)

雞首牛後

雞首牛後

2

3

1

4

4

7

6

1

5

3

2

8

9

(56)

Can a game have multiple

Can a game have multiple

saddle points?

saddle points?

3

4

3

(57)

Can a game have no

Can a game have no

saddle point?

saddle point?

Rock

Scissors

Paper

Rock

0

1

-1

Scissors

-1

0

1

(58)

Theorem

Theorem

2

3

1

4

4

7

6

1

5

3

2

8

9

3

4

3

2

0

1

T he game has a value v (and thus has a Nash equilibrium)

if and only if

max

(59)

Proof strategy

Proof strategy

Since we have

max

i

min

j

h(i;j ) · min

j

max

i

h(i;j );

it remains to show that the game has a value v if and only

if

LHS = max

(60)

value v

value v

LHS ≥ v ≥

LHS ≥ v ≥

RHS

RHS

i

1

i

¤

j

¤

i

2

j

1

j

2

Let (i

1

;j

1

) be a

max-iminimizer of the LHS.

Let (i

2

;j

2

) be a

minimax-imizer of the RHS. Let

(i

¤

;j

¤

) be a Nash

equilib-rium. Thus h(i

¤

;j

¤

) = v.

We have

h(i

1

;j

1

) ¸

h(i

¤

;j

¤

)

(61)

value v

value v

LHS ≥ v ≥

LHS ≥ v ≥

RHS

RHS

j

1

j

2

i

2

i

1

Let (i

1

;j

1

) be a maximinimizer

of theLHS. Let (i

2

;j

2

) bea

min-imaximizer of the RHS.

Since LHS ¸ RHS, we have

h(i

1

;j

1

) = h(i

1

;j

2

) = h(i

2

;j

2

):

Therefore, (i

1

;j

2

) is a Nash

equi-librium, and h(i

1

;j

2

) = v.

(62)

Theorem and corollary

Theorem and corollary

T he game has a value v (and thus has a Nash equilibrium)

if and only if

max

i

min

j

h(i;j ) = v = min

j

max

i

h(i;j ):

A corollary: if a two-player zero-sum game has multiple

Nash equilibria, then all equilibria have the same payo®

(i.e., the value of the game.)

3

4

3

(63)

Bayesian games

Bayesian games

Strategic games with imperfect

information

(64)

John C. Harsanyi

John C. Harsanyi

1920 – 2000

Born and educated in Budapest, Hungary (just like John

von Neumann)

– Received his Ph.D. degree in Philosophy

Moved to USA

– Received his Ph.D. degree in economics from Stanford U.

– Joined the faculty of U.C. Berkeley.

Defined and studied Bayesian games in 1967/1968.

1994 Nobel prize in economics (together with John

Nash.)

(65)

Two types of player 2

Two types of player 2

Brokebac

k

Crash

Brokeback

2,1

0,0

Crash

0,0

1,2

Brokebac

k

Crash

Brokeback

2,0

0,2

Crash

0,1

1,0

(66)

Player 1’s belief

Player 1’s belief

Brokebac

k

Crash

Brokeback

2,1

0,0

Crash

0,0

1,2

Brokebac

k

Crash

Brokeback

2,0

0,2

Crash

0,1

1,0

Player 2 wishes to meet player 1

Player 2 wishes to avoid player 1

(67)

Player 1’s viewpoint

Player 1’s viewpoint

A two-player game

– Player 1: an “ordinary” player

– Player 2: a uniform distribution of two types

When they play the game, player 2 select

one of her types uniformly at random to

play against player 1.

– Player 2 knows her own type.

(68)

學毅

學毅

s interpretation

s interpretation

There are two cards, one gives the left game and

the other gives the right game.

When the game starts, player 2 draws a card

uniformly at random and plays the game

according to the card.

What player 1 knows?

– The content of both cards.

– Player 2 plays the game according to one of the cards.

– The card is drawn uniformly at random.

(69)

Player 1 against two

Player 1 against two

types of player 2

types of player 2

(B,B)

(B,C)

(C,B)

(C,C)

B

2

=2*.5+

2*.5

1

=2*.5+

0*.5

1

=0*.5+

2*.5

0

=0*.5+

0*.5

C

0

=0*.5+

0*.5

0.5

=0*.5+

1*.5

0.5

=1*.5+

0*.5

1

=1*.5+

1*.5

(70)

Q: Why 4 actions for Player 2?

Q: Why 4 actions for Player 2?

The two types of player 2 act

independently. Each type of player 2 has

exactly two possible actions (i.e., pure

strategies). Therefore, player 2 has exactly

four possible actions (i.e., pure strategies.)

(71)

Q: Why expected pay-off?

Q: Why expected pay-off?

Take the expected pay-off for (B, (B, C))

as an example: 1 = 2 * 0.5 + 0 * 0.5.

– (B, C) describes the situation that

the first type of player 2 takes action (i.e., pure

strategy) B, and

The second type of player 2 takes action C.

– Since the two types of player 2 are “drawn”

uniformly at random, this is the expected

pay-off of player 1 if she takes action B.

(72)

The situation for the

The situation for the

whole game

whole game

It is like a game of three players:

– Player 1

– Player 2(a)

– Player 2(b)

(73)

The payoffs

The payoffs

Player 1

(B,B)

(B,C)

(C,B)

(C,C)

B

2

1

1

0

C

0

0.5

0.5

1

Player 1

(Player 2(a), Player 2(b))

Player 2(a) B C

B

1 0

C

0 2

Player 1

Player 2(a)

Player 2(b) B C

B

0 2

C

1 0

Player 2(b)

Player 1

(74)

The scenario is like…

The scenario is like…

Player 1 plays against players 2(a) and 2(b)

Player 2(a) plays against players 1 and 2(b).

Her payoff is independent of Player 2(b)’s

moves.

Player 2(b) plays against players 1 and 2(a).

Her payoff is independent of Player 2(a)’s

moves.

(75)

Eight actions

Eight actions

(B, (B, B)), (C, (B, B))

(B, (B, C)), (C, (B, C))

(B, (C, B)), (C, (C, B))

(B, (C, C)), (C, (C, C))

Question: Which of these eight actions are

Nash equilibrium of the Bayesian game?

(76)

Nash equilibrium

Nash equilibrium

(B, (B,C)) is the unique Nash equilibrium of the

Bayesian game.

一言以蔽之 ?

– Given that

Player 1 takes action B,

Player 2(a) takes action B, and

Player 2(b) takes action C,

none of them has a strictly better action (i.e., pure

strategy.)

(77)

Another example

Another example

(78)

Player 1’s belief

Player 1’s belief

Brokebac

k

Crash

Brokeback

2,1

0,0

Crash

0,0

1,2

Brokebac

k

Crash

Brokeback

2,0

0,2

Crash

0,1

1,0

Player 2 wishes to meet player 1

Player 2 wishes to avoid player 1

(79)

Player 1 against two

Player 1 against two

types of player 2

types of player 2

(B,B)

(B,C)

(C,B)

(C,C)

B

2

=2*.25

+2*.75

0.5

=2*.25

+0*.75

1.5

=0*.25

+2*.75

0

=0*.25

+0*.75

C

0

=0*.25

+0*.75

0.75

=0*.25

+1*.75

0.25

=1*.25

+0*.75

1

=1*.25

+1*.75

(80)

The payoffs

The payoffs

Player 1

(B,B)

(B,C)

(C,B)

(C,C)

B

2

0.5

1.5

0

C

0

0.75

0.25

1

Player 1

(Player 2(a), Player 2(b))

Player 2(a) B C

B

1 0

C

0 2

Player 1

Player 2(a)

Player 2(b) B C

B

0 2

C

1 0

Player 2(b)

Player 1

(81)

Eight actions

Eight actions

(B, (B, B)), (C, (B, B))

(B, (B, C)), (C, (B, C))

(B, (C, B)), (C, (C, B))

(B, (C, C)), (C, (C, C))

Question: Which of these eight actions are

Nash equilibrium of the Bayesian game?

(82)

Nash equilibrium

Nash equilibrium

This Bayesian game does not have any

Nash equilibrium.

(83)

Formal definition

Formal definition

A

Bayesian game

consists of

F

a ¯nite set N = f 1;2;:::;ng of players,

F

a ¯nite set

of states, and

F

for each player i 2 N

I

a set A

i

of actions,

I

a ¯nite set T

i

of signals that she may receive,

I

a signal function ¿

i

that maps each state in

to a signal in T

i

,

I

a probability measure p

i

(¢j t

i

) on

for each signal t

i

in T

i

,

I

a preference relation %

i

(or a payo®function) over

(84)

Illustration using the

Illustration using the

second example

second example

States = {stat_meet, stat_avoid}

Actions for each player: {Brokeback, Crash}.

Signal:

– Player 1 has only one signal: {x}.

– Player 2 has two signals: {sig_meet, sig_avoid}.

Belief:

– Player 1 (after receiving x) always assigns probability

0.25 to stat_meet and 0.75 to stat_avoid.

– Player 2,

if receiving sig_meet, assigns probability 1 to stat_meet; and,

if receiving sig_avoid, assigns probability 1 to stat_avoid.

(85)

Two states

Two states

Payoffs

B

C

B 2,1 0,0

C 0,0 1,2

B

C

B 2,0

0,2

C 0,1

1,0

(86)

Two signals of player 2

Two signals of player 2

B

C

B 2,1 0,0

C 0,0 1,2

B

C

B 2,0 0,2

C 0,1 1,0

If she receives sig_meet, then

Pr[stat_meet] = 1 and

Pr[stat_avoid] = 0.

If she receives sig_avoid, then

Pr[stat_avoid] = 1 and

(87)

States versus signals

States versus signals

Each state corresponds to a game.

Each signal for a player corresponds to a type of

the player.

When a player receives a signal, she believes in a

particular probability distribution over the states.

– The distribution describes the belief of the

corresponding type of player about the distributions

of the games.

(88)

Nash equilibrium

Nash equilibrium

A Nash equilibrium of a Bayesian game is a Nash equilibrium for the

strategic game for

P

i

jT

i

j players, in which the payo®for the i-th

player of type t

i

with respect to action a is

X

! 2

Pr(! j t

i

) ¢u

i

((a

i

; ^

a

¡ i

(! )); ! );

(89)

Notation

Notation

F

a

i

, which has to be a member of A

i

, is the action of the i-th

player speci¯ed in a.

F

u

i

(¢;! ) is the pay-o®for the i-th player for state ! .

F

¿

j

(! ) is the type (i.e., signal) of the j -th player corresponds to

state ! . Note that ¿

j

is a function from

to T

j

, which could

be many-to-one.

F

a

^

j

= a(j ;¿

j

(! )) is the action of the j -th player of type ¿

j

(! )

that is speci¯ed in action a.

F

a

^

¡ i

= (^

a

1

; ^

a

2

;:::; ^

a

i ¡ 1

; ^

a

i +1

; ^

a

i +2

;:::; ^

a

n

).

F

Pr(! j t

i

) is the belief of the i-th player of type t

i

about the

distribution of .

(90)

More information may

More information may

hurt…

hurt…

(91)

Two players, two states

Two players, two states

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

(92)

The setting

The setting

States = {stat_left, stat_right}

Actions for player 1: {T, B}.

Actions for player 2: {L, M, R}

Signals:

– Player 1 has only one signal: {sig_p1}.

– Player 2 has only one signal: {sig_p2}.

Belief:

– Player 1 (after receiving sig_p1) always assigns probability 0.5

to stat_left and 0.5 to stat_right

– Player 2 (after receiving sig_p2) always assigns probability 0.5

to stat_left and 0.5 to stat_right.

(93)

Player 1’s viewpoint

Player 1’s viewpoint

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

Each player assign probability 0.5 to each state.

L

M

R

T

1

1

1

(94)

Player 2’s viewpoint

Player 2’s viewpoint

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

Each player assign probability 0.5 to each state.

L

M

R

T 0.5

0.375

0.375

(95)

The Nash equilibrium

The Nash equilibrium

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

Each player assign probability 0.5 to each state.

L

M

R

T 1, 0.5 1, 0.375 1, 0.375

B 2, 2

0, 1.5

0, 1.5

(96)

Information

Information

The original setting is like both players

have “no information” about the

distribution of the game.

What if player 2 knows that the game they

are playing is the left one?

(97)

Player 1’s viewpoint

Player 1’s viewpoint

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

L

M

R

T

1

1

1

B

2

0

0

Prob.

= 0.5

Prob.

= 0.5

(98)

Player 2’s viewpoint

Player 2’s viewpoint

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

L

M

R

T 0.5

0

0.75

B

2

0

3

Prob. = 1

Prob. = 0

(99)

The Nash equilibrium

The Nash equilibrium

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

L

M

R

T 1, 0.5

1, 0

1, 0.75

B 2, 2

0, 0

0, 3

Prob. = 1

Prob. = 0

Prob.

= 0.5

Prob.

= 0.5

(100)

Similarly, if player 2 is told that

Similarly, if player 2 is told that

they are playing the right

they are playing the right

game...

game...

L

M

R

T 1, 0.5 1, 0 1, 0.75

B 2, 2 0, 0

0, 3

L

M

R

T 1, 0.5 1, 0.75 1, 0

B 2, 2

0, 3

0, 0

L

M

R

T 1, 0.5 1, 0.75

1, 0.

B 2, 2

0, 3

0, 0

Prob. = 0

Prob. = 1

Prob.

= 0.5

Prob.

= 0.5

(101)

More information may

More information may

hurt…

hurt…

(102)

應同學要求

應同學要求

舉一個 signals 與 states 間之對

應關係稍微複雜一點的例子

(103)

Brokeback or Crash

Brokeback or Crash

Belief of Player 1, who prefers B to C.

– with probability 1/2 player 2 wants to go to movie together, and

– with probability 1/2 player 2 does not want to go to movie

together.

Belief of Player 2, who prefers C to B.

– with probability 2/3 player 1 wants to go to movie together, and

– with probability 1/3 player 1 does not want to go to movie

together.

Each player knows whether or not herself wants to go to

(104)

yy B

C

B 2,1 0,0

C 0,0 1,2

yn B

C

B 2,0 0,2

C 0,1 1,0

ny B

C

B 0,1 2,0

C 1,0 0,2

nn B

C

B 0,0 2,2

C 1,1 0,0

player 1

想一起去

player 1

不想一起去

player 2

想一起去

player 2

不想一起去

1/2

1/2

1/2

1/2

2/3

2/3

1/3

1/3

(105)

yy B

C

B 2,1 0,0

C 0,0 1,2

yn B

C

B 2,0 0,2

C 0,1 1,0

ny B

C

B 0,1 2,0

C 1,0 0,2

nn B

C

B 0,0 2,2

C 1,1 0,0

player 1

想一起去

player 1

不想一起去

player 2

想一起去

player 2

不想一起去

1/2

1/2

1/2

1/2

2/3

2/3

1/3

1/3

(106)

States and signals

States and signals

F

State space = f yy;yn;ny;nng.

F

Signals (i.e., types)

I

T

1

= f y

1

;n

1

g.

I

T

2

= f y

2

;n

2

g.

F

Mapping

I

¿

1

(yy) = ¿

1

(yn) = y

1

and ¿

1

(ny) = ¿

1

(nn) = n

1

.

I

¿

2

(yy) = ¿

2

(ny) = y

2

and ¿

2

(yn) = ¿

2

(nn) = n

2

.

F

Belief

I

Pr(yy j y

1

) = Pr(yn j y

1

) = 0:5 and Pr(ny j y

1

) = Pr(nn j y

1

) = 0.

I

Pr(yy j n

1

) = Pr(yn j n

1

) = 0 and Pr(ny j n

1

) = Pr(nn j n

1

) = 0:5.

I

Pr(yy j y

2

) = 2=3, Pr(ny j y

2

) = 1=3, and Pr(yn j y

2

) = Pr(nn j y

2

) = 0.

(107)

E.g., Player 1’s expected payoff

E.g., Player 1’s expected payoff

The expected payo®of player 1 of type y

1

, if player 1 chooses B and

player 2 chooses (B;C) (i.e., a = (B;(B;C))), is

X

! 2

Pr(! j y

1

) ¢u

1

((B;a(2;¿

2

(! )));! )

= 0:5¢(u

1

((B;a(2;¿

2

(yy)));yy) + u

1

((B;a(2;¿

2

(yn)));yn)) +

0¢(u

1

((B;a(2;¿

2

(ny)));ny) + u

1

((B;a(2;¿

2

(nn)));nn))

= 0:5¢(u

1

((B;a(2;y

2

));yy) + u

1

((B;a(2;n

2

));yn))

= 0:5¢(u

1

((B;B);yy) + u

1

((B;C);yn))

= 0:5¢(2+ 0)

= 1:

(108)

Therefore, we have

Therefore, we have

Player 1 (B, B) (B, C) (C, B) (C, C)

B

1

(109)

E.g., Player 2’s expected payoff

E.g., Player 2’s expected payoff

T heexpected payo®of player 2 of typen

2

, if player 1 chooses (C;B ) and player

2 chooses C (i.e., a = ((C;B);C)), is

X

! 2

Pr(! j n

2

) ¢u

2

((a(1;¿

1

(! )); C);! )

=

2

3

¢u

2

((a(1;¿

1

(yn));C); yn) +

1

3

¢u

2

((a(1;¿

1

(nn));C);nn) +

0¢(u

2

((a(2; ¿

1

(ny));C);ny) + u

2

((a(2;¿

1

(yy)); C);yy))

=

2

3

¢u

2

((a(1;y

1

);C);yn) +

1

3

¢u

2

((a(1;n

1

);C);nn)

=

2

3

¢u

2

((C; C);yn) +

1

3

¢u

2

((B ;C);nn)

=

2

3

¢0+

1

3

¢2

=

2

3

:

(110)

Therefore, we have

Therefore, we have

Player 2

B

C

(B, B)

(B, C)

(C, B)

2/3

(C, C)

數據

Illustration using the Illustration using the

參考文獻

相關文件

In Section 3, the shift and scale argument from [2] is applied to show how each quantitative Landis theorem follows from the corresponding order-of-vanishing estimate.. A number

(Some campers may react faster than others to familiar words.) 5. The game can be repeated for a few rounds.. Before starting the game, the camp leaders show the campers both

Now, nearly all of the current flows through wire S since it has a much lower resistance than the light bulb. The light bulb does not glow because the current flowing through it

We can therefore hope that the exact solution of a lower-dimensional string will provide ideas which could be used to make an exact definition of critical string theory and give

If the skyrmion number changes at some point of time.... there must be a singular point

Researches of game algorithms from earlier two-player games and perfect information games extend to multi-player games and imperfect information games3. There are many kinds of

Midpoint break loops are useful for situations where the commands in the loop must be executed at least once, but where the decision to make an early termination is based on

⚫ Students should be able to create interactive user selection, such as the 2-level interdependent select list, pull down menu and click-to-expand menu. Students should be able