• 沒有找到結果。

Subgame Perfect Nash Equilibrium

6.3 Perfect Signal: Advantage of Playing First

6.3.2 Subgame Perfect Nash Equilibrium

In a sequential game, we will study the subgame perfect Nash equilibrium. Subgame perfect Nash equilibrium is a popular refinement to the Nash equilibrium under the se-quential game. It guarantees that all players choose strategies rationally in every possible subgame. A subgame is a part of the original game. In Chinese restaurant game, any game process begins from player i, given all possible actions before player i, could be a subgame.

Definition 15. A subgame in Chinese restaurant game is consisted of two elements: 1) It begins from customer i; 2) The current grouping before customer i is ni = (ni,1, ..., ni,J) withJj=1ni,j = i− 1.

Definition 16. A Nash equilibrium is a subgame perfect Nash equilibrium if and only if it is a Nash equilibrium for any subgame.

We would like to show the existence of subgame perfect Nash equilibrium in Chinese restaurant game by constructing one. Basically, as a rational customer, customer i should predict the final equilibrium grouping according to his current observation on the choices of previous customers ni and the system state θ. Then, he may choose the table with highest expected utility according to the prediction. Following from this idea, we derive the best response of customers in a subgame.

We first implement the prediction part through two functions as follows. First, let EG(Xs, Ns) be the function that generates the equilibrium grouping for a table setXsand number of customers Ns. The equilibrium grouping is generated by the greedy algorithm

shown in previous section with X being replaced by Xs and N being replaced by Ns. Notice thatXscould be any subset of the total table setX = {1, ..., J}, and Nsis less or equal to N .

Then, let P C(Xs, ns, Ns), where nsdenotes the current grouping observed by the cus-tomer, be the algorithm that generates the set of available tables given nsin the subgame.

The algorithm removes the tables that already occupied by more than the expected number of customers in the equilibrium grouping. This helps the customer remove those unrea-sonable choices and correctly predict the final equilibrium grouping in every subgame.

The basic flow of this algorithm is shown as follows 1) calculate the equilibrium grouping ne given the table set Xs and number of customers Ns, 2) check if there is any overly occupied table by comparing ns with ne. If so, 3) remove these tables fromXs and the customers occupying these tables from Ns, and go back to 1). Otherwise, the algorithm terminates. The procedures of implementing P C(Xs, ns, Ns) are described as follows:

1. Initialize: Xo =Xs, Nt= Ns

2. Xt =Xo, ne = EG(Xt, Nt),Xo ={x|x ∈ Xt, nej ≥ nsj}, Nt= Nsx∈Xs\Xonsx. 3. IfXo ̸= Xt, go back to step 2.

4. OutputXo.

Now, we propose a method to construct a subgame perfect Nash equilibrium. This equilibrium also satisfies (6.5). For each customer i, his strategy in a subgame is

BEise(ni, θ) = arg max

x∈Xi,cand,ni,x<ni,candx

U (Rx(θ), ni,candx ), (6.13)

whereXi,cand = P C(X, ni, N ), Ni,cand = N−x∈X\Xi,candni,x, and ni,cand = EG(Xi,cand, Ni,cand).

The proposed best response BEise∗(ni, θ) chooses the table with the highest utility ac-cording to the predicted equilibrium grouping ni,candand candidate table setXi,cand. The equilibrium grouping ni,candis obtained by EG(Xi,cand, Ni,cand), where the candidate ta-ble setXi,cand is derived by P C(X, ni, N ). In Lemma 6, we show that the above strategy results in the equilibrium grouping in any subgame.

Lemma 6. Given the available table setXs = P C(X, ns, N ), Ns= N−x∈X\Xsnsx, the proposed strategy shown in (6.13) leads to an equilibrium grouping n = EG(Xs, Ns) overXs.

Proof. We prove this by contradiction. Let n = (nj|j ∈ Xs) be the final grouping after all customers choose their tables according to (6.13). Suppose that n̸= n = EG(Xs, Ns), then there exists some tables j that nj > nj. Let table j be the first table that ex-ceeds nsj in this sequential subgame. Since nj > nj, there are at least nj + 1 cus-tomers choosing table j. Suppose the nj + 1-th customer choosing table j is customer i. Let ni = (ni,1, ni,2, ..., ni,J) be the current grouping observed by customer i before he chooses the table. Since customer i is the nj + 1-th customer choosing table j, we have ni,j = nj. Since table j is the first table exceeding n after customer i's choice, we have ni,x ≤ nx ∀x ∈ Xs.

According to the definition of P C(·), none of the tables will be removed from candi-dates. Thus,Xi,cand =Xsand Ni,cand = Ns. We have

ni,cand = EG(Xi,cand, Ni,cand) = EG(Xs, Ns) = n. (6.14)

However, according to (6.13), the customer i should not choose table j since ni,j = nj = ni,candj . This contradicts with our assumption that customer i is the nj+1-th customer choosing table j. Thus, the strategy (6.13) should lead to the equilibrium grouping n = EG(Xs, Ns).

Note that Lemma 6 also shows that the final grouping of the sequential game should be n = EG(X, N) if all customers follow the proposed strategy in (6.13). In the following Lemma, we show that P C(Xs, ns, Ns) removes the tables that are dominated by other tables if all customers follow (6.13).

Lemma 7. Given a subgame with current grouping ns, if table j ̸∈ Xs = P C(X, ns, N ), then table j is never the best response of the customer if all other customers follow (6.13).

Proof. Let n = EG(X, N), and n be the final grouping. We first show that for every table under the final grouping n, there always exists a table providing a less or equal utility

under the grouping n. According to Lemma 6, the final grouping n is an equilibrium grouping overXsif all customers follow (6.13). Additionally, nj = nsj since no customers will choose table j. Assuming that there exists a table k ∈ Xs with nk < nk. Since nj = nsj > nj, we havex∈X\{j}nx <x∈X\{j}nx. Therefore,∃k ∈ Xs that nk > nk. Since n and n are equilibrium groupings overXs, similar to (6.10), we have

U (Rk(θ), nk+ 1)≥ U(Rk(θ), nk)≥ U(Rk(θ), nk + 1)≥ U(Rk(θ), nk)≥ U(Rk(θ), nk+ 1)(6.15)

The first and third inequalities are due to nk < nkand nk > nk, and the second and fourth ones come from the equilibrium grouping condition in (6.5). The equation is valid only when all equalities hold. Thus, if nk < nk,∃k ∈ Xsthat U (Rk(θ), nk) = U (Rk(θ), nk), which means that we can always find a table kproviding the same utility as U (Rk(θ), nk) under grouping n. When nk ≥ nk, we have U (Rk(θ), nk) ≥ U(Rk(θ), nk). Therefore,

∀k ∈ Xs,∃k ∈ Xsthat U (Rk(θ), nk)≥ U(Rk(θ), nk).

Then, we show that table j is dominated by all other tables under n. Since table j is removed by P C(X, ns, N ), we have nsj > nj. Therefore, according the above discussion and the fact that n is an equilibrium grouping, we have∀k ∈ Xs,

U (Rk(θ), nk)≥ min

k∈XsU (Rk(θ), nk)≥ U(Rj(θ), nj+ 1) > U (Rj(θ), nsj + 1). (6.16) Since U (Rj(θ), nsj+ 1) is the highest utility that can be offered by table j, it is dominated by all other tables inXsunder the final grouping n. So, table j is never the best response of the customer.

Theorem 16. There always exists a subgame perfect Nash equilibrium with the cor-responding equilibrium grouping n satisfying (6.5) in a sequential Chinese restaurant game.

Proof. We would like to show that the proposed strategy in (6.13) forms a Nash equilib-rium. Suppose customer i chooses table j in his round according to (6.13). Then, customer i's utility is ui = U (Rj(θ), nj) since based on Lemma 6, the equilibrium grouping nwill be reached at the end.

Now we show that table j is indeed customer i's best response. Let's assume that customer i is the last customer, i.e, i = N , and chooses another table j ̸= j in his round, then his utility becomes U (Rj(θ), nj + 1). However, according to (6.5), we have

uj = U (Rj(θ), nj)≥ U(Rj(θ), nj + 1). (6.17)

Thus, choosing table j is never worse than choosing table j for customer N .

For the case that customer i is not the last customer, we assume that he chooses table jinstead of table j in his round. Since all customers before customer i follows (6.13), we have ni,j ≤ nj ∀j ∈ X. Otherwise, ncannot be reached, which contradicts with Lemma 6.

If ni,j < nj, we have ni+1,j ≤ nj. In addition, we have ni+1,j = ni,j ≤ nj ∀j ∈ X \ {j}, since other tables are not chosen by customer i. Thus, Xi+1,cand = P C(X, ni+1, N ) and Ni,cand = N . According to Lemma 6, the final grouping should be n = EG(X, N).

Thus, the new utility of customer i becomes ui = U (Rj(θ), nj). However, according to (6.13), we have

ui = U (Rj(θ), nj) = arg max

x∈X,ni,x<nxU (Rx(θ), nx)≥ U(Rj(θ), nj) = ui.(6.18)

Thus, choosing table jnever gives customer i a higher utility.

If ni,j = nj, and the final grouping is n = (n1, n2, ..., nJ). Since customer i chooses table j when ni,j = nj, we have nj ≥ ni+1,j = ni,j + 1 = nj+ 1. Thus, we have

ui = U (Rj(θ), nj)≥ U(Rj(θ), nj + 1)≥ U(Rj(θ), nj) = ui, ∀j ∈ X,(6.19)

where the first inequality comes from the equilibrium grouping condition in (6.5), and the second inequality comes from the fact that U (R, n) is decreasing over n and nj nj + 1. Thus, under both cases, choosing table j is never better than choosing table j.

We conclude that{BEise(·)} in (6.13) forms a Nash equilibrium, where the grouping being the equilibrium grouping n.

Finally, we show that the proposed strategy forms a Nash equilibrium in every sub-game. In Lemma 7, we show that if the table j is removed by P C(X, ns, N ), it is never the best response of all remaining customers. Thus, we only need to consider the re-maining table candidatesXs = P C(X, ns, N ) in the subgame. Then, with Lemma 6, we show that for every possible subgame with correspondingXs, the equilibrium grouping n = EG(Xs, Ns) will be achieved at the end of the subgame. Moreover, the above proof shows that if the equilibrium grouping ns will be achieved at the end of the subgame, BEise(·) is the best response function. Therefore, the proposed strategy forms a Nash equilibrium in every subgame, i.e., we have a subgame perfect Nash equilibrium.

In the proof of Theorem 16, we observe that the sequential game structure brings ad-vantages for those customers making decisions early. According to (6.13), customers who make decisions early can choose the table providing the largest utility in the equilibrium.

When the number of customers choosing that table reaches equilibrium number, the sec-ond best table will be chosen until it is full again. For the last customer, he has no choice but to choose the worst one.