Repeated games
From Theory
Scribe: Luyi Gui
Finitely Repeated Game
Given a finite game G = (N,A,u), define a finitely repeated game GT,
as follows. For every time period t = 1,2,...,T, player i chooses an action
. The vector at is defined as having i-th coordinate
. The history of the game up to round t is then ht = (a1,a2,...,at − 1). The set of all possible history vectors ht on round t is At − 1, the cross product of A with itself t − 1 times. The payoff to player i is obtained as an average of his payoffs on the individual rounds of the game, giving a total payoff of
. A pure strategy for player i is defined as a function
, where
. Similarly, a mixed strategy for player i is defined as a function
. The idea of this definition is that, given any sequence of moves, the strategy will say what to do next. Therefore, it is assumed that player i chooses his action
by following a strategy si and based on the history of the game. More precisely,
.
Infinitely Repeated Game
Given the game G as above, one could attempt to define an infinitely repeated game
as follows. A pure strategy for player i is defined as a function
, where
. Similarly, a mixed strategy is defined as a function
.
It might seem natural to define the payoff function in
as
. However, this limit might not exist so the game
is not defined in this generality. Instead, introducing a discount parameter
, an infinitely repeated game
is defined as having payoff function for player i given by
. Since
, the total payoff is the weighted sum of the payoff received in each individual game.
In order to understand the definition, one way is to consider the time value of money. An alternative way is to take λ as the probability that there is going to be a next game so that the total payoff evaluates the expected amount of money received. If λ is close to 1, we say that the player is patient. If λ is close to 0, we say that the player is myopic.
The following theorem shows that no matter what the value of λ, there always exists a Nash Equilibrium for
. In fact, the equilibria can be naturally inherited from the original game G. Given a mixed strategy profile σ for G, define a mixed strategy profile ασ for
by
for all players i and all history vectors h.
If σ is a Nash Equilibrium for G, then ασ is a Nash Equilibrium for .
|
Proof: Let U denote the payoff function for the game
. Given any player i, it must be shown that
for every mixed strategy
for player i. It follows from the definition of ασ that when the game
is played following this strategy, the action vectors at are equal to σ for all t. Therefore,
. Similarly, when the game
is played following the strategy
, the action vectors at are given by
. Hence
. Since σ is a Nash equilibrium for G, then
. Multiplying both sides of the inequality by λt − 1 and adding up these inequalities for all t it follows from the previous formulas that
, thus proving that ασ is a Nash equilibrium for
.
Repeated Prisoners' Dilemma
| Cooperate | Defect | |
| Cooperate | 1,1 | -1,2 |
| Defect | 2,-1 | 0,0 |
| Fig. 1: A payoff matrix for Prisoners' Dilemma | ||
Consider two strategies.
Tit for Tat: On the first round, both players play C. On round t + 1, player i players
.
Trigger: On the first round, both players play C. On round t + 1, player i plays C if and only if
.
Both of the above strategies are ε-NE for a finitely repeated game GT where
. The reason is that a player can only earn extra profit by defecting on the final round, which produces
more profit.
In the case of an infinitely repeated game
, it can be proved that <Trigger,Trigger> is a Nash Equilibrium as long as the players are all patient.
If , the strategy <Trigger,Trigger> is a Nash Equilibrium.
|
Proof: If both players play according to Trigger, then their payoff will be 1.
Let k be the number of the first round that player i does not cooperate. Because the other player is playing Trigger, player i will defect for the rest of the game. Then his payoff will be
It's easy to see that when
, neither player will earn positive profit by deviating to other strategies.
Q.E.D.
<Trigger, Trigger> is also a subgame perfect Nash Equilibrium.
