A Cooperative Dual to the Nash Equilibrium  for Two-Person Prescriptive Games

Corley, H. W.; Kwain, Phantipa

doi:https://doi.org/10.1155/2014/806794

Journal of Applied Mathematics

On this page

Abstract Introduction Conclusions References Copyright Related Articles

Special Issue

Operational Research 2014

View this Special Issue

Research Article | Open Access

Volume 2014 | Article ID 806794 | https://doi.org/10.1155/2014/806794

A Cooperative Dual to the Nash Equilibrium for Two-Person Prescriptive Games

H. W. Corley¹and Phantipa Kwain²

Academic Editor: Mohammad Khodabakhshi

Received26 Feb 2014

Accepted03 Jun 2014

Published17 Jun 2014

Abstract

An alternative to the Nash equilibrium (NE) is presented for two-person, one-shot prescriptive games in normal form, where the outcome is determined by an arbiter. The NE is the fundamental solution concept in noncooperative game theory. It is based on the assumption that players are completely selfish. However, NEs are often not played in practice, so we present a cooperative dual as an alternative solution concept by which an arbiter can assign the players' actions. In this dual equilibrium (DE), each player acts in the other's best interest. We formally define prescriptive games and the DE, then summarize the duality relationships between the NE and DE for two players. We also apply the DE to some prescriptive games and compare it to other outcomes.

1. Introduction

Game theory is the study of strategic interactions among agents called players. Ultimately it involves a solution concept to describe, predict, or prescribe the choices of these players [1]. Modern game theory [2, 3] is predominantly noncooperative and assumes that any joint rational actions by the players must be a Nash equilibrium (NE) [1–5]. In other words, rational players act in their individual self-interest. Each player’s action maximizes his payoff for the actions of the other players. The result is that no player can improve his expected payoff by unilaterally changing his strategy. Various refinements of the NE [2, 3] have been proposed, yet players can often do better by cooperating. Social dilemmas such as the Prisoner’s Dilemma, Snow drift, and Ultimatum games [6–9] illustrate that selfish behavior may conflict with group interests.

To address such issues, we consider here one-shot, two-person prescriptive games in normal form, where the outcome is determined by an arbiter. In this paper we provide the arbiter with an alternative approach to the NE for assigning the players’ actions. Our framework is prescriptive because the assumptions of noncooperative games are often not met in practice and because outcomes are often influenced by external forces. An arbiter can assign reasonable actions to both players that would be precluded by selfish strategies chosen by the players themselves. Pure strategies are emphasized here. Mixed strategies are somewhat problematic to interpret [1, 10] for noncooperative games but even more so when an arbiter for a one-shot game must specify an action for each player.

In Section 2 we define prescriptive games and the dual equilibrium (DE) in which each player acts in the other’s best interest. In Section 3 we describe how to obtain pure DEs and present some examples. In Section 4 we summarize the duality relationships between the DE and NE, which do not extend to one-shot prescriptive games with more than two players. In Section 5 we consider the special case of zero-sum games. In Section 6 we present conclusions and discuss future work.

2. The Dual Equilibrium

Let denote a two-person, one-shot prescriptive game in normal form, where is the payoff matrix of von Neumann-Morgenstern (VNM) utilities for Player I when Player I plays pure strategy and Player II plays pure strategy . Similarly, is the payoff matrix for Player II. The prescriptive mechanism is an arbiter who assigns unique actions to the players for their one shot. The arbiter could be a person or group of people. It could, for example, be a licensing agreement for the licensees of a patent. The arbiter could also be a person selected to rule in a formal legal arbitration. It could be a computer algorithm for making real-time decisions on a website where the players have agreed to its terms and conditions, as well as a policy imposed by a governmental agency on some segment of the population. In this paper, the arbiter will assign pure DE strategies to the two players. Hence the arbiter could even be a tacit agreement between the two players based on social pressures that dictate that the players should cooperate unselfishly. In this case, their joint notion of rationality based on social pressure is incorporated in the DE. When is implicit, as in such an agreement, is simply referred to as the game . If there are multiple pure DEs, we assume that the arbiter selects a unique one. Regardless, a strategy pair assigned by is an equilibrium in the sense that the players cannot change the prescribed actions.

An NE and DE for are next defined in terms of mixed strategies. Let be the set of mixed strategies of Player I and the set of mixed strategies for Player II.

Definition 1 (NE). The mixed strategy pair is an NE for if and only if

Definition 2 (DE). The mixed strategy pair is a DE for if and only if

Definitions 1 and 2 depict one aspect of the duality between the NE and the DE, which is the players’ opposing behaviors. In (1) each player selfishly responds to the NE strategy for the other player so as to maximize his own expected utility. In (2) each player unselfishly responds to the DE strategy for the other player so as to maximize the expected utility of the other player. In other words, in an NE no player can improve his payoff with a unilateral change in his strategy. In a DE a unilateral change in either player’s strategy cannot improve the other player’s payoff. A DE is a mutual-max outcome used in [11, page 1282] in defining a fairness equilibrium. A joint equilibrium (JE), which is both an NE and DE, incorporates selfishness and unselfishness in one outcome. It is a special case of the Rabin fairness equilibrium.

3. Computing Pure DEs

Pure NEs and DEs are easily obtained from the notions of regret and disappointment for a game . The regret function is a transformation of a player’s VNM utilities for pure strategies to a loss function. For a fixed pure strategy of Player II, Player I’s regret for using pure strategy is the regret function value . For Player II, . The bimatrix can thus be transformed into a regret bimatrix that has the same NEs [5] as the bimatrix . In particular, a pure strategy pair is an NE if and only if is the corresponding entry in . Likewise, the bimatrix can be transformed into a disappointment matrix , where disappointment for a player may be interpreted as regret with respect to the other player. For a fixed pure strategy of Player I, Player I’s disappointment at Player II’s using pure strategy is Player I’s disappointment function value , while for Player II. The proof of Proposition 3 below is similar to that in [5] showing that has the same NEs as .

Proposition 3. has the same DEs as , and a pure strategy pair is a DE if and only if is the corresponding entry in .

Example 4 (Prisoner’s Dilemma game). Table 1 shows the matrices , , and from left to right for a Prisoner’s Dilemma game [6], where the two players are arrested for a crime and held in separate rooms. To cooperate means to deny that either player had any part in the crime. To defect means to swear that the other player committed the crime alone. For each strategy pair, the VNM utility denotes years spent in jail. In a prescriptive version of the game, the arbiter could be a lawyer who represents both players and tells them how to respond when interrogated. There is a pure NE (defect, defect) whose payoff is dominated [12] by the payoff of the pure DE (cooperate, cooperate). The maximin outcome [2], in which each player’s action maximizes his minimum payoff resulting from the actions of the other players, is the NE (defect, defect).

Example 5 (Snow Drift game). Table 2 gives , , and for the Snow Drift game, which involves two drivers trapped on opposite sides of a snow drift blocking a road. Each has the option of staying in his car or shoveling snow to clear a path. The Snow Drift game has been said to more realistically reflect social situations that humans face than Prisoner’s Dilemma [7]. In a prescriptive version of the game, the two drivers could be neighbors, and the arbiter could be the social pressure to cooperate and preserve good will in the players’ future interactions as neighbors. The pure NEs are (shovel, refuse) and (refuse, shovel). The pure DE is (shovel, shovel). The maximin outcome is the DE (shovel, shovel), in contrast to being the NE in Example 4. There is also a mixed NE and DE.

Example 6 (JE). Consider the matrices , , and of Table 3. The strategy pair is a JE, but for is dominated by the DE with payoffs and so is not a Pareto optimum [12] for . However, it is a Rabin fairness equilibrium. An arbiter prescribing a pure DE would assign to the players. Any outcome in the or rows of Table 3 is a maximin outcome. That includes the DE but not the JE.

4. Duality Relationships

We now summarize the duality relationships between the NE and DE that exist for two-person games. The propositions below follow immediately from the definitions.

Definition 7. The two-person game with Player I as the row player and Player II as the column player is the dual of the game also with Player I as the row player and Player II as the column player.

Proposition 8. The dual game of the dual game of is .

Definition 9. For the bimatrix , define its swap matrix as . Denote the swap matrices of and by and , respectively.

Proposition 10. and . Hence, the set of DEs for is the set of NEs for , and the set of NEs for is the set of DEs for .

In the dual game of Definition 7 the players simply play for each other. Proposition 10 implies that any computational approaches and existence properties for two-player NEs are also valid for two-player DEs. In particular, a computational method for finding an NE for can therefore be used to find a DE for . Moreover, a DE exists for since an NE exists for [2]. Games with more than two players, however, do not exhibit such duality.

5. Zero-Sum Games

To find DEs for the zero-sum game we need only consider the matrix for Player I as in the case for zero-sum NEs. Proposition 11 states the standard NE version of the minimax theorem [13] for zero-sum games in (3) below for comparison with the DE version stated in (4). The proof of (4) follows immediately from (3) and Proposition 10.

Proposition 11. Consider the zero-sum game . Then there exists a value such that for any NE In addition, there exists a value w such that, for any DE ,

Proposition 11 asserts that a pure DE is obtained for zero-sum games when the minimax value for row Player I equals the maximin value for column Player II. This situation is exactly the opposite of the standard approach for finding pure zero-sum NEs, which are also called saddle points. It should be noted that the value in (3) may be larger, smaller, or equal to the value in (4). In addition, it follows from Proposition 10 that the linear programs for finding mixed DE strategies and y for the zero-sum game are identical to the linear programs [2] for finding mixed NE strategies and y, respectively, for the dual game . In other words, the in the NE linear programs are replaced by .

Example 12. Consider the zero-sum matrix game with as in Table 4. There is no pure NE. The single mixed NE is and with an expected payoff of 4 for Player I. On the other hand, the single DE occurs at from the discussion following Proposition 11. At the minimax payoff for Player I is 6, and the maximin payoff for Player II is therefore –6. In this example, the pure DE does not seem as good for Player II as Player I. The arbiter might well assign some outcome different from the DE.

6. Conclusions

In this paper we defined a two-person, one-shot prescriptive game, as well as a cooperative dual to the Nash equilibrium. Prescriptive games allow other factors than the players themselves to influence outcomes and also let nonselfish behavior be regarded as rational. In particular, the DE sometimes gives the players better payoffs than the NE and may thus be a better choice for an arbiter assigning pure strategies to the players. Unfortunately there may be either no pure DE or none satisfactory to the arbiter. Future research should address these issues. One possibility is a scalar equilibrium as in [14] that gives a reasonable outcome in pure strategies. In addition, the DE should be studied for n-person games.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

References

R. Aumann, “What is game theory trying to accomplish?” in Frontiers of Economics, K. Arrow and S. Honkapohja, Eds., pp. 5–46, Basil Blackwell, Oxford, UK, 1985.
View at: Google Scholar
M. Maschler, E. Solan, and S. Zamir, Game Theory, Cambridge University Press, Cambridge, UK, 2013.
R. Myerson, Game Theory: Analysis of Conflict, Harvard University Press, Cambridge, Mass, USA, 1991.
J. F. Nash, “Equilibrium points in N-person games,” Proceedings of the National Academy of Sciences, vol. 36, no. 1, pp. 48–49, 1950.
View at: Publisher Site | Google Scholar
J. F. Nash, “Non-cooperative games,” The Annals of Mathematics, vol. 54, no. 2, pp. 286–295, 1951.
View at: Publisher Site | Google Scholar
W. Poundstone, Prisoner's Dilemma, Random House, New York, NY, USA, 2011.
R. Kümmerli, C. Colliard, N. Fiechter, B. Petitpierre, F. Russier, and L. Keller, “Human cooperation in social dilemmas: comparing the Snowdrift game with the Prisoner's Dilemma,” Proceedings of the Royal Society B, vol. 274, no. 1628, pp. 2965–2970, 2007.
View at: Publisher Site | Google Scholar
M. A. Nowak, K. M. Page, and K. Sigmund, “Fairness versus reason in the ultimatum game,” Science, vol. 289, no. 5485, pp. 1773–1775, 2000.
View at: Publisher Site | Google Scholar
M. Beckenkamp, “A game-theoretic taxonomy of social dilemmas,” Central European Journal of Operations Research, vol. 14, no. 3, pp. 337–353, 2006.
View at: Publisher Site | Google Scholar
A. Rubinstein, “Comments on the interpretation of game theory,” Econometrica, vol. 59, no. 4, pp. 909–924, 1991.
View at: Google Scholar
M. Rabin, “Incorporating fairness into game theory and economics,” The American Economic Review, vol. 83, no. 5, pp. 1281–1302, 1993.
View at: Google Scholar
M. Ehrgott, Multicriteria Optimization, Springer, New York, NY, USA, 2nd edition, 2005.
J. von Neumann and O. Morgenstern, Theory of Games and Economic Behavior, Princeton University Press, Princeton, NJ, USA, 1994.
N. Engsuwan, Scalar equilibria for n-person games [Ph.D. thesis], University of Texas at Arlington, Arlington, TX, USA, 2013.

Copyright

Copyright © 2014 H. W. Corley and Phantipa Kwain. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2858

Downloads

965

Citations