Quantal response equilibrium

Quantal response equilibrium
A solution concept in game theory
Relationship
Superset of: Nash equilibrium, Logit equilibrium
Significance
Proposed by: Richard McKelvey and Thomas Palfrey
Used for: Non-cooperative games
Example: Traveler's dilemma

Quantal response equilibrium (QRE) is a solution concept in game theory. First introduced by Richard McKelvey and Thomas Palfrey,[1][2] it provides an equilibrium notion with bounded rationality. QRE is not an equilibrium refinement, and it can give significantly different results from Nash equilibrium. QRE is only defined for games with discrete strategies, although there are continuous-strategy analogues.

In a quantal response equilibrium, players are assumed to make errors in choosing which pure strategy to play. The probability of any particular strategy being chosen is positively related to the payoff from that strategy. In other words, very costly errors are unlikely.

The equilibrium arises from the realization of beliefs. A player's payoffs are computed based on beliefs about other players' probability distribution over strategies. In equilibrium, a player's beliefs are correct.
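
The requirement that beliefs be correct can be written as a fixed-point condition. The notation here is an assumption introduced for illustration: $P_i$ is player $i$'s vector of choice probabilities, $EU_i(P_{-i})$ is the vector of expected payoffs to each of player $i$'s pure strategies given the other players' probabilities $P_{-i}$, and $\sigma_i$ is a quantal response function that turns expected payoffs into strictly positive choice probabilities, assigning more probability to strategies with higher expected payoff.

```latex
\begin{aligned}
P_i &= \sigma_i\bigl(EU_i(P_{-i})\bigr) \quad \text{for every player } i, \\
\sigma_{ij}(u) &> 0 \quad \text{for every strategy } j, \\
u_j > u_k &\;\Rightarrow\; \sigma_{ij}(u) > \sigma_{ik}(u).
\end{aligned}
```

The first line says that the probabilities used to compute expected payoffs are the same probabilities that the response functions produce, which is the sense in which beliefs are correct.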

Application to data

When analyzing data from the play of actual games, particularly from laboratory experiments such as those with the matching pennies game, Nash equilibrium can be unforgiving. Any non-equilibrium move can appear equally "wrong", but realistically such moves should not by themselves be used to reject a theory. QRE allows every strategy to be played with non-zero probability, so any data are possible (though not necessarily reasonable).

Logit equilibrium

The most common specification for QRE is logit equilibrium (LQRE). In a logit equilibrium, players' strategies are chosen according to the probability distribution:

$$P_{ij} = \frac{\exp\bigl(\lambda\, EU_{ij}(P_{-i})\bigr)}{\sum_{k} \exp\bigl(\lambda\, EU_{ik}(P_{-i})\bigr)}$$

$P_{ij}$ is the probability of player $i$ choosing strategy $j$. $EU_{ij}(P_{-i})$ is the expected utility to player $i$ of choosing strategy $j$ under the belief that other players are playing according to the probability distribution $P_{-i}$. Note that the "belief" density in the expected payoff on the right side must match the choice density on the left side. Thus computing expectations of observable quantities such as payoff, demand, output, etc., requires finding fixed points as in mean field theory.[3]
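
The fixed-point requirement can be illustrated with a minimal sketch for two-player games, assuming the logit rule in which each strategy's probability is proportional to exp(λ × expected utility). The function names `logit_response` and `logit_qre`, the damping factor, and the prisoner's dilemma payoffs are all hypothetical choices made for this example. Damped iteration is only one simple way to look for a fixed point and is not guaranteed to converge for every game or every λ.

```python
import math

def logit_response(utilities, lam):
    """Logit choice probabilities for a list of expected utilities."""
    m = max(utilities)  # subtract the maximum for numerical stability
    weights = [math.exp(lam * (u - m)) for u in utilities]
    total = sum(weights)
    return [w / total for w in weights]

def logit_qre(A, B, lam, damping=0.5, iterations=500):
    """Damped fixed-point iteration for a two-player game (illustrative).

    A[j][k] and B[j][k] are the payoffs to the row and column player
    when the row player chooses j and the column player chooses k.
    """
    n_rows, n_cols = len(A), len(A[0])
    p = [1.0 / n_rows] * n_rows  # row player's mixed strategy
    q = [1.0 / n_cols] * n_cols  # column player's mixed strategy
    for _ in range(iterations):
        # Expected utilities under the current beliefs about the opponent.
        eu_row = [sum(q[k] * A[j][k] for k in range(n_cols)) for j in range(n_rows)]
        eu_col = [sum(p[j] * B[j][k] for j in range(n_rows)) for k in range(n_cols)]
        p_new = logit_response(eu_row, lam)
        q_new = logit_response(eu_col, lam)
        # Move part of the way toward the logit responses.
        p = [(1 - damping) * old + damping * new for old, new in zip(p, p_new)]
        q = [(1 - damping) * old + damping * new for old, new in zip(q, q_new)]
    return p, q

# Prisoner's dilemma with assumed payoffs (index 0 = cooperate, 1 = defect).
A = [[3, 0], [5, 1]]  # row player's payoffs
B = [[3, 5], [0, 1]]  # column player's payoffs
p, q = logit_qre(A, B, lam=1.0)
print(p, q)
```

In this example defecting is the more likely choice for both players, but cooperating keeps a strictly positive probability. At the returned point, applying `logit_response` to the expected utilities reproduces `p` and `q` up to numerical tolerance, which is the sense in which beliefs match choices.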

Of particular interest in the logit model is the non-negative parameter λ (sometimes written as 1/μ). λ can be thought of as the rationality parameter. As λ→0, players become "completely non-rational", and play each strategy with equal probability. As λ→∞, players become "perfectly rational", and play approaches a Nash equilibrium.[4] In a non-mean-field variant of QRE, the Gibbs measure is the resulting form of the equilibrium measure, and this parameter λ is in fact the inverse of the temperature of the system which quantifies the degree of random noise in decisions.[5]
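
The two limits can be checked directly from the logit choice rule, in which the probability of a strategy is proportional to the exponential of λ times its expected utility. The sketch below holds beliefs fixed, so it describes the limiting response of a single player rather than the full equilibrium. The symbol $n_i$, the number of pure strategies of player $i$, is an assumption introduced for this example.

```latex
\begin{aligned}
P_{ij} &= \frac{\exp(\lambda\, EU_{ij})}{\sum_{k} \exp(\lambda\, EU_{ik})}
        = \frac{1}{\sum_{k} \exp\bigl(\lambda\,(EU_{ik} - EU_{ij})\bigr)}, \\
\lambda \to 0 &: \quad \exp\bigl(\lambda\,(EU_{ik} - EU_{ij})\bigr) \to 1
        \;\Rightarrow\; P_{ij} \to \frac{1}{n_i}, \\
\lambda \to \infty &: \quad EU_{ik} > EU_{ij} \text{ for some } k
        \;\Rightarrow\; P_{ij} \to 0.
\end{aligned}
```

In the second limit only best responses to the given beliefs keep positive probability, which is the condition a Nash equilibrium imposes once beliefs are also correct.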

For dynamic games

For dynamic (extensive form) games, McKelvey and Palfrey defined agent quantal response equilibrium (AQRE). AQRE is somewhat analogous to subgame perfection. In an AQRE, each player plays with some error as in QRE. At a given decision node, the player determines the expected payoff of each action by treating their future self as an independent player with a known probability distribution over actions. As in QRE, in an AQRE every strategy is used with nonzero probability.
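
The node-by-node reasoning can be sketched for a small game of perfect information, assuming a logit choice rule at every decision node. In that special case the choice probabilities can be computed by backward induction, because the expected payoff of each action depends only on choices made later in the tree. The tree encoding, the function names and the two-stage game below are hypothetical and chosen only for illustration.

```python
import math

def logit_choice(values, lam):
    """Logit choice probabilities over a list of expected payoffs."""
    m = max(values)  # subtract the maximum for numerical stability
    weights = [math.exp(lam * (v - m)) for v in values]
    total = sum(weights)
    return [w / total for w in weights]

def solve_aqre(node, lam, path=(), probs=None):
    """Backward induction with logit errors at each decision node.

    A terminal node is {"payoffs": (...)}; a decision node is
    {"player": index, "actions": {name: subtree}}.
    Returns the expected payoff vector at `node` and a dictionary that
    maps each decision node (identified by the path of actions leading
    to it) to its choice probabilities.
    """
    if probs is None:
        probs = {}
    if "payoffs" in node:
        return list(node["payoffs"]), probs
    player = node["player"]
    actions = list(node["actions"])
    # Expected payoffs after each action, given noisy play further down.
    continuation = [
        solve_aqre(node["actions"][a], lam, path + (a,), probs)[0]
        for a in actions
    ]
    # The mover compares only their own expected payoffs.
    choice = logit_choice([c[player] for c in continuation], lam)
    probs[path] = dict(zip(actions, choice))
    n_players = len(continuation[0])
    expected = [
        sum(pr * c[i] for pr, c in zip(choice, continuation))
        for i in range(n_players)
    ]
    return expected, probs

# Two-stage take-or-pass game with assumed payoffs (player 0, player 1).
game = {"player": 0, "actions": {
    "take": {"payoffs": (1, 0)},
    "pass": {"player": 1, "actions": {
        "take": {"payoffs": (0, 2)},
        "pass": {"payoffs": (3, 1)},
    }},
}}

expected, probs = solve_aqre(game, lam=1.0)
print(expected, probs)
```

For any finite λ every action at every node receives positive probability. As λ grows, play in this example concentrates on the path in which player 0 takes immediately, which is the backward-induction outcome for these payoffs.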

Applications

The quantal response equilibrium approach has been applied in various settings. For example, Goeree et al. (2002) study overbidding in private-value auctions,[6] Yi (2005) explores behavior in ultimatum games,[7] Hoppe and Schmitz (2013) study the role of social preferences in principal-agent problems,[8] and Kawagoe et al. (2018) investigate step-level public goods games with binary decisions.[9]

Most tests of quantal response equilibrium are based on experiments in which participants have little or no incentive to perform the task well. However, quantal response equilibrium has also been found to explain behavior in high-stakes environments. A large-scale analysis of the American television game show The Price Is Right, for example, shows that contestants' behavior in the so-called Showcase Showdown, a sequential game of perfect information, can be well explained by an agent quantal response equilibrium (AQRE) model.[10]

Critiques

Non-falsifiability

Work by Haile et al. has shown that QRE is not falsifiable in any normal form game, even with significant a priori restrictions on payoff perturbations.[11] The authors argue that the LQRE concept can sometimes restrict the set of possible outcomes from a game, but may be insufficient to provide a powerful test of behavior without a priori restrictions on payoff perturbations.

Loss of information

As in statistical mechanics, the mean-field approach, specifically the expectation in the exponent, results in a loss of information.[12] More generally, differences in an agent's payoff with respect to their strategy variable result in a loss of information.

References

  1. ^ McKelvey, Richard; Palfrey, Thomas (1995). "Quantal Response Equilibria for Normal Form Games". Games and Economic Behavior. 10: 6–38. CiteSeerX 10.1.1.30.5152. doi:10.1006/game.1995.1023.
  2. ^ McKelvey, Richard; Palfrey, Thomas (1998). "Quantal Response Equilibria for Extensive Form Games" (PDF). Experimental Economics. 1: 9–41. doi:10.1007/BF01426213.
  3. ^ Anderson, Simon P.; Goeree, Jacob K.; Holt, Charles A. (2004). "Noisy Directional Learning and the Logit Equilibrium". The Scandinavian Journal of Economics. 106 (3): 581–602. CiteSeerX 10.1.1.81.8574. doi:10.1111/j.0347-0520.2004.00378.x. S2CID 14404020.
  4. ^ Goeree, Jacob K.; Holt, Charles A.; Palfrey, Thomas R. (August 2018). "Stochastic Game Theory for Social Science: A Primer on Quantal Response Equilibrium" (PDF). pp. 10–11. Archived (PDF) from the original on August 4, 2023.
  5. ^ Michael J. Campbell; Vernon L. Smith (2021). "An elementary humanomics approach to boundedly rational quadratic models". Physica A. 562: 125309. doi:10.1016/j.physa.2020.125309. S2CID 221726989.
  6. ^ Goeree, Jacob K.; Holt, Charles A.; Palfrey, Thomas R. (2002). "Quantal Response Equilibrium and Overbidding in Private-Value Auctions" (PDF). Journal of Economic Theory. 104 (1): 247–272. doi:10.1006/jeth.2001.2914. ISSN 0022-0531.
  7. ^ Yi, Kang-Oh (2005). "Quantal-response equilibrium models of the ultimatum bargaining game". Games and Economic Behavior. 51 (2): 324–348. doi:10.1016/s0899-8256(03)00051-4. ISSN 0899-8256.
  8. ^ Hoppe, Eva I.; Schmitz, Patrick W. (2013). "Contracting under Incomplete Information and Social Preferences: An Experimental Study". Review of Economic Studies. 80 (4): 1516–1544. doi:10.1093/restud/rdt010.
  9. ^ Kawagoe, Toshiji; Matsubae, Taisuke; Takizawa, Hirokazu (2018). "Quantal response equilibria in a generalized Volunteer's Dilemma and step-level public goods games with binary decision". Evolutionary and Institutional Economics Review. 15 (1): 11–23. doi:10.1007/s40844-017-0081-6. ISSN 1349-4961. S2CID 189937929.
  10. ^ Klein Teeselink, Bouke; van Dolder, Dennie; van den Assem, Martijn J.; Dana, Jason (2022-06-29). "High-Stakes Failures of Backward Induction: Evidence from The Price Is Right". SSRN 4130176.
  11. ^ Haile, Philip A.; Hortaçsu, Ali; Kosenok, Grigory (2008). "On the Empirical Content of Quantal Response Equilibrium". American Economic Review. 98 (1): 180–200. CiteSeerX 10.1.1.193.7715. doi:10.1257/aer.98.1.180. S2CID 3083373.
  12. ^ Jessie, Daniel T.; Saari, Donald G. (2016). "From the Luce Choice Axiom to the Quantal Response Equilibrium". Journal of Mathematical Psychology. 75: 3–9. doi:10.1016/j.jmp.2015.10.001.