Program equilibrium and open-source game theory – an annotated bibliography

Compiled by Caspar Oesterheld (oesterheld@cmu.edu). Last updated Feb 7, 2025.

R. Preston McAfee (May 1984): Effective Computability in Economic Decisions. This paper introduces the basic idea of achieving cooperation in (a variant of) the Prisoner's Dilemma in equilibirum by having players submit programs who then get access to each other's source code. McAfee's program works by syntactic comparison, roughly: "if opponent source code is equal to mine, then cooperate; else defect".
J. V. Howard (1988): Cooperation in the Prisoner's Dilemma, Theory and Decision, 24, pages 203–213. Similar to McAfee in terms of what it shows about program equilibrium.
Section 10.4 of Ariel Rubinstein (1998): Modeling Bounded Rationality, MIT Press. The discussion is brief and based on syntactic comparison but already gives the folk theorem (with proof left as a "project" to the reader).
Moshe Tennenholtz (2004): Program equilibrium, Games and Economic Behavior, 49(2), pages 363–373. Formally defines program equilibrium and proves the folk theorem using programs based on syntactic comparison.
Lance Fortnow (2009): Program equilibria and discounted computation time. TARK '09: Proceedings of the 12th Conference on Theoretical Aspects of Rationality and Knowledge, pages 128–133.
Mihaly Barasz, Paul Christiano, Benja Fallenstein, Marcello Herreshoff, Patrick LaVictoire, and Eliezer Yudkowsky (January 2014): Robust Cooperation in the Prisoner"s Dilemma: Program Equilibrium via Provability Logic. This paper studies programs like the following: "If I can prove that you cooperate against me, I cooperate. Else I defect." Contrary to what one might expect, such a program cooperates when paired with a copy of itself. To our knowledge this is the earliest example of a program that achieves cooperation in equilibrium while relying on semantic analysis of the opponent program.
Andrew Critch (2019): A Parametric, Resource-Bounded Generalization of Löb's Theorem, and a Robust Cooperation Criterion for Open-Source Game Theory, Journal of Symbolic Logic 84(4), pages 1368–1381. Subtly but importantly refines Barasz et al.'s approach.
Caspar Oesterheld (2019): Robust program equilibrium, Theory and Decision, 86(1), pages 143–159. Studies programs called εGroundedFairBots, which are programs of the form, "with ε probability cooperate; with the remaining probability simulate the opponent and copy their action".
Andrew Critch, Michael Dennis, Stuart Russell (2022): Cooperative and uncooperative institution designs: Surprises and problems in open-source game theory.
Caspar Oesterheld (2022): A Note on the Compatibility of Different Robust Program Equilibria of the Prisoner's Dilemma.
Caspar Oesterheld, Johannes Treutlein, Roger Grosse, Vincent Conitzer, Jakob Foerster (2022): Similarity-based Cooperative Equilibrium.
Anthony DiGiovanni and Jesse Clifton (2023): Commitment Games with Conditional Information Disclosure. AAAI 2023.
Emery Cooper, Caspar Oesterheld, Vincent Conitzer (2025): Characterising Simulation-Based Program Equilibria. AAAI 2025. This paper extends Oesterheld's ε-grounded FairBots to achieve a folk theorem when a correlated random signal is available. It also (partially) characterizes what kinds of payoffs can be achieved in simulation-based program equilibria when no such shared randomness is available.

Some further writings, not in the form of papers, incomplete:

Alex Mennen ran an open-source Prisoner's Dilemma tournanment in 2013. [Announcement and description; results]
Caspar Oesterheld (2018): Testing εGroundedFairBot in a Transparent Prisoner's Dilemma Tournament.
Caspar Oesterheld (2023): Results of the CAIF summer school open prompt prisoner's dilemma tournament.

Program equilibrium can be seen as a form of bilateral (more generally multilateral), simultaneous, mutually conditional commitment (which contrasts with, for example, the widely studied Stackelberg games in which one player commits and the other responds). There are various other models of bilateral commitment, some of which avoid some of the interesting questions posed by program games ("how should and can the programs reason about each other, given logical paradoxes, circularities, multplicity of fixed points, etc.?"). Still, they all follow the structure of the figure below.
The game structure underlying program games and other formalisms of bilateral commitment.

The game structure underlying program games and other formalisms of bilateral commitment.

Here's a presumably incomplete list:

Logical formulas as a language of commitment. Instead of specifying one's commitment as a computer program, one might specify one's commitment in the form of a logical formula (e.g., in Peano arithmetic). This approach differs relatively little from the computer program-based approach, because some of the most natural formulas to use can also be written as computer programs. Arguably, the provability logic / Löb's theorem-based approaches listed above could be listed here instead. (The current classification mostly follows the papers' framings.) One difference is that some logical formulas allow for ambiguity of outcomes. For example, in logical-formula-based settings, the commitment, "I do whatever you do", may be permissible, whereas it is not permissible in a program-based setting.
- Peters and Szentes (2012): Definable and Contractible Contracts, Econometrica, 80(1), pp. 363–411. Their equilibrium construction is of the cooperate-against-copies variety. They consider a Bayesian setting, specifically a setting in which the principals have private information that they may reveal as part of the program or contract they submit.
- van der Hoek, Witteveen, Wooldridge (2013): Program equilibrium–a program reasoning approach. International Journal of Game Theory, 42, pp. 639–671.
- Arbital article on "modal combat".
- Andrew Critch (2023): Modal Fixpoint Cooperation without Löb's Theorem.
Preferences as a language of commitment. Consider a setting in which Alice and Bob hire representatives Charlie and Dan, respectively, to make decision on their behalfs. They incentivize their representative by giving them a utility function (for example, in the form of committing to pay them depending on the outcomes they achieve). Assume further that Charlie can see Bob's incentive contract for Dan, and Dan can see Alice's incentive contract for Charlie. In general it will be in Charlie's interest to consider Dan's incentives. Thus, delegating to utility-maximizing representatives induces a similar bilateral commitment setting as program equilibrium. The main difference is that preferences (utility functions) (rather than computer programs) are used as a language to specify the mutually conditional behavior.
- Werner Raub (1990): A General Game-Theoretic Model of Preference Adaptations in Problematic Social Situations, Rationality and Society, 2(1), pp. 67–93.
- Oesterheld and Conitzer (2022): Safe Pareto improvements for delegated game playing, Autonomous Agents and Multi-Agent Systems, 36(46). (See Appendix B for a comparison to Raub (1990).) [See here for an overview of work in this area.]
- Fershtman and Judd (1987): Equilibrium Incentives in Oligopoly, The American Economic Review, 77(5), pp. 927–940. Note that this paper has about 2000 citations (as of November 15, 2023) according to Google Scholar.
Specifying mutual dependence in action space. A strategy in a program equilibrium setting typically takes an opponent program as input and outputs an action. Instead, some authors have considered cases in which a strategy directly depends on the opponent's actions. A further mechanism (e.g., a fixed point finder) is then needed to resolve this mutual dependence on one another's actions.
- Bastianello and Ismail: Conditional strategy equilibrium.
Bilateral commitment without a commitment language. The literature on program equilibrium typically conisders a setting in which commitments that condition on other player's commitment are specified in some general-purpose language, e.g., as computer programs or logical formulas. But we could also assume that commitments are simply objects of some set for each players and that a commitment game specifies how to resolve any given combination of choices of these objects. This approach is followed by the following papers:
- Kalai, Kalai, Lehrer and Samet (2010): A commitment folk theorem, Games and Economic Behavior, 69, pp. 127–137.
- Dov Monderer and Moshe Tennenholtz (2006): Strong Mediated Equilibrium. AAAI 2006. Extended version published in 2009 in Artificial Intelligence.
- Arguably the following paper can be seen as an instance of this: Michael L. Katz (2006): Observable Contracts as Commitments: Interdependent Contracts and Moral Hazard, Journal of Economics & Management Strategy, 15(3), pp. 685–706.
- Françoise Forges (2013): A folk theorem for Bayesian games with commitment. Games and Economic Behavior, 78, pp. 64–71.

Acknowledgments: I'd like to thank John Mori for bringing some of the work in the economics literature to my attention (Peters and Szentes 2012 and references therein).