The Prisoner's Dilemma
The Prisoner's Dilemma is the most studied model in game theory. Two players simultaneously choose to cooperate (C) or defect (D). If both cooperate, they each receive a reward R=3. If both defect, they each get a punishment P=1. But if one defects while the other cooperates, the defector gets the temptation payoff T=5 while the cooperator gets the sucker's payoff S=0. The dilemma: rational self-interest drives both to defect, even though mutual cooperation is better for both.
Axelrod's Tournament
In 1980, political scientist Robert Axelrod invited game theorists to submit strategies for a computer tournament of the iterated Prisoner's Dilemma. The winner was Tit-for-Tat (TFT), submitted by Anatol Rapoport — the simplest strategy entered. TFT cooperates on the first move, then copies whatever the opponent did on the previous move. It won not by beating any single opponent, but by accumulating high scores against a wide range of strategies.
Why Reciprocity Works
TFT embodies four principles that Axelrod identified as keys to success: niceness (never be the first to defect), retaliation (respond to defection immediately), forgiveness (return to cooperation if the opponent does), and clarity (be predictable enough that opponents can learn to cooperate with you). These principles have profound implications far beyond game theory — they illuminate the evolution of cooperation in biology, diplomacy, and everyday social interaction.
The Role of Noise
In real-world interactions, signals are imperfect. The 'noise' parameter models accidental defections or misunderstood cooperations. Under noise, strict TFT can get trapped in cycles of mutual retaliation triggered by a single error. This led researchers like Nowak and Sigmund to discover strategies like Win-Stay Lose-Shift (Pavlov) that are more robust to noise. Increase the noise slider to see how error degrades cooperative strategies.
Interpreting the Simulation
Adjust the population shares to see how different ecological compositions change outcomes. In a world dominated by defectors, even TFT struggles. But when enough cooperative or reciprocal strategies are present, they form clusters of mutual cooperation that outperform pure defection. This is the essence of Axelrod's insight: cooperation can evolve and sustain itself even among self-interested agents, given sufficient repetition and the possibility of reciprocity.