Friday, October 1, 2010
The Prisoner's Dilemma by Carl Sagan
"A New Way To Think About Rules To Live By"
by Carl Sagan, Parade magazine, 28 Nov 1993
Moral codes that seek to regulate human behavior have been with us not only since the dawn of civilization but also among our pre-civilized, and highly social, hunter-gatherer ancestors. And even earlier. Different societies have different codes. Many cultures say one thing and do another. In a few fortunate societies, an inspired lawgiver lays down a set of rules to live by. But many revered codes have failed to establish a long-lived moral order. For example, the codes of Ashoka (India), Hammurabi (Babylon), Lycurgus (Sparta) and Solon (Athens), which once held sway over mighty civilizations, are today largely defunct. Perhaps they misjudged human nature and asked too much of us. Perhaps experience from one epoch or culture is not wholly applicable to another.
In this article, I describe an early effort - tentative but emerging - to approach the matter scientifically.
In our every day lives, as in the momentous affairs of nations, we must decide: What does it mean to do the right thing? How do we deal with an enemy? Should we ever take advantage of someone who treats us kindly? If hurt by a friend, or helped by an enemy, should we reciprocate in kind?
Examples are all around us: Your sister-in-law ignores your snub and invites you over for Christmas dinner. Should you accept? A co-worker makes you look bad in front of the boss. Should you try to get even? Should you cheat at cards? On a larger scale: Should we kill killers? If a power company supports a symphony orchestra, ought we to ignore its destructive, although legal, pollution of the environment? Shattering a worldwide voluntary moratorium, China resumes its testing of nuclear weapons. Should we?
In making such decisions, we're concerned not only with doing right but also with what works - what makes us and the rest of society happier and more secure. There's a tension between what we call ethical and what we call pragmatic. If, even in the long run, ethical behavior were self-defeating, we would not call it ethical, but foolish. (We might even claim to respect it but in practice ignore it.) Bearing in mind the variety and complexity of human behavior, are there any simple rules - whether we call them ethical or pragmatic - that actually work? Let's look at some of the rules we're taught:
THE GOLDEN RULE. The most admired standard of behavior in the West is the Golden Rule. Its formulation in the first-century Gospel of St. Matthew is: "Do unto others as you would have them do unto you." Almost no one follows it consistently. When the Chinese philosopher K'ung-Tzu (known as Confucius in the West) was asked in the sixth century B.C. his opinion of the Golden Rule - of repaying evil with kindness - he replied, "Then with what will you repay kindness?"
THE SILVER RULE. The Silver Rule is different: "Do not do unto others what you would not have them do unto you." The most inspiring 20th-century exemplars of the Silver Rule are Mohandas Gandhi and Dr. Martin Luther King Jr. They counseled oppressed peoples not to repay violence with violence, but not to be compliant and obedient either. Non-violent civil disobedience was what they advocated - putting your body on the line and showing, by your willingness to be punished in defying an unjust law, the justice of your cause. They aimed at melting the hearts of their oppressors. It worked, up to a point. But even Gandhi had trouble reconciling the rule of nonviolence with the necessities of defense against those with less lofty rules of conduct.
THE BRAZEN RULE. "Repay kindness with kindness," said Confucius, describing relations between individuals, "but evil with justice." This might be called the Bronze or Brazen Rule: "Do unto others as they do unto you." It's "an eye for an eye, and a tooth for a tooth," plus "one good turn deserves another." In actual human (and chimpanzee) behavior, it's a familiar standard. Without having to appeal to anyone's better nature, we institute a kind of operant conditioning, rewarding others when they're nice to us and punishing them when they're not. We're not pushovers, but we're not unforgiving either.
THE IRON RULE... AND OTHERS. Of baser coinage is the Iron Rule: "Do unto others as you like, before they do it unto you." It's sometimes formulated as, "He who has the gold makes the rules," underscoring not just its rejection of, but also its contempt for, the Golden Rule. This is the secret maxim of many, if they can get away with it, and often the unspoken precept of the powerful.
Finally, I should mention two mixed rules, found throughout the living world. They explain a great deal. One is: "Suck up to those above you, and intimidate those below." This is the motto of bullies. It's really the Golden Rule for superiors, the Iron Rule for inferiors. Since there is no known alloy of gold and iron, we'll call it the Tin Rule for its flexibility. The other common rule is: "Give precedence in all things to close relatives, and do as you like to others" - the Golden Rule for relatives, the Iron Rule for others. This Nepotism Rule is known to evolutionary biologists as "kin selection."
Despite its apparent practicality, there's a fatal flaw in the Brazen Rule: unending vendetta. Each act of justifiable retribution triggers another. Violence begets violence. The reasonable part of us tries to keep the peace, but the passionate part of us cries out for vengeance. Extremists in the two warring factions can count on one another. They are allied against the rest of us, contemptuous of appeals to understanding and loving kindness. A few hotheads can force-march a legion of more prudent and rational people to brutality and war.
WHAT GAMES TEACH US. Clearly, the Brazen Rule is too unforgiving. But the Golden and Silver Rules seem too complacent. They systematically reward cruelty and exploitation. It is hard to imagine a Hitler or a Stalin being shamed into redemption by good example. The Iron Rule promotes the advantage of a ruthless and powerful few against the interest of the many. So is there a rule between the Golden and the Silver, on the one hand, and the Brazen and Iron, on the other, which works better than any of them?
Suppose we seek not to confirm or deny what we've been taught but to find out what really works. Is there a way to test alternative codes of ethics?
We're used to playing games in which somebody wins and somebody loses. Every point made by our opponent puts us that much farther behind. "Win-lose" games seem so natural that many people are hard-pressed to think of a game that isn't win-lose. In win-lose games, the losses just balance the wins - that's why they're also called "zero-sum" games.
Many children are appalled the first time they really come face to face with the "lose" side of win-lose games. On the verge of bankruptcy in the game Monopoly (tm), for example, they plead for special dispensation. When this is not forthcoming, they may, in tears, denounce the game as heartless and unfeeling - which, of course, it is. Within the rules of Monopoly, there's no way for players to cooperate so that all benefit. That's not how the game is designed. The same is true for boxing, football, hockey, basketball, baseball, lacrosse, tennis, racquetball, pinochle, chess, all Olympic events, yacht and car racing, potsy and partisan politics. There may be rewards for teamwork, but not for teamwork with the opponent. In none of these games is there an opportunity to practice the Golden or Silver Rule, or even the Brazen. There is room only for the Rule of Iron.
Nuclear war, however (and many conventional wars), economic depression and assaults on the global environment are all "lose-lose" propositions. Such vital human concerns as love, friendship, parenthood and the pursuit of knowledge are "win-win" propositions. Everyone gains from the creation of great music, art, architecture and literature, wise and just laws and, indeed, far-seeing moral codes. Our vision is dangerously narrow if all we know is "win-lose."
THE PRISONER'S DILEMMA. The scientific field that deals with such matters is called "game theory." It's used in military strategy, trade policy, corporate competition and the limiting of environmental pollution. The Defense Department has its own gaming agency. The paradigmatic game is the Prisoner's Dilemma. It is not zero-sum. Win-win, win-lose and lose-lose outcomes all are possible. It is wholly pragmatic and amoral:
Imagine that you and a friend are arrested for committing a serious crime. Before the two of you have any chance to compare stories or plan strategy, you are taken to separate interrogation cells. There, oblivious of your Miranda rights ("You have the right to remain silent..."), the police try to make you confess. They tell you, as police sometimes do, that your friend has confessed. The police might be telling the truth. Or they might be lying. If you're willing to say anything, what's your best tack to minimize punishment?
You're permitted only to plead guilty or not guilty; you cannot implicate or clear your friend. These are the possible outcomes:
* If you deny committing the crime, and (unknown to you) your friend also denies it, the case might be hard to prove. In the ensuing plea bargain, both your sentences will be very light.
* If you confess, and your friend does likewise, then the effort the State must expend to solve the crime is small. In exchange, you both will be given a fairly light sentence, although not so light as if you both had asserted your innocence.
* If you plead not guilty, and your friend confesses, the State will ask for a maximum sentence for you and minimal punishment (maybe none) for your friend. Uh-oh. You're very vulnerable to a kind of double cross. So's he.
So if you and your friend both plead innocent, you both escape the worst. But each must be sure of the other.
Should you play it safe and guarantee no worse than a middle range of punishment by confessing? Then, if your friend pleads innocent while you plead guilty - well, too bad for him, and you might get off scot-free.
When you think it through, you realize that, whatever your friend does, you're better off confessing. Maddeningly, the same holds true for your friend. But if both of you confess, you both are worse off than if both of you had pleaded innocent. This is the Prisoner's Dilemma.
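The logic of the dilemma can be laid out explicitly. Here is a minimal sketch in Python, using hypothetical sentence lengths (the specific numbers of years are my own illustration; only their ordering matches the outcomes described above):

```python
# Key: (your_plea, friend_plea) -> (your_sentence, friend_sentence), in years.
# The numbers are illustrative; only their relative ordering matters.
SENTENCES = {
    ("deny", "deny"):       (1, 1),    # both plead innocent: very light
    ("confess", "confess"): (3, 3),    # both confess: fairly light
    ("deny", "confess"):    (10, 0),   # you're double-crossed
    ("confess", "deny"):    (0, 10),   # you double-cross your friend
}

def best_reply(friend_plea):
    """Whatever your friend does, which plea minimizes YOUR sentence?"""
    return min(("deny", "confess"),
               key=lambda plea: SENTENCES[(plea, friend_plea)][0])

# Confessing is the best reply to either plea ("dominant" in game theory)...
assert best_reply("deny") == "confess"
assert best_reply("confess") == "confess"
# ...yet mutual confession leaves both worse off than mutual denial.
assert SENTENCES[("confess", "confess")][0] > SENTENCES[("deny", "deny")][0]
```

The assertions capture the maddening structure: each player's individually rational choice leads both to a jointly worse outcome.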
Robert Axelrod, a professor of political science at the University of Michigan, has pioneered the study of a repeated Prisoner's Dilemma in which the two players go through a sequence of such games with no direct communication between them. At the end of each, they figure out from their punishment how the other must have pleaded. They gain experience about each other's strategy (and character). Will they learn to "cooperate" game after game - both always denying that they committed any crime - even if the reward for finking on the other (or "defecting") is very large?
If you cooperate overmuch, the other player may exploit your good nature. If you defect overmuch, your friend is likely to retaliate often, which will be bad for both of you. What is the right mix of cooperation and defection? How to behave then becomes, like any other question in Nature, a subject to be investigated experimentally.
This matter has been explored by Axelrod in a continuing round-robin computer tournament. Various codes of behavior confront one another, and at the end we see who wins (who gets the lightest cumulative prison term). The simplest strategies might be to cooperate all the time, no matter how much advantage is taken of you; or never to cooperate, no matter what benefits might accrue from cooperation. Both the Golden Rule and the Iron Rule always lose - the one from an excess of kindness, the other from an overabundance of ruthlessness. Strategies that are slow to punish defection lose, in part because they send a signal that non-cooperation works.
A RULE THAT WORKS. The most effective strategy in many such tournaments is called "Tit-for-Tat." It's very simple: You start out cooperating and, in each subsequent round, simply do what your opponent did last time. You punish defections, but once the other player cooperates, you're willing to let bygones be bygones. At first it seems to garner only mediocre success. But as time goes on, the other strategies defeat themselves - from too much kindness or too much cruelty - and this middle way pulls ahead.
Except for always being nice on the first move, Tit-for-Tat is identical to the Brazen Rule. It promptly (in the very next game) rewards cooperation and punishes defection, and it has the great virtue that it makes its strategy absolutely clear.
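The strategy is simple enough to state in a few lines of code. Below is a sketch of an iterated match in the style of Axelrod's tournament, scored with the conventional payoffs from that literature (higher is better; the exact point values are a standard convention, not from this article):

```python
# Conventional Axelrod payoffs per round (higher is better):
# mutual cooperation 3 each, mutual defection 1 each,
# lone defector 5, exploited cooperator 0.
PAYOFF = {("C", "C"): (3, 3), ("D", "D"): (1, 1),
          ("C", "D"): (0, 5), ("D", "C"): (5, 0)}

def tit_for_tat(my_history, their_history):
    """Cooperate first; thereafter copy the opponent's last move."""
    return "C" if not their_history else their_history[-1]

def always_defect(my_history, their_history):
    """The Iron Rule: never cooperate."""
    return "D"

def play(strategy_a, strategy_b, rounds=20):
    """Run an iterated match and return the two cumulative scores."""
    hist_a, hist_b, score_a, score_b = [], [], 0, 0
    for _ in range(rounds):
        move_a = strategy_a(hist_a, hist_b)
        move_b = strategy_b(hist_b, hist_a)
        pa, pb = PAYOFF[(move_a, move_b)]
        score_a += pa
        score_b += pb
        hist_a.append(move_a)
        hist_b.append(move_b)
    return score_a, score_b
```

Paired with itself, Tit-for-Tat earns the mutual-cooperation score every round (60 points each over 20 rounds); against a pure defector it loses only the opening round and then matches defection with defection, so it is never badly exploited.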
To succeed, Tit-for-Tat strategists must find others who are willing to reciprocate - players with whom to cooperate. Once there get to be several players employing Tit-for-Tat, they rise in the standings together. After the first tournament, in which the Brazen Rule unexpectedly won, some experts thought it would pay to be less forgiving. Next tournament, they tried to exploit the Brazen Rule by defecting more often. They did poorly. Even experienced strategists tended to underestimate the power of forgiveness and reconciliation.
The superiority of the Brazen Rule in such tournaments was discovered by Axelrod and described in his remarkable book "The Evolution of Cooperation." A variant of Tit-for-Tat that forgives other players for defecting occasionally - say 10 percent of the time - does even better if there's any chance of misunderstanding. We might call it the Goldplated Brazen Rule. Among other virtues, it breaks out of unending vendetta.
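Why misunderstanding matters can be shown concretely. In the sketch below (my own illustration, not code from Axelrod), one player misreads a single cooperative move as a defection; two strict Tit-for-Tat players then echo the retaliation back and forth forever, while the forgiving variant has a chance each round of breaking the cycle:

```python
import random

def strict(seen):
    """Plain Tit-for-Tat: copy the opponent's last perceived move."""
    return seen[-1] if seen else "C"

def generous(seen, forgiveness=0.1):
    """The "Goldplated Brazen Rule": forgive a perceived defection
    10 percent of the time."""
    if not seen or seen[-1] == "C":
        return "C"
    return "C" if random.random() < forgiveness else "D"

# The vendetta: suppose A misreads B's opening cooperation as a defection.
a_sees, b_sees = ["D"], ["C"]
a_moves, b_moves = [], []
for _ in range(6):
    a, b = strict(a_sees), strict(b_sees)
    a_moves.append(a)
    b_moves.append(b)
    a_sees.append(b)
    b_sees.append(a)
# Strict players alternate retaliation forever:
# a_moves == ["D", "C", "D", "C", "D", "C"]
# b_moves == ["C", "D", "C", "D", "C", "D"]
```

A generous player escapes this trap: with 10 percent probability it cooperates even after seeing a defection, and once both sides perceive a "C" the pair settles back into mutual cooperation - breaking out of the unending vendetta, exactly the flaw identified in the plain Brazen Rule.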
The Prisoner's Dilemma is a very simple game. Real life is considerably more complex. But its central lessons are striking: Be friendly at first meetings. Do not envy. Be generous; forgive your enemy if he forgives you. Be neither a tyrant nor a patsy. Retaliate proportionately to an intentional injury (within the constraints of the rule of law). And make your behavior fairly (although not perfectly) clear and consistent. What would the world be like if more of us, individuals as well as nations, lived by these rules?