Scientists are accustomed to seeing data fall into perfect straight lines when they come from hard sciences like physics, such as the volume of a gas plotted against its temperature. But not in their wildest dreams do they expect the messy data from history to be so well behaved. The data we are looking at come from a ragbag of deadly quarrels ranging from the greatest cataclysm in the history of humanity to a coup d'état in a banana republic, and from the dawn of the Industrial Revolution to the dawn of the computer age. The jaw drops when seeing this mélange of data fall onto a perfect diagonal.
查看中文翻译
Richardson made a second major discovery about the statistics of deadly quarrels. It emerged when he counted the number of quarrels of each magnitude -- how many with death tolls in the thousands, how many in the tens of thousands, how many in the hundreds of thousands, and so on. It isn't a complete surprise that there were lots of little wars and only a few big ones. What was a surprise was how neat the relationship turned out to be. When Richardson plotted the log of the number of quarrels of each magnitude against the log of the number of deaths per quarrel (that is, the magnitude itself), he ended up with a graph like figure 5-7.
查看中文翻译
Piles of data in which the log of the frequency of a certain kind of entity is proportional to the log of the size of that entity, so that a plot on log-log paper looks like a straight line, are called power-law distributions. The name comes from the fact that when you put away the logarithms and go back to the original numbers, the probability of an entity showing up in the data is proportional to the size of that entity raised to some power (which translates visually to the slope of the line in the log-log plot), plus a constant. In this case the power is-1.5, which means that with every tenfold jump in the death toll of a war, you can expect to find about a third as many of them. Richardson plotted murders (quarrels of magnitude 0) on the same graph as wars, noting that qualitatively they follow the overall pattern: they are much, much less damaging than the smallest wars and much, much more frequent. But as you can see from their lonely perch atop the vertical axis, high above the point where an extrapolation of the line for the wars would hit it, he was pushing his luck when he said that all deadly quarrels fell along a single continuum. Richardson gamely connected the murder point to the war line with a swoopy curve so that he could interpolate the numbers of quarrels with death tolls in the single digits, the tens, and the hundreds, which are missing from the historical record. (These are the skirmishes beneath the military horizon that fall in the crack between criminology and history.) But for now let's ignore the murders and skirmishes and concentrate on the wars.
查看中文翻译
FIGURE 5-7: Number of deadly quarrels of different magnitudes, 1820-1952
查看中文翻译
Source: Graph adapted from Weiss, 1963, p. 103, based on data from Richardson, 1960, p. 149. The range 1820-1952 refers to the year a war ended.
查看中文翻译
Scientists are intrigued by power-law distributions for two reasons. One is that the distribution keeps turning up in measurements of things that you would think have nothing in common. One of the first power-law distributions was discovered in the 1930s by the linguist G. K. Zipf when he plotted the frequencies of words in the English language. If you count up the instances of each of the words in a large corpus of text, you'll find around a dozen that occur extremely frequently, that is, in more than 1 percent of all word tokens, including the (7 percent), be (4 percent), of (4 percent), and (3 percent), and a (2 percent). Around three thousand occur in the medium-frequency range centered on 1 in 10,000, such as confidence, junior, and afraid. Tens of thousands occur once every million words, including embitter, memorialize, and titular. And hundreds of thousands have frequencies far less than one in a million, like kankedort, apotropaic, and deliquesce.
查看中文翻译
Could Richardson just have been lucky with his sample? Fifty years later the political scientist Lars-Erik Cederman plotted a newer set of numbers in a major dataset of battle deaths from the Correlates of War Project, comprising ninety-seven interstate wars between 1820 and 1997 (figure 5-8). They too fall along a straight line in log-log coordinates. (Cederman plotted the data in a slightly different way, but that doesn't matter for our purposes.)
查看中文翻译
FIGURE 5-8: Probabilities of wars of different magnitudes, 1820-1997
查看中文翻译
The second remarkable thing about power-law distributions is that they look the same over a vast range of values. To understand why this is so striking, let's compare power-law distributions to a more familiar distribution called the normal, Gaussian, or bell curve. With measurements like the heights of men or the speeds of cars on a freeway, most of the numbers pile up around an average, and they tail off in both directions, falling into a curve that looks like a bell. Figure 5-9 shows one for the heights of American males. There are lots of men around 5'10" tall, fewer who are 5'6" or 6'2", not that many who are 5'0" or 6'8", and no one who is shorter than 1'11" or taller than 8'11" (the two extremes in The Guinness Book of World Records). The ratio of the tallest man in the world to the shortest man in the world is 4.8, and you can bet that you will never meet a man who is 20 feet tall.
查看中文翻译
Source: Graph from Cederman, 2003, p. 136.
查看中文翻译
Another example of a power-law distribution was discovered in 1906 by the economist Vilfredo Pareto when he looked at the distribution of incomes in Italy: a handful of people were filthy rich, while a much larger number were dirt-poor. Since these discoveries, power-law distributions have also turned up, among other places, in the populations of cities, the commonness of names, the popularity of Web sites, the number of citations of scientific papers, the sales figures of books and musical recordings, the number of species in biological taxa, and the sizes of moon craters.
查看中文翻译
Source: Graph from Newman, 2005, p. 324.
查看中文翻译
But with other kinds of entities, the measurements don't heap up around a typical value, don't fall off symmetrically in both directions, and don't fit within a cozy range. The sizes of towns and cities is a good example. It's hard to answer the question "How big is a typical American municipality?" New York has 8 million people; the smallest municipality that counts as a "town," according to Guinness, is Duffield, Virginia, with only 52. The ratio of the largest municipality to the smallest is 150,000, which is very different from the less-than-fivefold variation in the heights of men.
查看中文翻译
FIGURE 5-9: Heights of males (a normal or bell-curve distribution)
查看中文翻译
Also, the distribution of sizes of municipalities isn't curved like a bell. As the black line in figure 5-10 shows, it is L-shaped, with a tall spine on the left and a long tail on the right. In this graph, city populations are laid out along a conventional linear scale on the black horizontal axis: cities of 100,000, cities of 200,000, and so on. So are the proportions of cities of each population size on the black vertical axis: three-thousandths (3/1000, or 0.003) of a percent of American municipalities have a population of exactly 20,000, two thousandths of a percent have a population of 30,000, one thousandth of a percent have a population of 40,000, and so on, with smaller and smaller proportions having larger and larger populations. Now the gray axes at the top and the right of the graph stretch out these same numbers on a logarithmic scale, in which orders of magnitude (the number of zeroes) are evenly spaced, rather than the values themselves. The tick marks for population sizes are at ten thousand, a hundred thousand, a million, and so on. Likewise the proportions of cities at each population size are arranged along equal order-of-magnitude tick marks: one onehundredth (1/100, or 0.01) of a percent, one one-thousandth (1/1,000, or 0.001) of a percent, one ten-thousandth, and so on. When the axes are stretched out like this, something interesting happens to the distribution: the L straightens out into a nice line. And that is the signature of a power-law distribution.
查看中文翻译
Source: Graph adapted from Newman, 2005, p. 324.
查看中文翻译
FIGURE 5-10: Populations of cities (a power-law distribution), plotted on linear and log scales
查看中文翻译
Which brings us back to wars. Since wars fall into a power-law distribution, some of the mathematical properties of these distributions may help us understand the nature of wars and the mechanisms that give rise to them. For starters, power-law distributions with the exponent we see for wars do not even have a finite mean. There is no such thing as a "typical war." We should not expect, even on average, that a war will proceed until the casualties pile up to an expected level and then will naturally wind itself down.
查看中文翻译
Also, power-law distributions are scale-free. As you slide up or down the line in the log-log graph, it always looks the same, namely, like a line. The mathematical implication is that as you magnify or shrink the units you are looking at, the distribution looks the same. Suppose that computer files of 2 kilobytes are a quarter as common as files of 1 kilobyte. Then if we stand back and look at files in higher ranges, we find the same thing: files of 2 megabytes are a quarter as common as files of 1 megabyte, and files of 2 terabytes are a quarter as common as files of 1 terabyte. In the case of wars, you can think of it this way. What are the odds of going from a small war, say, with 1,000 deaths, to a medium-size war, with 10,000 deaths? It's the same as the odds of going from a medium-size war of 10,000 deaths to a large war of 100,000 deaths, or from a large war of 100,000 deaths to a historically large war of 1 million deaths, or from a historic war to a world war.
查看中文翻译
Finally, power-law distributions have "thick tails," meaning that they have a nonnegligible number of extreme values. You will never meet a 20-foot man, or see a car driving down the freeway at 500 miles per hour. But you could conceivably come across a city of 14 million, or a book that was on the bestseller list for 10 years, or a moon crater big enough to see from the earth with the naked eye -- or a war that killed 55 million people.
查看中文翻译
The thick tail of a power-law distribution, which declines gradually rather than precipitously as you rocket up the magnitude scale, means that extreme values are extremely unlikely but not astronomically unlikely. It's an important difference. The chances of meeting a 20-foot-tall man are astronomically unlikely; you can bet your life it will never happen. But the chances that a city will grow to 20 million, or that a book will stay on the bestseller list for 20 years, is merely extremely unlikely -- it probably won't happen, but you could well imagine it happening. I hardly need to point out the implications for war. It is extremely unlikely that the world will see a war that will kill 100 million people, and less likely still that it will have one that will kill a billion. But in an age of nuclear weapons, our terrified imaginations and the mathematics of power-law distributions agree: it is not astronomically unlikely.
查看中文翻译
So far I've been discussing the causes of war as Platonic abstractions, as if armies were sent into war by equations. What we really need to understand is why wars distribute themselves as power laws; that is, what combination of psychology and politics and technology could generate this pattern. At present we can't be sure of the answer. Too many kinds of mechanisms can give rise to power-law distributions, and the data on wars are not precise enough to tell us which is at work.
查看中文翻译
Still, the scale-free nature of the distribution of deadly quarrels gives us an insight about the drivers of war. Intuitively, it suggests that size doesn't matter. The same psychological or game-theoretic dynamics that govern whether quarreling coalitions will threaten, back down, bluff, engage, escalate, fight on, or surrender apply whether the coalitions are street gangs, militias, or armies of great powers. Presumably this is because humans are social animals who aggregate into coalitions, which amalgamate into larger coalitions, and so on. Yet at any scale these coalitions may be sent into battle by a single clique or individual, be it a gang leader, capo, warlord, king, or emperor.
查看中文翻译
How can the intuition that size doesn't matter be implemented in models of armed conflict that actually generate power-law distributions? The simplest is to assume that the coalitions themselves are power-law-distributed in size, that they fight each other in proportion to their numbers, and that they suffer losses in proportion to their sizes. We know that some human aggregations, namely municipalities, are power-law-distributed, and we know the reason. One of the commonest generators of a power-law distribution is preferential attachment: the bigger something is, the more new members it attracts. Preferential attachment is also known as accumulated advantage, the-rich-get-richer, and the Matthew Effect, after the passage in Matthew 25:29 that Billie Holiday summarized as "Them that's got shall get, them that's not shall lose." Web sites that are popular attract more visitors, making them even more popular; bestselling books are put on bestseller lists, which lure more people into buying them; and cities with lots of people offer more professional and cultural opportunities so more people flock to them. (How are you going to keep them down on the farm after they've seen Paree?)
查看中文翻译
Richardson considered this simple explanation but found that the numbers didn't add up. If deadly quarrels reflected city sizes, then for every tenfold reduction in the size of a quarrel, there should be ten times as many of them, but in fact there are fewer than four times as many. Also, in recent centuries wars have been fought by states, not cities, and states follow a log-normal distribution (a warped bell curve) rather than a power law.
查看中文翻译
Another kind of mechanism has been suggested by the science of complex systems, which looks for laws that govern structures that are organized into similar patterns despite being made of different stuff. Many complexity theorists are intrigued by systems that display a pattern called self-organized criticality. You can think of "criticality" as the straw that broke the camel's back: a small input causes a sudden large output. "Self-organized" criticality would be a camel whose back healed right back to the exact strength at which straws of various sizes could break it again. A good example is a trickle of sand falling onto a sandpile, which periodically causes landslides of different sizes; the landslides are distributed according to a power law. An avalanche of sand stops at a point where the slope is just shallow enough to be stable, but the new sand trickling onto it steepens the slope and sets off a new avalanche. Earthquakes and forest fires are other examples. A fire burns a forest, which allows trees to grow back at random, forming clusters that can grow into each other and fuel another fire. Several political scientists have developed computer simulations that model wars on an analogy to forest fires. In these models, countries conquer their neighbors and create larger countries in the same way that patches of trees grow into each other and create larger patches. Just as a cigarette tossed in a forest can set off either a brushfire or a conflagration, a destabilizing event in the simulation of states can set off either a skirmish or a world war.
查看中文翻译
In these simulations, the destructiveness of a war depends mainly on the territorial size of the combatants and their alliances. But in the real world, variations in destructiveness also depend on the resolve of the two parties to keep a war going, with each hoping that the other will collapse first. Some of the bloodiest conflicts in modern history, such as the American Civil War, World War I, the Vietnam War, and the Iran-Iraq War, were wars of attrition, where both sides kept shoveling men and matériel into the maw of the war machine hoping that the other side would exhaust itself first.
查看中文翻译
John Maynard Smith, the biologist who first applied game theory to evolution, modeled this kind of standoff as a War of Attrition game. Each of two contestants competes for a valuable resource by trying to outlast the other, steadily accumulating costs as he waits. In the original scenario, they might be heavily armored animals competing for a territory who stare at each other until one of them leaves; the costs are the time and energy the animals waste in the standoff, which they could otherwise use in catching food or pursuing mates. A game of attrition is mathematically equivalent to an auction in which the highest bidder wins the prize and both sides have to pay the loser's low bid. And of course it can be analogized to a war in which the expenditure is reckoned in the lives of soldiers.
查看中文翻译
The War of Attrition is one of those paradoxical scenarios in game theory (like the Prisoner's Dilemma, the Tragedy of the Commons, and the Dollar Auction) in which a set of rational actors pursuing their interests end up worse off than if they had put their heads together and come to a collective and binding agreement. One might think that in an attrition game each side should do what bidders on eBay are advised to do: decide how much the contested resource is worth and bid only up to that limit. The problem is that this strategy can be gamed by another bidder. All he has to do is bid one more dollar (or wait just a bit longer, or commit another surge of soldiers), and he wins. He gets the prize for close to the amount you think it is worth, while you have to forfeit that amount too, without getting anything in return. You would be crazy to let that happen, so you are tempted to use the strategy "Always outbid him by a dollar," which he is tempted to adopt as well. You can see where this leads. Thanks to the perverse logic of an attrition game, in which the loser pays too, the bidders may keep bidding after the point at which the expenditure exceeds the value of the prize. They can no longer win, but each side hopes not to lose as much. The technical term for this outcome in game theory is "a ruinous situation." It is also called a "Pyrrhic victory"; the military analogy is profound.
查看中文翻译
One strategy that can evolve in a War of Attrition game (where the expenditure, recall, is in time) is for each player to wait a random amount of time, with an average wait time that is equivalent in value to what the resource is worth to them. In the long run, each player gets good value for his expenditure, but because the waiting times are random, neither is able to predict the surrender time of the other and reliably outlast him. In other words, they follow the rule: At every instant throw a pair of dice, and if they come up (say) 4, concede; if not, throw them again. This is, of course, like a Poisson process, and by now you know that it leads to an exponential distribution of wait times (since a longer and longer wait depends on a less and less probable run of tosses). Since the contest ends when the first side throws in the towel, the contest durations will also be exponentially distributed. Returning to our model where the expenditures are in soldiers rather than seconds, if real wars of attrition were like the "War of Attrition" modeled in game theory, and if all else were equal, then wars of attrition would fall into an exponential distribution of magnitudes.
查看中文翻译
Of course, real wars fall into a power-law distribution, which has a thicker tail than an exponential (in this case, a greater number of severe wars). But an exponential can be transformed into a power law if the values are modulated by a second exponential process pushing in the opposite direction. And attrition games have a twist that might do just that. If one side in an attrition game were to leak its intention to concede in the next instant by, say, twitching or blanching or showing some other sign of nervousness, its opponent could capitalize on the "tell" by waiting just a bit longer, and it would win the prize every time. As Richard Dawkins has put it, in a species that often takes part in wars of attrition, one expects the evolution of a poker face.
查看中文翻译
Now, one also might have guessed that organisms would capitalize on the opposite kind of signal, a sign of continuing resolve rather than impending surrender. If a contestant could adopt some defiant posture that means "I'll stand my ground; I won't back down," that would make it rational for his opposite number to give up and cut its losses rather than escalate to mutual ruin. But there's a reason we call it "posturing." Any coward can cross his arms and glower, but the other side can simply call his bluff. Only if a signal is costly -- if the defiant party holds his hand over a candle, or cuts his arm with a knife -- can he show that he means business. (Of course, paying a self-imposed cost would be worthwhile only if the prize is especially valuable to him, or if he had reason to believe that he could prevail over his opponent if the contest escalated.)
查看中文翻译
In the case of a war of attrition, one can imagine a leader who has a changing willingness to suffer a cost over time, increasing as the conflict proceeds and his resolve toughens. His motto would be: "We fight on so that our boys shall not have died in vain." This mindset, known as loss aversion, the sunk-cost fallacy, and throwing good money after bad, is patently irrational, but it is surprisingly pervasive in human decision-making. People stay in an abusive marriage because of the years they have already put into it, or sit through a bad movie because they have already paid for the ticket, or try to reverse a gambling loss by doubling their next bet, or pour money into a boondoggle because they've already poured so much money into it. Though psychologists don't fully understand why people are suckers for sunk costs, a common explanation is that it signals a public commitment. The person is announcing: "When I make a decision, I'm not so weak, stupid, or indecisive that I can be easily talked out of it." In a contest of resolve like an attrition game, loss aversion could serve as a costly and hence credible signal that the contestant is not about to concede, preempting his opponent's strategy of outlasting him just one more round.
查看中文翻译
I already mentioned some evidence from Richardson's dataset which suggests that combatants do fight longer when a war is more lethal: small wars show a higher probability of coming to an end with each succeeding year than do large wars. The magnitude numbers in the Correlates of War Dataset also show signs of escalating commitment: wars that are longer in duration are not just costlier in fatalities; they are costlier than one would expect from their durations alone. If we pop back from the statistics of war to the conduct of actual wars, we can see the mechanism at work. Many of the bloodiest wars in history owe their destructiveness to leaders on one or both sides pursuing a blatantly irrational loss-aversion strategy. Hitler fought the last months of World War II with a maniacal fury well past the point when defeat was all but certain, as did Japan. Lyndon Johnson's repeated escalations of the Vietnam War inspired a protest song that has served as a summary of people's understanding of that destructive war: "We were waist-deep in the Big Muddy; The big fool said to push on."
查看中文翻译
The systems biologist Jean-Baptiste Michel has pointed out to me how escalating commitments in a war of attrition could produce a power-law distribution. All we need to assume is that leaders keep escalating as a constant proportion of their past commitment -- the size of each surge is, say, 10 percent of the number of soldiers that have fought so far. A constant proportional increase would be consistent with the well-known discovery in psychology called Weber's Law: for an increase in intensity to be noticeable, it must be a constant proportion of the existing intensity. (If a room is illuminated by ten lightbulbs, you'll notice a brightening when an eleventh is switched on, but if it is illuminated by a hundred lightbulbs, you won't notice the hundred and first; someone would have to switch on another ten bulbs before you noticed the brightening.) Richardson observed that people perceive lost lives in the same way: "Contrast for example the many days of newspaper-sympathy over the loss of the British submarine Thetis in time of peace with the terse announcement of similar losses during the war. This contrast may be regarded as an example of the Weber-Fechner doctrine that an increment is judged relative to the previous amount." The psychologist Paul Slovic has recently reviewed several experiments that support this observation. The quotation falsely attributed to Stalin, "One death is a tragedy; a million deaths is a statistic," gets the numbers wrong but captures a real fact about human psychology.
查看中文翻译
If escalations are proportional to past commitments (and a constant proportion of soldiers sent to the battlefield are killed in battle), then losses will increase exponentially as a war drags on, like compound interest. And if wars are attrition games, their durations will also be distributed exponentially. Recall the mathematical law that a variable will fall into a power-law distribution if it is an exponential function of a second variable that is distributed exponentially. My own guess is that the combination of escalation and attrition is the best explanation for the power-law distribution of war magnitudes.
查看中文翻译
Though we may not know exactly why wars fall into a power-law distribution, the nature of that distribution -- scale-free, thick-tailed -- suggests that it involves a set of underlying processes in which size doesn't matter. Armed coalitions can always get a bit larger, wars can always last a bit longer, and losses can always get a bit heavier, with the same likelihood regardless of how large, long, or heavy they were to start with.
查看中文翻译
The next obvious question about the statistics of deadly quarrels is: What destroys more lives, the large number of small wars or the few big ones? A power-law distribution itself doesn't give the answer. One can imagine a dataset in which the aggregate damage from the wars of each size adds up to the same number of deaths: one war with ten million deaths, ten wars with a million deaths, a hundred wars with a hundred thousand deaths, all the way down to ten million murders with one death apiece. But in fact, distributions with exponents greater than one (which is what we get for wars) will have the numbers skewed toward the tail. A power-law distribution with an exponent in this range is sometimes said to follow the 80:20 rule, also known as the Pareto Principle, in which, say, the richest 20 percent of the population controls 80 percent of the wealth. The ratio may not be 80:20 exactly, but many power-law distributions have this kind of lopsidedness. For example, the 20 percent most popular Web sites get around two-thirds of the hits.
查看中文翻译
Richardson added up the total number of deaths from all the deadly quarrels in each magnitude range. The computer scientist Brian Hayes has plotted them in the histogram in figure 5-11. The gray bars, which tally the deaths from the elusive small quarrels (between 3 and 3,162 deaths), don't represent actual data, because they fall in the criminology-history crack and were not available in the sources Richardson consulted. Instead, they show hypothetical numbers that Richardson interpolated with a smooth curve between the murders and the smaller wars. With or without them, the shape of the graph is striking: it has peaks at each end and a sag in the middle. That tells us that the most damaging kinds of lethal violence (at least from 1820 to 1952) were murders and world wars; all the other kinds of quarrels killed far fewer people. That has remained true in the sixty years since. In the United States, 37,000 military personnel died in the Korean War, and 58,000 died in Vietnam; no other war came close. Yet an average of 17,000 people are murdered in the country every year, adding up to almost a million deaths since 1950. Likewise, in the world as a whole, homicides outnumber war-related deaths, even if one includes the indirect deaths from hunger and disease.
查看中文翻译
Source: Graph from Hayes, 2002, based on data in Richardson, 1960.
查看中文翻译
FIGURE 5-11: Total deaths from quarrels of different magnitudes
查看中文翻译
Richardson also estimated the proportion of deaths that were caused by deadly quarrels of all magnitudes combined, from murders to world wars. The answer was 1.6 percent. He notes: "This is less than one might have guessed from the large amount of attention which quarrels attract. Those who enjoy wars can excuse their taste by saying that wars after all are much less deadly than disease." Again, this continues to be true by a large margin.
查看中文翻译
That the two world wars killed 77 percent of the people who died in all the wars that took place in a 130-year period is an extraordinary discovery. Wars don't even follow the 80:20 rule that we are accustomed to seeing in power-law distributions. They follow an 80:2 rule: almost 80 percent of the deaths were caused by 2 percent of the wars. The lopsided ratio tells us that the global effort to prevent deaths in war should give the highest priority to preventing the largest wars.
查看中文翻译
The ratio also underscores the difficulty of reconciling our desire for a coherent historical narrative with the statistics of deadly quarrels. In making sense of the 20th century, our desire for a good story arc is amplified by two statistical illusions. One is the tendency to see meaningful clusters in randomly spaced events. Another is the bell-curve mindset that makes extreme values seem astronomically unlikely, so when we come across an extreme event, we reason there must have been extraordinary design behind it. That mindset makes it difficult to accept that the worst two events in recent history, though unlikely, were not astronomically unlikely. Even if the odds had been increased by the tensions of the times, the wars did not have to start. And once they did, they had a constant chance of escalating to greater deadliness, no matter how deadly they already were. The two world wars were, in a sense, horrifically unlucky samples from a statistical distribution that stretches across a vast range of destruction.
查看中文翻译