Statistical Tests Suggestive of Fraud in Iran's Election

A closer look at voter ballot data reveals suspicious anomalies

July 14, 2009 RSS Feed Print

By Julie Rehmeyer, Science News

An American statistician says strong statistical evidence backs up the claims of Iranian protestors that Mahmoud Ahmadinejad’s victory in the June election was fraudulent.

Walter Mebane of the University of Michigan in Ann Arbor analyzed Iranian election data and found anomalies strongly suggesting that ballot boxes were stuffed with extra votes for Ahmadinejad. Mebane also identified 81 towns where further investigations are likely to find evidence of fraud.

“This suggests that the actual outcome should have been pretty close,” says Mebane, who described his analysis on a paper posted on his website June 15 and updated June 29. The official results showed Ahmadinejad getting almost twice as many votes as his closest rival.

“His data is highly, highly, highly suggestive that something odd was going on,” says political scientist Henry Brady of the University of California, Berkeley. “Someone who really knows the geopolitical makeup of Iran might be able to take this analysis further. I hope the CIA has someone doing that.”

Mebane cautions that the anomalous statistics could imaginably have an innocent explanation, that limited data is available, and that he is not himself an expert on Iranian politics. Nevertheless, he concludes that “because the evidence is so strikingly suspicious, the credibility of the election is in question until it can be demonstrated that there are benign explanations for these patterns.”

After receiving vote counts from each polling station, Mebane examined them for internal consistency using a statistical curiosity known as Benford’s Law. In many kinds of data, the first digit of the numbers will be 1 about 30 percent of the time, rather than following the naïve expectation of one time out of 10. Exponential growth is one way of producing this pattern: If a bacterium population starts at 100 and doubles each day, then for the entire first day there will be 100-some bacteria, giving an initial digit of 1. By comparison, the first digit will be 7 for just a few hours as the population zooms from 400 to 800 on the third day. Benford’s Law also applies when many random processes combine to produce the data.

Mebane has studied election data from many countries, including the United States, Russia and Mexico. In 2006, he found that vote counts tend to follow Benford’s Law in the second digit. That finding was initially controversial but is now widely accepted.

When Mebane studied polling station-level data from Iran, he found that the numbers on the ballots for Ahmadinejad and two of the minor candidates didn’t conform to Benford’s Law well at all.

In any fair election, a certain percentage of votes are illegible or otherwise problematic and have to be discarded. When people commit fraud by adding extra votes, they often forget to add invalid ones. Suspiciously, Mebane found that in towns with few invalid votes, Ahmadinejad’s ballot numbers were further off from Benford’s Law — and furthermore, that Ahmadinejad got a greater percentage of the votes.

“The natural interpretation is that they had some ballot boxes and they added a whole bunch of votes for Ahmadinejad,” Mebane says.

Mebane also received data from the 2005 Iran election that aggregated the votes of entire towns. He compared it with the 2009 data to see how plausible the patterns were, using a method similar to the one he used to analyze the “butterfly ballots” in Florida in the 2000 U.S. presidential election. If Ahmadinejad fared poorly in a particular town in 2005, you wouldn’t expect him to do especially well there in 2009 either. Mebane used a statistical model for finding the most likely relationship between the two results. To do so, his method ignores “outliers,” data points that don’t fit well with that most likely relationship.

The best relationship the model found produced 81 outliers out of 320 towns in the analysis, a strikingly high percentage. Another 91 fit the model, but poorly. In the majority of these 172 towns, Ahmadinejad did better than the model would have predicted.

“This is not necessarily diagnostic of fraud,” Mebane says. “It could just be that the model is really terrible.” But since the first analysis gives evidence of fraud, the cities the model flags as problematic are the sensible ones to scrutinize.


Tags:
math,
science

Reader Comments Read all comments (7)

Add Your Thoughts
Your comment will be posted immediately, unless it is spam or contains profanity. For more information, please see our Comments FAQ.

I’m impressed, I have to say. Very seldom do I see a blog thats both educational and entertaining, and let me tell you, you’ve hit the nail on the head. Your blog is important; the matter is something that not a lot of people are talking intelligently about. I’m really happy that I stumbled across this in my search for something relating to it.

Abdullah Al Mamun of CA 5:51AM December 14, 2011

Which approach is appropriate in analyzing an election?

1. Graphs are to be plotted on the basis of unbiased data reflecting a democratic election.

2. Data are to be derived from the graphs plotted by a minor party to meet their requirements.

Dilemma of AL 7:34AM August 09, 2009

The most significant piece of evidence is the obvious one. The size, intensity and duration of protests following the election clearly indicate that many people in Iran don't believe the election was fair. Even if Ahmadinejad actually did win a majority of the legal ballots cast, the perception that he didn't is strong enough to undermine the legitimacy of his government (to the extent that it is "his" anyway, since the Supreme Council seems to exert most of the real power in any case.) Repression of the people protesting the result adds to the perception of illegitimacy.

What remains to be seen is whether Iran will see greater repression long-term, short-term repression followed by gradual reform, or revolution. This issue will not be decided by elections, let alone election monitoring.

Doug Samuelson of VA 1:43PM July 19, 2009

National Science Foundation

NSF

Bringing Evolutionary Science to the Community

Center promotes Darwin Day to inspire next generation of scientists.

Constructing Biological Machines

Research has implications for industry, medicine, energy, environment.

Laser Mapping Helps City Planners

LiDAR technology can be used to predict natural disasters.

advertisement

Science Discoveries

Science Discoveries

iTunes icon RSS icon

advertisement