Blog: Numeracy

Readers of this book score higher on the SAT!

Via the Freakonomics blog, Freakonomics is one of the top five books ranked by the average SAT scores of colleges where it is popular. Steven Levitt makes the obvious point that most likely reading Freakonomics and the other top books does not improve your score, but rather highly-scoring people read it. One commenter, Dan, suggested,
You could put that on the cover of the next printing—"Readers of this book score higher on the SAT!" It would be risky, though. People might buy the book, read it and learn that correlation does not imply causation, and then protest the authors.
It's actually perfect: a statement that would be most likely to persuade people who don't understand correlation vs. causation to buy a book that would teach them about correlation vs. causation.

posted on Jan 31, 2008 2:33 pm (comment)

Statistics and red-shirted crewmen

Everyone who's watched Star Trek (the original) knows that when a red-shirted crewman beams down to the planet with the main characters in an episode, he's going to be food for whatever monster lives there, or killed by hostile aliens.

But everything we know isn't always true when you analyze statistics properly. Matt Bailey of SiteLogic uses the red-shirt phenomenon as an example to illustrate good statistical analysis, discovering that almost as many red-shirts died on the Enterprise as on planets, and most of those deaths occurred in groups. Most of all, he discovered that when Captain Kirk becomes romantically involved with a local alien woman, the red shirt survival rate skyrockets.

The article even includes a funny parody of classic lame PowerPoint statistical presentation.

posted on Aug 5, 2007 10:16 am (comment)

Randomn3ss and L3vy

This week's episode of NUMB3RS opened with Professor Eppes giving a lecture on randomness. He showed two sets of dots, one with some clusters and some empty areas, the other with the dots fairly evenly spread out. Most of the class chose the second as the most random, wrongly. Aha, I thought, a neat coincidence that they are talking about the way humans see patterns where none exist, just a few days after I posted about this in connection with Steven Levy's iPod shuffle experience. But in this case, there was a pattern, as a few scenes later Eppes uses the very same example of the iPod torn right from the pages of the book.

Episodes take weeks or more to write and produce, so the writers likely based this one on the book, which came out six weeks ago. But it's surely just randomness that I saw the show days after hearing Levy speak. After all, if the two events hadn't coincided in time, I probably wouldn't have written a blog post about this. The fact that I did makes it noticed, and noticing coincidence makes us see the clusters all the more clearly even when they occur randomly.

posted on Dec 10, 2006 5:54 pm (comment)

The pattern is not in the iPod, but in ourselves

Steven Levy of Newsweek spoke at Google today about his new book, The Perfect Thing, about the iPod Shuffle. He related a story about how, when he got his Shuffle, he started to notice a propensity for playing Steely Dan songs. So he wrote a column about this, and was flooded with letters. Many people wrote that they also noticed a preference for Steely Dan in their iPods. Others observed predilections for other artists. One thought her iPod had "moods," like being bluesy on Mondays. One noticed a preference for Bob Dylan one day, and then the next, Bob Dylan songs by different people.

Those with a good understanding of statistics will immediately realize that this is an inevitable human reaction to randomness. As Levy came to realize, "the pattern is not in the iPod, but in ourselves." Human brains are wired to recognize patterns, and they even see them where there are none. This is why some people are sure that they have a "feeling" before a disaster, or believe in ESP, or luck (as Levy points out, the gambling industry is built on people's irrational response to randomness). Not having clusters where one artist's song appears four times in a row is much more unusual than having some such clusters.

This is a great example of innumeracy among people in general, and why teaching statistics is so important.

posted on Dec 5, 2006 12:50 pm (comment)

Correlation vs causation

I've said before that the biggest gap in high school education is in statistics and basic numeric literacy (aka numeracy). Most people are simply not well equipped to read about a scientific study or anything else with numbers, and evaluate it critically.

One extremely common fallacy is confusing correlation with causation. If an alien were to come to earth and notice that on days when people wear raincoats it rains more often than on other days, that alien might conclude that raincoats cause rain. Of course, in reality the causation runs the other way. The only way to prove a causation is through a controlled experiment, where a scientist creates two groups of subjects at random, alters one attribute about one group, and then measures whether an observable result appears different in the experimental group versus a control group. This is how drug trials work, or Google user interface experiments. In social research, it's often not possible to, for example, ask half your subjects to go through a divorce just to see what happens to children, so researchers are limited to analyzing data and finding correlations without the ability to prove causation.

Many studies that find a correlation between two things either do a sloppy job of making it clear that they've only proven correlation, or journalists unfamiliar with the distinction confuse the two in articles. Just today I saw these two articles that confused correlation with causation:

First, BoingBoing cited a BBC story of a study that tracked subjects for two weeks and how much sex they had. At the end of the time period, they were put into stressful situations like public speaking. Those who engaged in penetrative sex did better, while those engaged in no sex or other types showed no change. But astute BoingBoing reader added this comment:

Just because people who have sex are less stressed doesn't mean that the sex causes them be that way. At least as plausible is that they have more sex _because_ they are less stressed and recover from stress more quickly— stress and staying steamed being a turn-off, in general, not to mention physiological effects of stress on performance, for men.
Another possibility is that people who are already less stressed, or who are temperamentally less inclined to be stressed, tend to have more sex. I highly doubt that the researcher was able to take two groups at random and arrange for one to get more sex; some people naturally have more sex than others, and those same factors which govern how much sex the person is going to have (personality, charisma, confidence, experience, etc.) also may govern how stressed they become at speaking in public.

Second, University of Minnesota researchers claim that "having too many meetings and spending too much time in meetings per day may have negative effects". Again they collected data by having participants keep diaries, and again there doesn't appear to be much accounting for the preexisting biases that lead one participant to attend more meetings.

Did the data show that people who had more meetings overall were more stressed? The article mentions how managers spend more time in meetings. Maybe managers are more likely to be stressed than other people. Or did the diaries reveal that on days when people had more meetings, they were more stressed? Depending on the jobs they did (we don't know), it could be that people had a fairly consistent amount of work, but some days also had meetings layered on top, making those days more stressful.

Now, both these studies were conducted by professional social science researchers. And it's entirely possible that they carefully tried to account for all variables that would bias the results. But just from reading the articles, it's clear that either the article glosses over important details, or the researchers did.

posted on Jan 28, 2006 1:48 am (comment)

Math: power for all

If I could add one subject to the high school curriculum, it would be statistics. Yes, I know it's already the bane of many a social science major's existence in college. But developing a basic ability to critically evaluate numbers from news articles is, in my opinion, the most important tool a citizen should have. The media is rife with bogus claims that confuse correlation with causation. And without the understanding necessary to think critically about what they read, people have no way to separate truth from distortion, and then what value is truth?

Example of the day is this obviously misleading claim from a recent Annenberg survey (PDF) - obviously misleading, that is, to those who understand basic statistics. Annenberg claims:

About as many Americans consider Rush Limbaugh, the conservative radio talk show star, to be a journalist as say the same of Bob Woodward, the Washington Post's assistant managing editor who broke the Watergate story with Carl Bernstein, according to a national survey conducted for the University of Pennsylvania's Annenberg Public Policy Center.

Twenty-seven percent of adult Americans polled between March 7 and May 2, 2005, said Limbaugh was a journalist, 55 percent said he was not, and 18 percent said they did not know. For Woodward, 30 percent said he was a journalist, 17 percent said he was not and 53 percent did not know. The difference was within the poll's margin of sampling error.

Chris Bowers of MyDD points out the fallacy:
Limbaugh's name ID is at 82% in this poll, while Woodward's is at 47%, entirely accounting for the similarity this poll supposedly finds. By a two to one margin, people who know who Rush Limbaugh is do not think he is a journalist. Also, by nearly a two to one margin, people who know who Bob Woodward is think he is a journalist.
Maybe it is unreasonable and naive to want people to be able to question the assumptions behind claims we read. But shouldn't we at least try to help them do so? Or at the very least, we can hope that polling organizations and the people who write their press releases can learn how.

posted on Jun 15, 2005 1:08 am (comment)

Purple Cows

I never saw a purple cow
But if I were to see one
Would the probability ravens are black
have a greater chance to be one?

posted on Jan 11, 2005 5:07 am (comment)

All text and images on this site are licensed under a Creative Commons license.

Creative Commons License