Page 201 - Statistics II for Dummies
P. 201

Chapter 10: Sorting Out the Means with Multiple Comparisons  185


                                This section provides an overview of other multiple comparison procedures
                                that exist and tells a little bit about each one, including the people who
                                developed them. Given the dates of when these procedures were developed,
                                I think you’ll agree with me that the 1950s was the golden age of the multiple
                                comparison procedure.


                                Controlling for baloney with
                                the Bonferroni adjustment


                                The Bonferroni adjustment (or Bonferroni correction) is a technique used in
                                a host of situations, not just for multiple comparisons. It was basically
                                created to stop people from over-analyzing data. There’s a limit to what you
                                should do when analyzing data; there’s a line that, when crossed, results in
                                something statisticians call data snooping. And the Bonferroni adjustment
                                curbs that.

                                Data snooping is when someone analyzes her data over and over again until
                                she gets a result that she can say is statistically significant (meaning the result
                                is said to have been unlikely to have happened by chance; see Chapter 3).
                                Because the number of tests completed by the data snooper is so high, she’s
                                likely to find something significant just by chance. And that result is highly
                                likely to be bogus.
                                For example, suppose a researcher wants to find out what variable is related
                                to sales of bedroom slippers. He collects data on everything he can think of,
                                including the size of people’s feet, the frequency with which they go out to
                                get the paper in their slippers, and their favorite colors. Not finding anything
                                significant, he goes on to examine marital status, age, and income.
                                Still coming up short, he goes out on a limb and looks at hair color, whether
                                or not the subjects have seen a circus, and where they like to sit on an air-
                                plane (aisle or window, sir?). Then wouldn’t you know, he strikes gold. Turns
                                out that, according to his data, people who sit on the aisles on planes are
                                more likely to buy bedroom slippers than those who sit by the window or in
                                the middle of a row.

                                What’s wrong with this picture? Too many tests. Each time the researcher
                                examines one variable and conducts a test on it, he chooses an α level at
                                which to conduct the test. (Recall that the α level is the amount of chance
                                you’re willing to take of rejecting the null hypothesis and making a false
                                alarm.) As the number of tests increases, the α’s pile up.














          16_466469-ch10.indd   185                                                                   7/24/09   9:41:43 AM
   196   197   198   199   200   201   202   203   204   205   206