Page 190 - Statistics II for Dummies
P. 190

174        Part III: Analyzing Variance with ANOVA



                      Following Up after ANOVA


                                The main reason folks use ANOVA to analyze data is to find out whether
                                there are any differences in a group of population means. Your null hypoth-
                                esis is that there are no differences, and the alternative hypothesis is that
                                there’s at least one difference somewhere between two of the means. (Note it
                                doesn’t say that all the means have to be different.)

                                If it’s established that at least two of the population means are different, the
                                next natural question is: “Okay, which ones are different?” Although this is a
                                very simple-sounding question, it doesn’t have a simple answer. The concept
                                of means being different can be interpreted in hundreds of ways. Is one larger
                                than all the others? Are three pairs of them different from each other and the
                                rest all the same? Statisticians have worked long and hard to come up with
                                a wide range of choices of procedures to explore and find differences of all
                                types in two or more population means. This family of procedures is called
                                multiple comparisons.

                                This section starts off with an example in which the ANOVA procedure was
                                used and Ho was rejected, leading you to the next step: multiple compari-
                                sons. You then get an overview of how and why multiple comparison proce-
                                dures work.



                                Comparing cellphone minutes: An example

                                Suppose you want to compare the average number of cellphone minutes
                                used per month for various age groups, where the age groups are defined as
                                the following:

                                  ✓ Group 1: 19 years old and under
                                  ✓ Group 2: 20–39 years old
                                  ✓ Group 3: Adult males 40–59 years old
                                  ✓ Group 4: Adult females 60 years old and over

                                You collect data on a random sample of ten people from each group (where
                                no one knows anyone else to keep independence), and you record the
                                number of minutes each person used their cellphone in one month. The first
                                ten lines of a hypothetical data set are shown in Table 10-1.
















          16_466469-ch10.indd   174                                                                   7/24/09   9:41:31 AM
   185   186   187   188   189   190   191   192   193   194   195