Page 256 - Statistics for Dummies

                                         Part IV: Guesstimating and Hypothesizing with Confidence
                                                    However, that doesn’t necessarily mean a real difference isn’t present in the
                                                    population of all students. But the researcher can’t say the computer game is
                                                    a better reading method based on this sample of 10 students. (See Chapter 14
                                                    for information on the power of a hypothesis test and its relationship to
                                                    sample size.)
                                                    In many paired experiments, the data sets are small due to costs and time
                                                    associated with doing these kinds of studies. That means the t-distribution
                                                    (see the t-table in the appendix) is often used instead of the standard normal
                                                    (Z-) distribution (the Z-table in the appendix) when figuring out the p-value.
                                         Comparing Two Population Proportions
                                                    This test is used when the variable is categorical (for example, smoker/
                                                    nonsmoker, Democrat/Republican, support/oppose an opinion, and so
                                                    on) and you’re interested in the proportion of individuals with a certain
                                                    characteristic — for example, the proportion of smokers. In this case, two
                                                    populations or groups are being compared (such as the proportion of female
                                                    smokers versus male smokers).
                                                    In order to conduct this test, two independent (separate) random samples
                                                    need to be selected, one from each population. The null hypothesis is that the
                                                    two population proportions are the same; in other words, that their difference
                                                    is equal to 0. The notation for the null hypothesis is $H_o: p_1 = p_2$, where $p_1$ is the
                                                    proportion from the first population, and $p_2$ is the proportion from the second
                                                    population.
                                                    Stating in $H_o$ that the two proportions are equal is the same as saying their
                                                    difference is zero. If you start with the equation $p_1 = p_2$ and subtract $p_2$ from each
                                                    side, you get $p_1 - p_2 = 0$. So you can write the null hypothesis either way.
                                                    The formula for the test statistic comparing two proportions (under certain
                                                    conditions) is

                                                    \[ z = \frac{\hat{p}_1 - \hat{p}_2}{\sqrt{\hat{p}\,(1 - \hat{p})\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}} \]

                                                    where $\hat{p}_1$ is the proportion in the first sample with the characteristic of interest,
                                                    $\hat{p}_2$ is the proportion in the second sample with the characteristic of interest,
                                                    $\hat{p}$ is the proportion in the combined sample (all the individuals in the first and
                                                    second samples together) with the characteristic of interest, $n_1$ and $n_2$ are the two
                                                    sample sizes, and z is a value on the Z-distribution (see Chapter 9). To calculate the
                                                    test statistic, do the following:
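
As a rough illustration of the calculation, here is a minimal Python sketch of the pooled two-proportion z statistic: the difference between the two sample proportions, divided by a standard error built from the combined-sample proportion. The function name `two_proportion_z_test`, the argument names, and the counts in the usage line are hypothetical, chosen only for illustration. The two-sided p-value is also an assumption, since the text has not yet stated an alternative hypothesis.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(x1, n1, x2, n2):
    """Pooled z-test of H_o: p1 = p2 (two-sided alternative assumed).

    x1, x2 are the counts with the characteristic of interest in each sample;
    n1, n2 are the sample sizes.
    """
    p1_hat = x1 / n1                  # proportion in the first sample
    p2_hat = x2 / n2                  # proportion in the second sample
    p_hat = (x1 + x2) / (n1 + n2)     # proportion in the combined sample

    # Standard error of the difference, using the combined proportion
    se = sqrt(p_hat * (1 - p_hat) * (1 / n1 + 1 / n2))

    z = (p1_hat - p2_hat) / se

    # Two-sided p-value from the standard normal (Z-) distribution
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical counts, for illustration only: 30 of 100 versus 20 of 100
z, p_value = two_proportion_z_test(x1=30, n1=100, x2=20, n2=100)
print(f"z = {z:.3f}, two-sided p-value = {p_value:.4f}")
```

The sketch uses only the standard library (`statistics.NormalDist` stands in for the Z-table). It does not check the conditions under which the formula applies, so treat it as a way to follow the arithmetic rather than a complete test procedure.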








