Page 186 - Statistics II for Dummies
P. 186

170
                       Part III: Analyzing Variance with ANOVA
                       Checking the Fit of the ANOVA Model



                                  As with any other model, you must determine how well the ANOVA model
                                  fits before you can use its results with confidence. In the case of ANOVA,
                                  the model basically boils down to a treatment variable (also known as the
                                  population you’re in) plus an error term. To assess how well that model fits
                                                          2
                                                                2
                                  the data, see the values of R  and R  adjusted on the last line of the ANOVA
                                  output below the ANOVA table. For the seed-spitting data, you see those
                                  values at the bottom of Figure 9-4.
                                   ✓ The value of R  measures the percentage of the variability in the
                                                   2
                                      response variable (y) explained by the explanatory variable (x). In the
                                      case of ANOVA, the x variable is the factor due to treatment (where the
                                      treatment can represent a population being compared). A high value of
                                      R  (say, above 80 percent) means this model fits well.
                                       2
                                                                                        2
                                                   2
                                   ✓ The value of R  adjusted, the preferred measure, takes R  and adjusts
                                      it for the number of variables in the model. In the case of one-way
                                                                                                  2
                                      ANOVA, you have only one variable, the factor due to treatment, so R
                                           2
                                                                                           2
                                                                                     2
                                      and R  adjusted won’t be very far apart. For more on R  and R  adjusted,
                                      see Chapter 6.
                                                                               2
                                  For the watermelon seed-spitting data, the value of R  adjusted (as found
                                  in the last row of Figure 9-4) is only 53.97 percent. That means age group
                                  (shown to be statistically significant by the F-test; see the section “Making
                                  conclusions from ANOVA”) explains just over half of the variability in the
                                  watermelon seed-spitting distances. Because of that connection, you may
                                  find other variables you can examine in addition to age group, making an
                                  even better model for predicting how far those seeds will go.
                                  As you see in Figure 9-1, the results of the t-test done to compare the spitting
                                  distances of males and females in the section “Comparing Two Means with
                                  a t-Test” show that males and females were significantly different on mean
                                  seed-spitting distances (p-value = 0.039 < 0.05). So I would venture a guess
                                  that if you include gender as well as age group, thereby creating what statisti-
                                  cians call a two-factor ANOVA (or two-way ANOVA), the resulting model would
                                                                                      2
                                                                                2
                                  fit the data even better, resulting in higher values of R  and R  adjusted.
                                  (Chapter 11 walks you through the two-way ANOVA.)



















                                                                                                       7/23/09   9:31:31 PM
           15_466469-ch09.indd   170
           15_466469-ch09.indd   170                                                                   7/23/09   9:31:31 PM
   181   182   183   184   185   186   187   188   189   190   191