Page 225 - Intermediate Statistics for Dummies
P. 225

18_045206 ch12.qxd  2/1/07  10:20 AM  Page 204
                               204
                                         Part III: Comparing Many Means with ANOVA
                                                    In Figure 12-1, you can see the degrees of freedom for each sums of squares
                                                    listed under the DF column of the ANOVA part of the output. You see SSR
                                                    has 2 – 1 = 1 degree of freedom, SSE has 250 – 2 = 248 degrees of freedom
                                                    (because n = 250 observations were in the data set and k = 2 and you find
                                                    n – k to get degrees of freedom for SSE). The degrees of freedom for SSTO is
                                                    250 – 1 = 249.
                                                    Bringing regression to the ANOVA table
                                                    In ANOVA, you test your model Ho: All k population means are equal versus
                                                    Ha: At least two population means are different by using a F-test. You build
                                                    your F-test statistic by relating the sums of squares for treatment to the sum
                                                    of squares for error. To do this, you divide SSE and SST by their degrees of
                                                    freedom (n – k and k – 1, respectively, where n is the sample size and k is the
                                                    number of treatments) to get the mean sums of squares for error (MSE) and
                                                    mean sums of squares for treatment (MST). In general, you want MST to be
                                                    large compared to MSE, which would indicate that the model fits well. The
                                                    results of all these statistical gymnastics are summarized by Minitab in a
                                                    table called (cleverly) the ANOVA table.
                                                    The ANOVA table shown in the bottom part of Figure 12-1 for the Internet-use
                                                    data represents the ANOVA table you get from using the regression line as
                                                    your model. Under the Source column, you may be used to seeing treatment,
                                                    error, and total. For regression, the treatment is the regression line, so you
                                                    see regression instead of treatment. The error term in ANOVA is labeled resid-
                                                    ual error, because in regression, you measure error in terms of residuals.
                                                    Finally you see total, which is the same the world around.
                                                    The SS column represents the sums of squares for the regression model. The
                                                    three sums of squares listed in the SS column are SSR (for regression), SSE
                                                    (for residuals), and SST (total). These sums of squares are calculated using
                                                    the formulas from the previous section; the degrees of freedom, DF in the
                                                    table, are found by using the formulas from the previous section also.
                                                    The MS column takes the value of SS “whatever”(you fill in the blank) and
                                                    divides it by the respective degrees of freedom, just like ANOVA. For example
                                                    in Figure 12-1, SSE is 12,968.5, and the degrees of freedom is 248. Take the first
                                                    value divided by the second one to get 52.29 or 52.3, which is listed in the
                                                    ANOVA table for MSE.
                                                                                                      MST     , 9 085 .6
                                                    The value of the F-statistic, using the ANOVA method, is F =  =  =
                                                                                                       MSE    52 .3
                                                    173.7 in the Internet example, which you can see in column five of the ANOVA
                                                    part of Figure 12-1 (subject to rounding). The F-statistics’s p-value is calcu-
                                                    lated based on an F-distribution with 2 – 1 = 1 and 250 – 2 = 248 degrees of
   220   221   222   223   224   225   226   227   228   229   230