Page 164 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 164

144      4 Parametric Tests of Hypotheses


              Let us now express these sums of squares, related by 4.24, in terms of variances:

              SST =  n ( −  v ) 1 .                                       4.25a
                          c             c    
              SSW ≡ SSE = ∑  n (  i  −  v ) 1  i  =  ∑  n (  i  −  ) 1    v W  =  n ( −  c) v .  4.25b
                                                          W
                          i 1=         i 1=  
              SSB =  c ( −  v ) 1  B  .                                   4.25c

              Note that:

              1.  The within-group variance, v W, is the pooled variance and corresponds to
                 the generalization of formula 4.9:
                           c
                          ∑ ( n −1  v )  i
                              i
                 v W  ≡ v =  i=1     .                                     4.26
                       p
                             n − c
                 This variance represents the stochastic behaviour of the cases around their
                                                       2
                 group means. It is the point estimate of  σ ,  the  true  variance  of  the
                 population, and has  n – c degrees of freedom.
              2.  The within-group variance v W represents a mean square error, MSE, of the
                 observations:
                            SSE
                 MSE  ≡  v W  =  n − c  .                                  4.27


              3.  The between-group variance, v B, represents the stochastic behaviour of the
                                                                      2
                 group means around the global mean. It is the point estimate of σ when the
                 null hypothesis is true, and has c – 1 degrees of freedom.
                 When the number of cases per group is constant and equal to n, we get:
                        c
                       ∑ ( x −  x)  2
                           i
                 v =  n  i=1     =  nv ,                                   4.28
                  B
                          c −1       X
                 which is the sample expression of formula 3.8, allowing us to estimate the
                 population variance, using the variance of the means.
              4.  The between-group  variance,  v B, can be  interpreted as  a  mean between-
                 group or classification sum of squares, MSB:
                            SSB
                 MSB ≡ v B  =   .                                          4.29
                            c − 1
              With the help of formula 4.24, we see that the total sample variance, v, can be
           broken down into two parts:

               n ( −  v ) 1 =  n ( −  c) v W  +  c ( −  v ) 1  B  ,        4.30
   159   160   161   162   163   164   165   166   167   168   169