Page 216 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 216

5.2 Contingency Tables   197


           first categorise the SCORE variable into four categories. These can be classified
           as: “Poor” corresponding to a final  examination score  below 10; “Fair”
           corresponding to a score  between  10 and  13; “Good” corresponding to a score
           between 14 and 16; “Very Good” corresponding to a score above 16. Let us call
           PERF (performance) this new categorised variable.
              The 3×4 contingency table, using variables PROG and PERF, is shown in Table
           5.13.  Only two (16.7%) cells have expected counts  below  5; therefore, the
           recommended conditions, mentioned in the previous section, for using the
           asymptotic distribution of T, are met.
              The value  of  T is 43.044.  The asymptotic chi-square distribution  of  T  has
           (3 – 1)(4 – 1) = 6 degrees of freedom. At a 5% level, the critical region is above
           12.59 and therefore the null hypothesis is rejected at that level. As a matter of fact,
           the observed significance of T is p ≈ 0.

           Table 5.13. The 3×4 contingency table obtained with SPSS for the independence
           test of Example 5.12.
                                                      PERF              Total
                                                                 Very
                                           Poor    Fair   Good
                                                                 Good
           PROG 0       Count               76     78      16      7     177
                        Expected Count     63.4    73.8   21.6   18.3   177.0
                   1    Count               19     29      10     13     71
                        Expected Count     25.4    29.6    8.6    7.3    71.0
                   2    Count                2      6      7       8     23
                        Expected Count      8.2    9.6     2.8    2.4    23.0
           Total        Count               97     113     33     28     271
                        Expected Count     97.0   113.0   33.0   28.0   271.0



              The chi-square test of independence can also be applied to assess whether two
           or more groups of data are independent or can be considered as sampled from the
           same population. For instance, the results obtained for Example 5.7 can also be
           interpreted as supporting, at a 5% level, that the male and female groups are not
           independent for variable  Q7; they can  be considered  samples from the same
           population.


           5.2.4 Measures of Association Revisited

           When analysing contingency tables, it is also convenient to assess the degree of
           association  between the variables, using  the ordinal and  nominal association
           measures described in sections  2.3.5 and  2.3.6, respectively. As in 4.4.1, the
   211   212   213   214   215   216   217   218   219   220   221