Page 235 - Six Sigma Demystified
P. 235

Part 3  s i x   s i g m a  to o l s        215


                           Analyze Stage
                             •	 To compare results of sampling from different process conditions to de-
                                tect if they are independent

                           Methodology

                           The methodology for analyzing the R rows by C columns involves using the
                           chi-square statistic to compare the observed frequencies with the expected
                           frequencies, assuming independence of the subsets. The null hypothesis is that
                           the p values are equal for each column in each row. The alternative hypothesis
                           is that at least one of the p values is different.
                             Construct the R × C table by separating the subsets of the population into
                           the tested categories.
                             Calculate the expected values for each row-column intersection cell e . The
                                                                                            ij
                           expected value for each row/column is found by multiplying the percent of
                           that row by the percent of the column by the total number.
                             Calculate the test statistic:

                                                           r  c  (o  − e  ) 2
                                                       0 ∑
                                                      χ =   ∑   ij  ij
                                                       2
                                                           1 =
                                                          i = j  1  e ij
                             For example, consider a survey of 340 males and 160 females asking their
                           preference for one of three television shows. The R × C contingency table is
                           shown in Table T.2.



                                   TAble T.2  Example Data Comparing Preferences for Three Shows

                                   Sex          Show 1      Show 2     Show 3    Total
                                   Male       160         140          40        340
                                   Female      40          60          60        160
                                   Totals     200         200        100         500


                             The expected frequency for male and show 1 = e  = (340/500) × (200/500) ×
                                                                        11
                           500 = 136. Similarly, the expected values for the other row/column pairs (e ) are
                                                                                            RC
                           found as e  = 136; e  = 68; e  = 64; and e  = 64; e  = 32 (as shown in Table T.3).
                                   12
                                                               22
                                            13
                                                    21
                                                                       23
                             The test statistic is calculated as
                                                    2
                                                                                   2
                                      X  = (160	–	136) /136 + (140	–	136) /136 + (40	–	68) /68
                                        2
                                                                    2
                                        0
                                                                 2
                                               + (40	–	64) /64 + (60	–	64) /64 + (60	–	32) /32 = 49.6
                                                                              2
                                                   2
   230   231   232   233   234   235   236   237   238   239   240