Page 251 - Statistics II for Dummies
P. 251

Chapter 13: Forming Associations with Two-Way Tables   235


                                Checking for independence

                                between two variables

                                The previous section focuses on checking whether two specific categories are
                                independent in a sample. If you want to extend this idea to showing that two
                                entire categorical variables are independent, you must check the indepen-
                                dence conditions for every combination of categories in those variables. All of
                                them must work, or independence is lost. The first case where dependence is
                                found between two categories means that the two variables are dependent. If
                                you find that the first case shows independence, you must continue checking
                                all the combinations before declaring independence.

                                Suppose a doctor’s office wants to know whether calling patients to confirm
                                their appointments is related to whether they actually show up. The vari-
                                ables are x = called the patient (called or didn’t call) and y = patient showed
                                up for his appointment (showed or didn’t show). Here are the four conditions
                                that need to hold before you declare independence:

                                  ✓ P(showed) = P(showed|called)
                                  ✓ P(showed) = P(showed|didn’t call)
                                  ✓ P(didn’t show) = P(didn’t show|called)
                                  ✓ P(didn’t show) = P(didn’t show|didn’t call)

                                If any one of these conditions isn’t met, you stop there and declare the two
                                variables to be dependent in the sample. Only if all the conditions are met do
                                you declare the two variables independent in the sample.
                                You can see the results of a sample of 100 randomly selected patients for this
                                example scenario in Table 13-4.


                                   Table 13-4              Confirmation Calls Related to
                                                         Showing Up for the Appointment

                                                   Called           Didn’t Call      Row Totals
                                  Showed           57               33               90
                                  Didn’t Show      3                7                10
                                  Column Totals    60               40               100
















          20_466469-ch13.indd   235                                                                   7/24/09   9:48:00 AM
   246   247   248   249   250   251   252   253   254   255   256