Page 52 -
P. 52

2.4  Significance tests   37




                  the null hypothesis when it is false and should be rejected (Rosenthal and Rosnow,
                  2008). A widely used example to demonstrate Type I and Type II errors is the judicial
                  case. In the US justice system, a defendant is presumed innocent. This presumption
                  leads to the following null and alternative hypotheses:
                  •  H 0 : The defendant is innocent.
                  •  H 1 : The defendant is guilty.
                     A Type I error occurs when the jury decides that the defendant is guilty when he
                  is actually innocent, meaning that the null hypothesis is rejected when it is true. A
                  Type II error occurs when the jury decides that the defendant is innocent when he is
                  actually guilty, meaning that the null hypothesis is accepted when it is false. Table 2.4
                  illustrates these errors. In the ideal case, the jury should always reach the decision that
                  the defendant is guilty when he is actually guilty and vice versa. But in reality, the jury
                  makes mistakes occasionally. Each type of error has costs. When a Type I error occurs,
                  an innocent person would be sent to prison or may even lose his life; when a Type II
                  error occurs, a criminal is set free and may commit another crime.


                   Table 2.4  Type I and Type II Errors in the Judicial Case
                                                    Jury Decision
                                                    Not Guilty       Guilty
                                    Not guilty      √                Type I error
                   Reality
                                    Guilty          Type II error    √


                     Let us further examine Type I and Type II errors through a study in the HCI do-
                  main. Suppose a bank hires several HCI researchers to evaluate whether ATMs with
                  a touch-screen interface are easier to use than the ATMs with buttons that the bank
                  branches are currently using. In this case, the null hypothesis and the alternative
                  hypothesis are:
                  •  H 0 : There is no difference between the ease of use of ATMs with touch screens
                     and ATMs with buttons.
                  •  H 1 : ATMs with touch screens are easier to use than ATMs with buttons.
                     The possible Type I and Type II errors in this study are illustrated in Table 2.5. A
                  Type I error occurs when the research team decides that touch-screen ATMs are easier
                  to use than ATMs with buttons, when they are actually not. A Type II error occurs
                  when the research team decides that touch-screen ATMs are no better than ATMs with
                  buttons, when they are. Again, each type of error can induce negative consequences.
                  When a Type I error occurs, the bank may spend money to switch to touch-screen
                  ATMs that do not provide better service to the customers. When a Type II error occurs,
                  the bank chooses to stay with ATMs with buttons and loses the opportunity to improve
                  the service that it provides to its customers.
   47   48   49   50   51   52   53   54   55   56   57