Page 102 -
P. 102

88     CHAPTER 4  Statistical analysis




                            Third, the errors in the data should be normally distributed. Similar to the assump-
                         tion of “homogeneity of variance,” this assumption is only considered to be violated
                         when the sample data is highly skewed. When the errors are not normally distributed,
                         nonparametric tests (discussed in Section 4.8) should be used to analyze the data.



                         4.7  IDENTIFYING RELATIONSHIPS

                         One of the most common objectives for HCI-related studies is to identify relationships
                         between various factors. For example, you may want to know whether there is a rela-
                         tionship between age, computing experience, and target selection speed. In statistical
                         terms, two factors are correlated if there is a significant relationship between them.


                         4.7.1   CORRELATION
                         The most widely used statistical method for testing correlation is the Pearson's product
                         moment correlation coefficient test (Rosenthal and Rosnow, 2008). This test returns a
                         correlation coefficient called Pearson's r. The value of Pearson's r ranges from −1.00 to
                         1.00. When the Pearson's r value between two variables is −1.00, it suggests a perfect
                         negative linear relationship between the two variables. In other words, any specific in-
                         crease in the scores of one variable will perfectly predict a specific amount of decrease
                         in the scores of the other variable. When the Pearson's r value between two variables is
                         1.00, it suggests a perfect positive linear relationship between the two variables. That
                         is, any specific increase in the scores of one variable will perfectly predict a specific
                         amount of increase in the scores of the other variable. When the Pearson's r value is 0, it
                         means that there is no linear relationship between the two variables. In other words, the
                         increase or decrease in one variable does not predict any changes in the other variable.
                            In the data-entry method example, suppose the eight participants each complete
                         two tasks,  one using standard  word-processing  software,  the other using
                           word- prediction software. Table 4.20 lists the number of years that each participant
                         had used computers and the time they spent on each task. We can run three Pearson's
                         correlation tests based on this data set to examine the correlation between:


                          Table 4.20  Sample Data for Correlation Tests
                                                Computer
                                                Experience      Standard        Prediction
                          Participant 1             12            245              246
                          Participant 2               6           236              213
                          Participant 3               3           321              265
                          Participant 4             19            212              189
                          Participant 5             16            267              201
                          Participant 6               5           334              197
                          Participant 7               8           287              289
                          Participant 8             11            259              224
   97   98   99   100   101   102   103   104   105   106   107