Page 51 -
P. 51

36     CHAPTER 2  Experimental research




                            All of the above responses are well grounded, though the last two responses have
                         deeper statistical roots. The claim that females are taller than males is wrong due to
                         inappropriate sampling. The distribution of the heights of the human population (and
                         many other things in our life) follows a pattern called “normal distribution.” Data
                         sets that follow normal distribution can be illustrated by a bell-shaped curve (see
                         Figure 2.1), with the majority of the data points falling in the central area surround-
                         ing the mean of the population (μ). The further a value is from the population mean,
                         the fewer data points would fall in the area around that value.








                                                                          x
                                                     a  b
                                                          m
                         FIGURE 2.1
                         Normal distribution curve.
                            When you compare two large populations, such as males and females, there is
                         no way to collect the data from every individual in the population. Therefore, you
                         select a smaller group from the large population and use that smaller group to repre-
                         sent the entire population. This process is called sampling. In the situation described
                         in statement 2 above, the three males selected as the sample population happened
                         to be shorter than average males, while the three females selected as samples hap-
                         pened to be taller than average females, thus resulting in a misleading conclusion.
                         Randomization methods and large sample sizes can greatly reduce the possibility of
                         making this kind of error in research.
                            Since we are not able to measure the heights of all males and females, we can
                         only sample a subgroup of people from the entire population. Significance tests al-
                         low us to determine how confident we are that the results observed from the sampling
                         population can be generalized to the entire population. For example, a t test that is
                         significant at P < 0.05 suggests that we are confident that 95% of the time the test
                         result correctly applies to the entire population. We further explore the concept of
                         significance tests in the next section.


                         2.4.2   TYPE I AND TYPE II ERRORS
                         In technical terms, significance testing is a process in which a null hypothesis (H 0 ) is
                         contrasted with an alternative hypothesis (H 1 ) to determine the likelihood that the null
                         hypothesis is true. All significance tests are subject to the risk of Type I and Type II errors.
                            A Type I error (also called an α error or a “false positive”) refers to the mistake
                         of rejecting the null hypothesis when it is true and should not be rejected. A Type II
                         error (also called a β error or a “false negative”) refers to the mistake of not rejecting
   46   47   48   49   50   51   52   53   54   55   56