Page 236 -
P. 236

234                                                 R. Seri and D. Secchi

            where ˛ 2 Œ0; 1  is called Type-I error rate, i.e. the probability of rejecting the null
            hypothesis when it is true, or significance level. 4
                                                                         be the
              Suppose now that the alternative hypothesis is verified and let P H 1
            probability distribution under the alternative hypothesis. If the null hypothesis
            is associated with the absence of an effect (of a variable on the outcome), the
            alternative hypothesis implies generally that there is an effect. It is generally the
                                                                          5
            case that this effect can be measured through a quantity d called effect size, that
                                   . If we suppose that the alternative hypothesis is true,
            enters the formulation of P H 1
            the probability that T belongs to A or to R is:
                                      fT 2 Ag D ˇ .d/
                                   P H 1
                                      fT 2 Rg D 1   ˇ .d/
                                   P H 1
            where ˇ, belonging to Œ0; 1 ,isthe Type-II error rate, i.e. the probability of accepting
            the null hypothesis when it is false. Note that ˇ depends on the effect size d.The
            quantity 1   ˇ, especially when seen as a function of the effect size d, is called
            (statistical) power of the test and measures the probability that the test correctly
            identifies the presence of an effect when there is one.
              In a hypothetical simulation, for example, one may want to study how decision
            making makes employee motivation more effective under conditions of more or less
            organized corporate structures (Herath et al. 2017). The null hypothesis may be that
            the average motivation does not vary under alternative corporate structures, and this
            can be tested using, say, ANOVA. The Type-I error rate or significance level ˛,
            that is usually set at 5%, can be used to obtain an acceptance region A. If the null
            hypothesis is false, the effect size d—i.e. the “strength” of the effect—measures
            the impact that the different conditions exercise, on average, on the outcome. The
            probability that the statistic takes a value inside A under the alternative hypothesis is
            ˇ and depends on d. If the effect as measured by d is small, the alternative hypothesis
            is near to the null hypothesis, and the probability ˇ that the statistic T falls inside A
            under H 1 is near to the probability 1   ˛ under H 0 .If d increases, ˇ decreases.
              It is clear that there are several degrees of freedom in the choice of the probability
            ˛, of the statistic T for testing H 0 , and of the acceptance region A. Now, while the
            test statistic is often suggested by the problem under scrutiny, the probability ˛ is
            chosen routinely from a set of possibilities that have been determined by tradition
            more than by reflection. As to the choice of A and R, it is often the case that R



            4 We note that Neyman (1950, p. 259) used the term “accept” where most modern treatments
            propose to use “fail to reject” or “do not reject.” The original choice of the author is in line with
            his idea of testing as leading to decision, while the modern use appears to be incorrectly borrowed
            from Fisher’s approach (Fisher 1955, p. 73). However Pearson was more cautious (Pearson 1955,p.
            206) and this even suggested to some authors the idea that he had rejected the approach pioneered
            with Neyman (Mayo 1992).
            5
            More generally, the effect size d measures the distance of the true distribution from the distribution
            under the null hypothesis, and is generally a function of the parameters.
   231   232   233   234   235   236   237   238   239   240   241