Page 36 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 36

1.5 Beyond a Reasonable Doubt...   15


           to absolute certainty if this number tends to infinite), and that if the jury wanted to
           increase the  precision  (details) of the  verdict, it would then lose in  degree of
           certainty.

           Table 1.6. Confidence levels (δ) for the interval estimation of a proportion, when
            p ˆ = 0.74, for two different values of the tolerance (ε).
             n                              δ  for ε = 0.02       δ  for ε = 0.01
             50                                     0.25                  0.13
             100                                    0.35                  0.18
             1000                                   0.85                  0.53
             10000                                ≈ 1.00                  0.98


            1.2
                 δ
            1.0

            0.8     ε=0.04
                           ε=0.02
            0.6
                                   ε=0.01
            0.4
            0.2
                                                                        n
            0.0
               0      500    1000    1500   2000   2500    3000   3500    4000

           Figure 1.6.  Confidence  levels  for  the  interval estimation of a proportion,  when
            p ˆ = 0.74, for three different values of the tolerance.

              There is also another important and subtler point concerning confidence levels.
           Consider the value of δ  = 0.25 for a ε = ±0.02 tolerance in the n = 50 sample size
           situation  (Table 1.6).  When we say that the proportion  of students with
           performance  ≥ 3 lies somewhere in the interval  p ˆ   ± 0.02, with  the confidence
           level 0.25, it really means that if we were able to infinitely repeat the experiment of
           randomly drawing n = 50 sized samples from the population, we would then find
           that 25% of the times (in 25% of the samples) the true proportion p lies in the
           interval  p ˆ ± 0.02, where the  p ˆ  (k = 1, 2,…) are the several sample estimates
                                      k
                    k
           (from the ensemble of all possible samples). Of course, the “25%” figure looks too
           low to be reassuring. We would prefer a much higher degree of certainty; say 95%
           − a very popular value for the confidence level. We would then have the situation
           where 95% of the intervals  p ˆ ± 0.02 would “intersect” the true value p, as shown
                                    k
           in Figure 1.7.
   31   32   33   34   35   36   37   38   39   40   41