Page 117 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 117

96       3 Estimating Data Parameters


           establishing a confidence interval for  the variance  only in the case that the
           population distribution follows a normal law. Then, the sampling distribution of
           the variance follows a chi-square law, namely (see Property 4 of section B.2.7):

              ( −n  ) 1 v  ~ χ 2                                           3.18
                σ 2     n  1 −

              The chi-square distribution is asymmetrical; therefore, in  order to establish a
           two-sided confidence interval, we have to use two different values for the lower
           and upper percentiles. For the 95% confidence interval and df = n −1, we have:

                       df ×  v
                               2
                2
              χ df  . 0 ,  025  ≤  ≤  χ df  . 0 ,  975   ,                 3.19
                        σ 2

                   2
           where  χ df ,α means the α percentile of the chi-square distribution with df degrees
           of freedom. Therefore:

               df × v  ≤ σ ≤  df × v  .                                    3.20
                         2
                2
              χ df  . 0 ,  975  χ df 2  . 0 ,  025

           Example 3.7
           Q: Consider the distribution of the average perimeter of defects, variable PRM, of
           class 2 in the Cork Stoppers’ dataset. Compute the 95% confidence interval
           of its standard deviation.

           A: The assumption of  normality for the PRM variable is acceptable, as will be
           explained in Chapter 5. There are, in class 2, n = 50 cases with sample standard
           variance v = 0.7168. The chi-square percentiles are:

              χ 2 49  . 0 ,  025  =  31 . 56 ; χ 49  . 0 ,  975  =  70 . 22 .
                               2

              Therefore:

              49×v  ≤ σ 2  ≤  49×v  ⇒  . 0  50 ≤ σ 2  ≤  . 1  11 ⇒  . 0  71≤ σ ≤  . 1  06 .
              70 . 22     31 . 56


              Confidence intervals  for the  variance  are computed  by SPSS,  STATISTICA,
           MATLAB and R as part of hypothesis tests presented in the following chapter.
           They can be computed, however, either using Tools.xls   or, in the case of the
           variance alone, using the MATLAB command normfit   mentioned in section 3.2.
           We also provide the MATLAB and R function  civar(v,n,alpha)      for
           computing confidence intervals of a variance (see Appendix F).
   112   113   114   115   116   117   118   119   120   121   122