Page 86 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 86

2.3 Summarising the Data   65


              Note that:

              –  For symmetrical distributions, if the mean exists, it will coincide with the
                 median. Based on this property, one can also measure the skewness using
                 g = (mean − median)/(standard deviation). It can be proved that –1 ≤ g  ≤ 1.

              –  For asymmetrical distributions, with only one maximum (which is then the
                 mode), the median is between the mode and the mean as shown in Figure
                 2.24.

                 f(x)                             f(x)






                                        x                                 x
                   mode   mean                              mean   mode
               a      median                    b              median
           Figure 2.24. Two asymmetrical distributions: a) Skewed to the right (usually with
           γ > 0); b) Skewed to the left (usually with γ < 0).


           2.3.3.2 Kurtosis

           The degree of flatness of a probability or density function near its center, can be
           characterised by the so-called kurtosis, defined as:

              κ =  ( [ Ε X  −  ) µ  4  / ] σ 4  − 3.                       2.16

              The factor 3 is introduced in order that κ = 0 for the normal distribution. As a
           matter of fact, the κ measure as it stands in formula 2.16, is often called coefficient
           of excess (excess compared to the normal distribution). Distributions flatter than
           the normal distribution have  κ  < 0; distributions more peaked than the normal
           distribution have κ  > 0.
              The sample estimate of the kurtosis is computed as:

              k = [n (n +  ) 1 M −  ( 3 n −  ) 1 M  2 2  ( [ / ]  n − 1 )(n −  2 )(n −  ) 3 s 4  ] ,  2.17
                          4

                                j
           with:  M  j  =  ∑ n = i 1  x (  i  − x) .
              Note that the  kurtosis measure has the same shortcomings as the skewness
           measure. It does not always measure what it is supposed to.
              The skewness and the kurtosis have been computed for the PRT variable of the
            Cork Stoppers’ dataset as shown in Table 2.8. The PRT variable exhibits a
           positive  skewness indicative of a rightward skewed  distribution and a positive
           kurtosis indicative of a distribution more peaked than the normal one.
   81   82   83   84   85   86   87   88   89   90   91