Page 294 - Computational Statistics Handbook with MATLAB
P. 294

Chapter 8: Probability Density Estimation                       283




                                               h = 0.84                  h = 0.42
                                      0.8                       0.8

                                      0.6                       0.6
                                      0.4                       0.4
                                      0.2                       0.2

                                       0                         0
                                       −4   −2    0   2    4     −4   −2   0    2    4
                                               h = 0.21                  h = 0.11
                                      0.8                       0.8

                                      0.6                       0.6
                                      0.4                       0.4
                                      0.2                       0.2
                                       0                         0
                                       −4   −2    0   2    4     −4   −2   0    2    4

                               U
                              FI F IG URE G 8.9  RE 8.9
                                  8.9
                              F F II  GU  RE RE 8.9
                               GU
                              Four kernel density estimates using  n =  100   standard normal random variables. Four
                              different window widths are used. Note that as h gets smaller, the estimate gets rougher.
                             where the kernel K is a continuous probability density function with µ K =  0
                             and 0 <  σ K <  ∞.   The window width that minimizes this is given by
                                     2

                                                                      ⁄
                                                             RK()   15
                                                      *     -----------------------
                                                     h Ker =   nσ Rf ″()   .             (8.29)
                                                              4
                                                             k     
                             Parzen [1962] and Scott [1992] describe the conditions under which this
                             holds. Notice in Equation 8.28 that we have the same bias-variance trade-off
                             with h that we had in previous density estimates.
                                                                                          5
                              For a kernel that is equal to the normal density  Rf ″() =  3 (⁄  8 πσ )  , we
                             have the following Normal Reference Rule for the window width h.


                             NORMAL REFERENCE RULE - KERNELS

                                                          ⁄
                                                       4  15
                                                                          ⁄
                                                               ⁄
                                                 *     --   – 15       – 15
                                                       -
                                                h Ker =  3   σn  ≈  1.06σn  .
                                                               σ
                             We can use some suitable estimate for  , such as the standard deviation, or
                                     ⁄
                             ˆ
                             σ =  IQR 1.348  . The latter yields a window width of
                            © 2002 by Chapman & Hall/CRC
   289   290   291   292   293   294   295   296   297   298   299