Page 295 - Computational Statistics Handbook with MATLAB
P. 295

284                        Computational Statistics Handbook with MATLAB


                                                                        ⁄
                                                   ˆ *                – 15
                                                   h Ker =  0.786 ×  IQR ×  n  .
                             Silverman [1986] recommends that one use whichever is smaller, the sample
                             standard deviation or  IQR 1.348⁄   as an estimate for  . σ
                              We now turn our attention to the problem of what kernel to use in our esti-
                             mate. It is known [Scott, 1992] that the choice of smoothing parameter h is
                             more important than choosing the kernel. This arises from the fact that the
                             effects from the choice of kernel (e.g., kernel tail behavior) are reduced by the
                             averaging process. We discuss the efficiency of the kernels below, but what
                             really drives the choice of a kernel are computational considerations or the
                             amount of differentiability required in the estimate.
                              In terms of efficiency, the optimal kernel was shown to be [Epanechnikov,
                             1969]

                                                       3
                                                       -- 1 –(  t );  – 1 ≤≤  1
                                                             2
                                                                        t
                                                        -
                                               Kt() =   4
                                                      
                                                       0;           otherwise.
                             It is illustrated in Figure 8.10 along with some other kernels.



                                             Triangle Kernel        Epanechnikov Kernel
                                       1                         1
                                      0.8                       0.8
                                      0.6                       0.6
                                      0.4                       0.4
                                      0.2                       0.2
                                       0                         0
                                       −1  −0.5   0   0.5  1     −1  −0.5  0    0.5  1
                                             Biweight Kernel          Triweight Kernel
                                       1                         1
                                      0.8                       0.8
                                      0.6                       0.6
                                      0.4                       0.4
                                      0.2                       0.2
                                       0                         0
                                       −1  −0.5   0   0.5  1     −1  −0.5  0    0.5  1
                               II
                               U
                              F F FI F IG URE G 8.10  RE RE RE 8.10
                               GU
                                  8.10
                                  8.10
                               GU
                              These illustrate four kernels that can be used in probability density estimation.
                            © 2002 by Chapman & Hall/CRC
   290   291   292   293   294   295   296   297   298   299   300