Page 294 - Computational Statistics Handbook with MATLAB

P. 294

Chapter 8: Probability Density Estimation 283

h = 0.84 h = 0.42
0.8 0.8

0.6 0.6
0.4 0.4
0.2 0.2

0 0
−4 −2 0 2 4 −4 −2 0 2 4
h = 0.21 h = 0.11
0.8 0.8

0.6 0.6
0.4 0.4
0.2 0.2
0 0
−4 −2 0 2 4 −4 −2 0 2 4

U
FI F IG URE G 8.9 RE 8.9
8.9
F F II GU RE RE 8.9
GU
Four kernel density estimates using n = 100 standard normal random variables. Four
different window widths are used. Note that as h gets smaller, the estimate gets rougher.
where the kernel K is a continuous probability density function with µ K = 0
and 0 < σ K < ∞. The window width that minimizes this is given by
2

⁄
 RK()  15
* -----------------------
h Ker =  nσ Rf ″()  . (8.29)
4
 k 
Parzen [1962] and Scott [1992] describe the conditions under which this
holds. Notice in Equation 8.28 that we have the same bias-variance trade-off
with h that we had in previous density estimates.
5
For a kernel that is equal to the normal density Rf ″() = 3 (⁄ 8 πσ ) , we
have the following Normal Reference Rule for the window width h.

NORMAL REFERENCE RULE - KERNELS

⁄
4 15
⁄
⁄
* --  – 15 – 15
-
h Ker = 3  σn ≈ 1.06σn .
σ
We can use some suitable estimate for , such as the standard deviation, or
⁄
ˆ
σ = IQR 1.348 . The latter yields a window width of
© 2002 by Chapman & Hall/CRC

289 290 291 292 293 294 295 296 297 298 299