Page 277 - Introduction to Statistical Pattern Recognition

P. 277

6 Nonparametric Density Estimation 259

w =jl?(Y)dY. (6.15)
Then, $(.)/w becomes a density function. Therefore, (6.14) becomes

(6.16)

where

(6.17)

and r-'B is the covariance matrix of ?(X)/w.

Substituting (6.12) and (6.16) into (6.7) and (6.9), the moments of p(X)
are approximated by

1
E (&X) } 'I- p (X)[1 + ya(X)r2] 2nd order approximation

P (X) 1st order approximation , (6.18)

2nd order approximation

1
E -[wp(X) - p2(X)] 1st order approximation . (6.19)
N

Note that the variance is proportional to 1/N and thus can be reduced by
increasing the sample size. On the other hand, the bias is independent of N,
and is determined by V2p (X), A, and r 2.

Normal kernel: When the kernel function is normal with zero expected
vector and covariance matrix r2A, Nx(0,r2A), $(X) becomes normal as
cNx(0,r2A 12) where c = 2-"'2(2~)-"/2 IA l-"2r-". Therefore,

272 273 274 275 276 277 278 279 280 281 282