Page 272 - Introduction to Statistical Pattern Recognition

P. 272

Chapter 6

NONPARAMETRIC DENSITY ESTIMATION

So far we have been discussing the estimation of parameters. Thus, if
we can assume we have a density function that can be characterized by a set of
parameters, we can design a classifier using estimates of the parameters.
Unfortunately, we often cannot assume a parametric form for the density func-
tion, and in order to apply the likelihood ratio test we somehow have to esti-
mate the density functions using an unstructured approach. This type of
approach is called nonparametric estimation, while the former is called
parametric estimation. Since, in nonparametric approaches, the density func-
tion is estimated locally by a small number of neighboring samples, the esti-
mate is far less reliable with larger bias and variance than the parametric coun-
terpart.
There are two kinds of nonparametric estimation techniques available:
one is called the Par-zen density estimate and the other is the k-nearest neigh-
bor- densiry estimate. They are fundamentally very similar, but exhibit some
different statistical properties. Both are discussed in this chapter.
It is extremely difficult to obtain an accurate density estimate non-
parametrically, particularly in high-dimensional spaces. However, our goal
here is not to get an accurate estimate. Our goal is, by using these estimates,
to design a classifier and evaluate its performance. For this reason, the accu-
racy of the estimate is not necessarily a crucial issue. Classification and
performance evaluation will be discussed in Chapter 7. The intention of this

254

267 268 269 270 271 272 273 274 275 276 277