Page 318 - Introduction to Statistical Pattern Recognition

P. 318

Chapter 7

NONPARAMETRIC CLASSIFICATION

AND ERROR ESTIMATION

After studying the nonparametric density estimates in Chapter 6, we are
now ready to discuss the problem of how to design nonparumetric clussifiers
and estimate their classification errors.

A nonparametric classifier does not rely on any assumption concerning
the structure of the underlying density function. Therefore, the classifier
becomes the Bayes classifier if the density estimates converge to the true den-
sities when an infinite number of samples are used. The resulting error is the
Bayes error, the smallest achievable error given the underlying distributions.
As was pointed out in Chapter 1, the Bayes error is a very important parameter
in pattern recognition, assessing the classifiability of the data and measuring
the discrimination capabilities of the features even before considering what
type of classifier should be designed. The selection of features always results
in a loss of classifiability. The amount of this loss may be measured by com-
paring the Bayes error in the feature space with the Bayes error in the original
data space. The same is true for a classifier. The performance of the classifier
may be compared with the Bayes error in the original data space. However, in
practice, we never have an infinite number of samples, and, due to the finite
sample size, the density estimates and, subsequently, the estimate of the Bayes
error have large biases and variances, particularly in a high-dimensional space.

300

313 314 315 316 317 318 319 320 321 322 323