Page 91 - Introduction to Statistical Pattern Recognition

P. 91

3 Hypothesis Testing 73

-n
0.1 I I I I111111 , I IIIIIII I I I111111 I , IIIIII
8 16 32 64 128 256
Fig. 3-10 Performance of a single hypothesis test for Data 1-1.

ex and ed for various values of n is plotted in Fig. 3-10. For example, when
n =64, &X = 0.1% is increased to &d = 8.4%. This is the price we must pay
when we do not know where the second distribution is located relative to the
first distribution for a fixed IlM 11.

Ranking procedure: So far, we have pointed out that mapping the n-
dimensional X into the one-dimensional d2 and classifying samples by thres-
holding d2 produce a large increase in error. However, the error may be
reduced significantly by using different approaches for different applications.
For example, let our problem be to select one object for targeting out of many
objects detected in a field. Then, we may rank the objects according to their
distances from the selected target mean, and choose the closest one as the one
to target. This ranking, instead of thresholding, reduces the classification error
of the selected object. However, it must be noted that this problem is different
from the conventional one, in which all objects are classified and the number
of misclassified objects are counted as the error.
Assuming that kl ol-samples and k2 02-samples are available for rank-
ing, the probability of acquiring one of the k, ol-samples by this procedure
(the probability of correct classification) can be expressed as [8-91

86 87 88 89 90 91 92 93 94 95 96