Page 330 - Introduction to Statistical Pattern Recognition

P. 330

312 Introduction to Statistical Pattern Recognition

errors counted in Table 7-l(b) is identical to that obtained from the voting 2NN
procedure using the 2nd-NN and 3rd-NN columns. In the voting 2NN pro-
cedure, a sample is rejected if the classes of the 2nd-NN and 3rd-NN columns
disagree (see X2), and the case is not considered an error. Adding the 1st-NN
column, this reject case (X2) becomes correct, but the error case (X,) still
remains an error. Thus, the L method version of the voting 2NN procedure is
the same as the R method version of the voting 3NN procedure.
An experiment was conducted to compare the performances of two
approaches: one is to divide the available sample set into design and test (the H
method), and the other uses the procedure of Table 7- 1 (a) (the L method).

Experiment 1: NN error estimate, L and H
Data: I-A (Normal, n = 8, E* = 1.9%)
Sample size: N, = N2 = 50-400 (L)
50-400 for design, 50-400 for test (H)
No. of trials: z = 10
Metric: A = I (Euclidean)
Results: Fig. 7-4

2- E*
1- -
I I I I

The voting NN procedure was applied. Nr indicates the number of test sam-
ples per class. In the H method, NT is half of the available sample size, N, and
NT = N for the L method. Note that the results are fairly close for the same
value of NT. Although the performance of the L method is slightly worse, it is
better to use the L method and double Nr than to divide the data into two
groups. Also, note that the experimental curves are above twice the Bayes

325 326 327 328 329 330 331 332 333 334 335