Page 252 - Introduction to Statistical Pattern Recognition

P. 252

234 Introduction to Statistical Pattern Recognition

the L and R errors is twice the bias between the L and true errors.
In order to confirm (5.156), the following experiment was conducted.

Experiment 8: Bias between the L and R error
Data: I-I (Normal, MTM = 2.562, E = 10%)
Dimensionality: n = 4, 8, 16, 32, 64
Classifier: Quadratic classifier of (5.54)
Sample size: N = N2 = kn, k = 3, 5, 10, 20, 40
I
No. of trials: z = 10
Results: Table 5-10, Fig. 5-3 [6]
The first line of Table 5-10 indicates the theoretical biases from (5.152) and
(5.156), and the second and third lines are the average and standard deviation
of the 10 trial experiment. Despite a series of approximations, the first and
second lines are close except for small k's and large n's, confirming the vali-
dity of our discussion.
An important fact is that, from (5.152) and (5.156), E(&] is roughly
proportional to n2/N for large n. A simpler explanation for this fact can be
obtained by examining (5.153) more closely. Assuming (5.155) and carrying
through the integration of (5.153) with respect to O,

(5.157)

It is known that d:(X) is chi-square distributed with an expected value of n and
standard deviation of d%, if X is normally distributed [see (3.59)-(3.61)].
This means that, when n is large, d?(X) is compactly distributed around the
expected value n (Le. n >> 6.) Therefore, d;(X) on the classification boun-
dary should be close to n2. Thus, should be roughly proportional to n2.
The analysis of the variance (5.154) is more complex. Though the order
of magnitude may not be immediately clear from (5.154), the experimental
results, presented in Fig. 5-4 and the third line of Table 5-10, show that the
standard deviation is roughly proportional to 1/N. The intuitive explanation
should be the same as that presented in Section 5.2.

247 248 249 250 251 252 253 254 255 256 257