Page 267 - Introduction to Statistical Pattern Recognition

P. 267

5 Parameter Estimation 249

ignored. Cov-..{yy),yf) J may be approximated by using the first term only of
(5.178). Again, using (5.184),

COV- ( 7:),7f) } = Ex ( 7y)yy) ] - E* ( y:' ]E. { yf ) }

1
=-sisn(h(X:"))sj~n(~(X1')))E { Awy)Awf) }-E.. { y:' ]E* (yf' ]
4
(5.186)

where E(Aw:"Aw~)] =-I/NP for j +k by (5.166), and E*(yy)]E*{yf)} is
proportional to I/Nf by (5.180) and therefore can be ignored.

Thus, substituting (5.185) and (5.186) into (5.183) and using
covx (y:",yp J = 0,

r 1

(5.187)
i=I [Vi

Note that Csign(L(X,Y'))/Ni = (-1)' [(# of correctly classified mi-samples by
.. 01 ,. 01
h >< O)/N, - (# of misclassified mi-samples by h >< O)/Ni] = (-l)'[(1-qR)
W? ,. W?
-eiR] ]= (-l)1(1-2~lR). Since h is the R discriminant function for the original
,.
sample set, the resulting error is the R error. The last column of Table 5-12(a)
shows the variance of E~. which is computed by the 10 trials of the conven-
tional R method. This should be compared with the last column of Table 5-
r*
12(b), Var,($IS). Both are close as (5.187) predicts. Var,{&,,ISi] is the
..*
variance of . . . [see Fig. 5-51. The last column is the average of
AX
Var( E,, I Si } over i.
Note that (5.187) is the variance expression of the R error estimate.

262 263 264 265 266 267 268 269 270 271 272