Page 131 - Introduction to Statistical Pattern Recognition

P. 131

3 Hypothesis Testing 113

@--'(€) E %
4 4
0-

-1 -

-
-2
1
0.5

1 2 3 4 5 6 7 8 9 1 0 6
Fig. 3-22 Effect of the number of observations.

On the other hand, the E [ h(X) wi 1's for the quadratic classifier do not become
I
zero unless both means and covariance matrices are the same for o, and 02, as
seen in (3.139) and (3.140).
The effectiveness of m for reducing the error is significantly diminished
when samples and subsequently h(Xj)'s are correlated. This can be observed
by computing the variance of s for correlated h's as

ni
Var(sIoj) = zVar(h(X,i)loiJ
;=I

That is, the second term does not disappear and contributes to increase
Var{ s I mi 1.

Example 17: Suppose that Var{h(X,)Io,} =o:, and E((h(X,)
-q,)(h (Xk)-q,) I w, J = p I '-I ' 02. Then, the second term of (3.182) becomes
20![p,(m-l)+p,'(m-2) + . . . +p?-']. When p, =OS and m=10 are used,
it becomes 160:. Therefore, Var( s I w, ] = 260; instead of lMf for p, = 0.

126 127 128 129 130 131 132 133 134 135 136