Page 208 - Introduction to Statistical Pattern Recognition

P. 208

190 Introduction to Statistical Pattern Recognition

Second term p2: Similarly, the derivatives of p2 can be obtained from
(A.37) and (A.38). The results are

(5.29)

(5.30)

(5.3 1)

Substituting (5.29) through (5.31) into (5.18) and (5.19), and noting that
hi') = 1 and hj2' =hi,

(5.33)

Discussions and experimental verification: Table 5- 1 shows the depen-
dence of E ( Apl } and E ( Ap2 } on n and k (=N ln) for three different cases [4].
In the first case, two sets of samples are drawn from the same source Nx(U,I),
a normal distribution with zero mean and identity covariance matrix. The
second and third cases are Data 1-1 and Data I-41 (with variable n), respec-
tively. As Table 5-1 indicates, for all three cases, E{Apl } is proportional to
llk while E(Ap2) is proportional to (n+l)lk. Also, note that E(Apl ) is the
same for the first and third cases because the sources have the same mean.
Similarly, E{Ap2) is the same for the first and second cases because the
sources share a covariance matrix.
Since the trend is the same for all three cases, let us study the first case
closely. Table 5-1 demonstrates that, in high-dimensional spaces (n >> I),
E(Ap2J = 0.125(n+l)/k dominates E(Ap, } = 0.25lk. Also, E(Ap2) =
0.125(n+l)/k indicates that an increasingly large value of k is required to main-
,.
1
A
tain a constant value of E 1 p 1 (= E { pl ) + E { p2 1) as the dimensionality
increases. For example, Table 5-2 shows the value of k required to obtain

203 204 205 206 207 208 209 210 211 212 213