Page 86 - Introduction to Statistical Pattern Recognition
P. 86

68                          Introduction to Statistical Pattern Recognition



                          Single hypothesis  schemes have  been  proposed  to  solve  this  problem.
                     Typically, they  involve measuring  the  distance of  the  object from  the  target
                     mean (normalized by  the target covariance matrix), and applying a threshold to
                      determine if  it is or is not a target.  This technique works well when the dimen-
                      sionality of  the data, n, is very low  (such as  1 or 2).  However, as n  increases,
                      the error of  this technique increases significantly. The mapping from the origi-
                      nal  n-dimensional feature space to  a one-dimensional distance space destroys
                      valuable classification information which existed in  the original feature space.
                      In  order to  understand this phenomena, let  us  study here  the  statistics of  the
                      distance.


                           Distribution of  the  distance:  Let  us  consider a distribution of  X  with
                      the expected vector M and the covariance matrix X. Then, the normalized dis-
                      tance of X from M is
                                                                  n
                                     d2 = (X-M)TX-l(X-M)  = ZTZ = ZZ? ,          (3.51)
                                                                 i=l
                      where  Z = AT(X-M)  and  A  is  the  whitening  transformation.  Since  the
                      expected vector and  covariance matrix of  Z  are 0 and I  respectively, the zi’s
                      are uncorrelated, and E { zi] = 0 and Var(zi ]  = 1.  Thus, the expected value and
                      variance of d2 are

                                  E{d2] =n E{z?] =n                              (3.52)


                                Var{d2] =E{(d2)2] -E2(d2]



                                                                                  (3.53)



                      When the 2;’s are uncorrelated (this is satisfied when the zi’s are independent),
                      and E { 24 ]  is independent of i, the variance of d2 can be further simplified to

                                               Var{d2)  =n y ,                    (3.54)
                      where
   81   82   83   84   85   86   87   88   89   90   91