Page 139 -
P. 139

126    4 Statistical Classification


                          of  the  two  closest  classes  as  a  merit  criterion.  Furthermore,  features  were  only
                          entered  or removed  from  the  selected set if  they  contributed  significantly  to  the
                          Anova  F.  The  solution  corresponding  to  Figure  4.37  used  a  5%  level  for  the
                          statistical significance of  a candidate feature to enter the selected set and  10% to
                          remove it. Notice that PRT, which had entered at step 1, was later removed, at step
                          5. The nested solution {PRM, N, ARTG, RAAR) would not have been found by a
                          direct forward search.


                          4.5  Classifier Evaluation


                          The determination  of  reliable  estimates of  a classifier error rate  is obviously an
                          essential  task  in order to assess  its usefulness and to compare  it with  alternative
                           solutions.
                             As explained in section 4.2.3 design set estimates are on average optimistic and
                           the same can be said about using an error formula such as (4-25), when true means
                           and covariance  are replaced by  their sample estimates. It is, therefore, mandatory
                           that  the classifier be empirically tested, using  a test set of  independent cases. As
                           mentioned  already  in  section  4.2.3,  these  test  set  estimates  are  on  average
                           pessimistic.
                             We describe  in  the  following  the  influence  of  the  finite  sample sizes  of  the
                           design and test sets on the classifier performance. For this purpose, we consider a
                           two-class classifier with Bayes error:




                             The  influence  of  the  finite  sample  sizes  can  be  summarized  as  follows  (for
                           details, consult Fukunaga,  1990).

                           Influence of finite test set

                           Let Pe,(n) be the test set estimate, influenced only by  the finiteness of the test set,
                           and  consider  the  ensemble  average  of  all  such  estimates,  E[Pe,(n)],  of  a given
                           classifier  with  Bayes  error Pe. The expectation E[Pe,(n)]  can  be  computed  with
                           arbitrarily  large  accuracy  for  a  growing  number  of  these  estimates,  with
                           independent sets of size n. The following results for the expectation and variance
                           are verified:









                             Therefore,  test  set  estimates  are  unbiased,  but  have  a  variance  inversely
                           proportional to the number of test samples (n, for w, and n2 for q).
   134   135   136   137   138   139   140   141   142   143   144