Page 350 - Introduction to Statistical Pattern Recognition
P. 350

332                        Introduction to Statistical Pattern Recognition



                      This option makes no assumptions about the densities of  the data or the shape
                      of the kernel function.  However, since the value of the threshold is customized
                      to the data being tested, using this option will consistently bias the results low.
                      This is not objectionable in the case of R errors, since the R error is used as a
                      lower  bound  of  the  Bayes  error.  However,  using  this  procedure can  give
                      erroneous results for the L error.  Options 3 and 4 are designed to alleviate this
                      problem.

                      Oprion 3:  For each value of r, find the value of t which minimizes the R error,
                      and then use this value oft to find the L error.  Since the selection of  the thres-
                      hold has been  isolated from the actual values of  the L  estimates of  the likeli-
                      hood  ratio, using  this method does in  fact help reduce the bias encountered in
                      Option 2.  Experimental results will show that this method does give reliable
                      results as long as r  is relatively large.  When r  is small, however, the L esti-
                      mates of  the likelihood ratio are heavily biased as is seen in Fig. 7-9(b), and
                      use  of  these estimates to determine the threshold  may  give far from  optimal
                      results.  An  advantage of  this  option  is  that  it requires no more  computation
                      time than Option 2.


                      Option 4: Under this option, the R  error is found exactly as in  Option 2,  by
                      finding the value oft which minimizes the R error, and using this error rate.  In
                      order to find the L error, we use an L procedure to determine the value of  t to
                      use  for each sample.  Hence, under Option 4,  we  use  a different threshold to
                      test  each  of  the  Nl+N2  samples, determining the  threshold  for  each  sample
                      from the other NI+N2-I  samples in the design set.  The exact procedure is as
                      follows.
                      (1)   Find the L density estimates at all samples,






                                                           i#r                    (7.57)


                      (2)  To test sample Xf):
                           (a)   Modify the density estimates by  removing the effect of  Xf) from
                                all estimates
   345   346   347   348   349   350   351   352   353   354   355