Page 81 - Introduction to Statistical Pattern Recognition
P. 81

3  Hypothesis Testing                                          63


                         If we select the special set of cost functions

                                        cI1 =c22  and   c12=c21,                (3.37)
                    (3.36) becomes

                                                                                (3.38)
                                         I, Pl(X)dX =I P2(X)dX '
                                                     L,
                    That is,  the decision  boundary  is  still determined by  the  likelihood ratio, but
                    the threshold is selected to satisfy el = e2.


                    Operating Characteristics

                         So far, we have found that the likelihood ratio test is commonly used for
                    various tests, and  only  the  selection of  the threshold varies depending on the
                    test.  Extending this,  it  is  a common  practice  to  plot  the  relation between el
                    and    by  changing  the  value  of  the  threshold  continuously.  This  curve  is
                    called  the  operating  characteristic  [5].  Figure  5  shows  an  example  of  the
                    operating characteristics where el  and  1-e2  are used  for  the x- and y-axes in
                    log  scale.  Three  curves  in  Fig.  3-5  show  the  performance of  the  likelihood
                    ratio test for 30, 20, and 9 features which are selected from the same data set.
                    They  indicate that  30 and 20 features give almost identical performance for a
                    wide  range of  operating points, while 9 features give much poor performance.
                    From  such curves, the designer of  the decision rule can select a proper operat-
                    ing point and the corresponding threshold, depending on one's need.

                         Burdick's chart: Various combinations of  log and linear scales are used
                    for operating characteristics.  However, the following scale gives a straight line
                    when h (X) of  (3.5) is normally distributed for both o1 and o2 [6].
                         Let @(a) be a normal error function defined by


                                                                                (3.39)

                    If  h  is  distributed as Nh(rn,,o:) for  o1 and  Nh(m2,0:) for  w2, and  r  is  the
                    value of the threshold as shown in Fig. 3-6, then
                                                               t-m2
                                                  and  e2 =@ [T]                (3.40)
                                                                      .
                    Or, taking the inverse operation,
   76   77   78   79   80   81   82   83   84   85   86