Page 266 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 266

6.4 The ROC Curve   247


                                            Decision
                                           A      N
                                    Reality  A  a  b
                                      N
                                            c
                                                  d

           Figure 6.16. The canonical classification matrix for two-class discrimination of an
           abnormal event (A) from the normal event (N).


              From the classification matrix of Figure 6.16, the following parameters are
           defined:

              −  True Positive Ratio  ≡ TPR =  a/(a+b). Also  known as  sensitivity, this
                 parameter tells us how sensitive our decision method is in the detection of
                 the abnormal event. A classification method with high sensitivity will rarely
                 miss the abnormal event when it occurs.

              −  True Negative Ratio  ≡ TNR =  d/(c+d).  Also known as  specificity, this
                 parameter tells us how specific our decision method is in the detection of the
                 abnormal event. A classification method with a high specificity will have a
                 very low rate of  false alarms, caused  by classifying a normal event as
                 abnormal.
              −  False Positive Ratio  ≡ FPR  = c/(c+d) = 1 − specificity.
              −  False Negative Ratio ≡ FNR = b/(a+b) = 1 − sensitivity.

              Both the sensitivity and specificity are usually given in percentages. A decision
           method is considered good if it simultaneously has a high sensitivity (rarely misses
           the abnormal event when it occurs) and a high specificity (has a low false alarm
           rate). The ROC curve depicts the sensitivity versus the FPR (complement of the
           specificity) for every possible decision threshold.

           Example 6.9

           Q: Consider the Programming   dataset (see Appendix E). Determine whether a
           threshold-based  decision  rule  using  attribute  AB,  previous  learning  of  Boolean
                                                     “
           Algebra , has a significant influence deciding the student passing (SCORE ≥ 10) or
                  ”
           flunking  (SCORE <  10) the Programming course,  by visual inspection of the
           respective ROC curve.
           A: Using the Programming   dataset we first establish the following Table 6.8.
           Next, we set the following decision rule for the attribute (feature) AB:

                                                        ≥
              Decide “Pass the Programming examination” if AB  Δ.
   261   262   263   264   265   266   267   268   269   270   271