Page 158 -
P. 158

Exercises   145


                              4.22 Perform a resubstitution  and leave-one-out estimation of the classification  errors for the
                                  three  classes  of  cork  stoppers,  using  the  features  obtained  by  dynamic  selection.
                                  Discuss the reliability  of these estimates.

                              4.23 Compute  the  95%  confidence  interval  of  the  error  for  the  classifier  designed  in
                                  Exercise 4.1 1, using the standard formula. Perform a partition method evaluation of the
                                  classifier, with  10 partitions, obtaining another estimate of the 95% confidence interval
                                  of the error.
                              4.24 Compute the decrease of impurity  in the trees  shown in  Figure 4.41 and Figure 4.45,
                                  using the entropy as impurity measure.
                              4.25 Compute the classification matrix car vs. {mas, gla, fad} for the Breast  Tissue dataset
                                  in the tree shown in Figure 4.41. Observe its dependence on the prevalences. Compute
                                  the linear discriminant shown in the same figure.

                              4.26 Using the CART approach, find decision trees that discriminate the three classes of the
                                  CTG  dataset,  N,  S  and  P,  using  several  initial  feature  sets  that  contain  the  four
                                  variability  indexes  ASTV,  ALTV,  MSTV,  MLTV.  Compare  search  times  and
                                  classification performances for the several initial feature sets.
   153   154   155   156   157   158   159   160   161   162   163