Page 257 - Introduction to Statistical Pattern Recognition
P. 257

5  Parameter Estimation                                      239



                                               TABLE 5-11

                                   EFFECT OF REMOVING ONE SAMPLE


                                                           Bias between errors with
                                                          and without removing Y (%)
                                          Error without
                          Case           removing Y (%)   d2 =n  d2 =2n  I  d2 =3n
                                    24        20.18       0.689    0.769    0.762
                           1-1      40        15.61       0.21  1   0.279   0.274
                        (E = 10%)   80        12.04       0.035    0.027    0.018
                                    160       11.04       0.010    0.014    0.013
                                    320       10.53       0.006    0.009    0.01  1
                                              23.53       1.213    1.45 1   1.356
                           1-41               16.19       0.423    0.619    0.658
                         (E  = 9%)            11.79       0.060    0.001  t   0.09 1   0.083
                                              10.32       0.005    0.014    0.013
                                              9.52        0.006    0.012    0.015
                                              5.58        0.555    0.664    0.673
                           I-A                3.70        0.088    0.110    0.103
                        (E = 1.9%)            2.54        0.007    0.008    0.003
                                              2.25        0.000    0.001    0.00 1
                                              2.08        0.000             0.00 1



                     is available, say SI, from which  we  wish  to learn as much  about the statistical
                     properties as possible that these S, ’s may have.
                         One  possible  way  of  doing  this  is  to  generate t  artificial  sample  sets
                     S;,  , . . . ,S;  from  S1,  and  study  the  statistical  properties  of  these  S;,’s
                     (i = 1, . . . ,t), hoping that these statistical properties are close to the ones of the
                     S,’s (i = 1, . . . ,T).  This  technique  is  called  the  bootstrap  method,  and  the
                     artificial samples are called the bootstrap  samples  [ 151.  The bootstrap method
                     may  be  applied to many  estimation problems.  However, in this book, we  dis-
                     cuss this technique only for the estimation of classification errors.
   252   253   254   255   256   257   258   259   260   261   262