Page 292 - Introduction to Statistical Pattern Recognition
P. 292

274                        Introduction to Statistical Pattern Recognition


                            The optimal k, k*, may be found by minimizing the mean-square error of
                       (6.94).  That is, solving aMSE lak = 0 for k yields [5]




                                        L       J


                                                                                   (6.95)



                       As  in the Parzen case, the optimal k  is a function of X. Equation (6.95) indi-
                       cates that k* is invariant under any non-singular transformation.  That is,

                                                 k*(Z) = k*(X) .                   (6.96)

                       Also, k* and I-*  of (6.36) are related by


                                                                                   (6.97)


                       This indicates that both  the Parzen and kNN  density estimates become optimal
                       in the same local range of  L(X).  The resulting mean-square error is obtained
                       by  substituting (6.95) into (6.94).


                                                                             4
                                            -
                               MSE* { &X)} =                                    .   (6.98)
                                                            IA 1''

                       Note  that  (6.98) and  (6.38)  are identical.  That  is, both  the  Parzen  (with the
                       uniform kernel) and kNN  density estimates produce the same optimal MSE.
                            The  globally  optimal  k  may  be obtained  by  minimizing  the  integral
                       mean-square error criterion.  From (6.94), with a fixed k,

                                  I           1
                                                                         .
                                              4
                                                                                   (6.99)
                          IMSE = -Jp2(X)dX  + -~~'"(~)~'~'~~~(X)p~~'"(X)dx
                                  k
                                                    N
   287   288   289   290   291   292   293   294   295   296   297