Page 227 -
P. 227

5.10 Support Vector Machines   2 15








                                        4000

                                        3000

                                        2000

                                        1000

                                          0
                                         Case 1     Case 101   Case 201   Case 301    Case 401
                                               Case 51    Case 151   Case 251   Case 351
                                  Figure  5.43.  Prediction  of  foetal  weight  using  an  RBF4:4:1  network  with
                                Gaussian kernel and k-means centroid adjustment (compare with Figure 5.40).




                                5.10  Support Vector Machines


                                Support vector machines (SVM) is a distinctive approach to pattern classification
                                and  regression,  since  it  tackles  the  principle  of  structural  risk  minimization,
                                described in  section  5.6.5,  in  a  special way.  As  a consequence,  support  vector
                                machines  can  provide  a  good  generalization  performance  independent  of  the
                                distributions of the patterns.
                                  The central idea of SVM is the adjustment of a discriminating function so that it
                                optimally uses the separability information of the boundary patterns. Let us first
                                assume  a linear  discriminating  function  and  two  linearly  separable  classes  with
                                target values +1 and -1.  A discriminating hyperplane will satisfy:






                                 or (perceptron rule),




                                   Taking  into  account  the  hyperplane  properties  mentioned  in  section  2.1,  the
                                 distance of  any  point  x;  to a hyperplane  is precisely  Iwlx,-twollllwII, as shown in
                                 Figure 5.44. In particular, the distance to the origin is simply Iwol/llwII.
                                   Given a hyperplane, the distance of the closest pattern to it is called the margin
                                 of  separation. The  SVM  approach,  in  its  simplest  linear  version,  consists  of
   222   223   224   225   226   227   228   229   230   231   232