Page 191 -
P. 191

5.5 Multi-Laver Percevtrons   179



















                                                                                   Train
                                        0.0
                                         o   500  1000  1500  2000  2500  3000  3500  4000 - Verify
                                                        Epoch
                            Figure  5.26.  Learning  curve  for  the  cork  stoppers  data  (3  classes)  with  a
                            MLP7:5:3. Notice the error degradation for the verification set after approximately
                            500 epochs.



                              Choosing the best neural net solution can be quite a tedious job  since one has to
                            try  several  architectures  and  starting  conditions.  As  previously  mentioned,
                            Statistics  has  a  helpful  IPS  tool  that  allows  the  user  to  perform  a  series  of
                            experiments and retain the best solutions, which are displayed at the end in a sorted
                            way. Using this tool it was possible to find an MLP2:2:3 solution with features N
                            and PRT and a much lower number of weights (15), which after training with 200
                            epochs  performed  similarly.  This  is  shown  in  Table  5.5  for  the  training,
                            verification  and  test  sets.  In  this  and  other  examples  the  logistic  sigmoid  was
                            always used  as activation function. Selection of  cases for training, validation and
                            test sets is always performed randomly.




                            Table 5.5.  MLP2:2:3 classification matrices of three classes of cork stoppers. True
                            classification along the rows; predicted classification along the columns.
   186   187   188   189   190   191   192   193   194   195   196