Page 827 - Mechanical Engineers' Handbook (Volume 2)
P. 827

818   Neural Networks in Feedback Control Systems

                          applications in closed-loop control are fundamentally different from open-loop applications
                          such as classification and image processing. The basic multilayer NN tuning strategy is
                          backpropagation. 17  Basic problems that had to be addressed for closed-loop NN control 33,44
                          included weight initialization for feedback stability, determining the gradients needed for
                          backpropagation tuning, determining what to backpropagate, obviating the need for prelim-
                          inary off-line tuning, modifying backprop so that it tunes the weights forward through time,
                          and providing efficient computer code for implementation. These issues have since been
                          addressed by many approaches.
                             Initial work in NN was for system identification and identification-based indirect control.
                          In closed-loop control applications, it is necessary to show the stability of the tracking error
                          as well as boundedness of the NN weight estimation errors. Proofs for internal stability,
                          bounded NN weights (e.g., bounded control signals), guaranteed tracking performance, and
                          robustness were absent in early works. Uncertainty as to how to initialize the NN weights
                          led to the necessity for ‘‘preliminary off-line tuning.’’ Work on off-line learning was for-
                                         52
                          malized by Kawato. Off-line learning can yield important structural information.
                             Subsequent work in NNs for control addressed closed-loop system structure and stability
                                                                   54
                          issues. Work by Sussmann and Albertini and Sontag was important in determining system
                                              53
                          properties of NNs (e.g., minimality and uniqueness of the ideal NN weights, observability
                          of dynamic NNs). The seminal work of Narendra and Parthasarathy 9,10  had an emphasis on
                          finding the gradients needed for backprop tuning in feedback systems, which, when the plant
                          dynamics are included, become recurrent nets. In recurrent nets, these gradients themselves
                                                                                       55
                          satisfy difference or differential equations, so they are difficult to find. Sadegh showed that
                          knowing an approximate plant Jacobian is often good enough to guarantee suitable closed-
                          loop performance.
                             The approximation properties of NN 2,3  are basic to their feedback controls applications.
                          Based on this and analysis of the error dynamics, various modifications to backprop were
                          presented that guaranteed closed-loop stability as well as weight error boundedness. These
                          are akin to terms added in adaptive control to make algorithms robust to high-frequency
                          unmodeled dynamics. Sanner and Slotine used radial basis functions in control and showed
                                                          5
                          how to select the NN basis functions, Polycarpou and Ioannou 56,57  used a projection method
                          for weight updates, and Lewis and Syrmos 37  used backprop with an e-modification term. 18
                          All this work used NNs that are linear in the unknown parameter. In linear NNs, the problem
                                                                                         5
                          is relegated to determining activation functions that form a basis set (e.g., RBF and func-
                                                                                           5
                                                           55
                          tional link programmable network (FLPN) ). It was shown by Sanner and Slotine how to
                          systematically derive stable NN controllers using approximation theory and basis functions.
                          Barron 58  has shown that using NNs that are linear in the tunable parameters gives a funda-
                          mental limitation of the approximation accuracy to the order of 1/L 2/ n , where L is the number
                          of hidden layer neurons and n is the number of inputs. Nonlinear-in-the-parameters NNs
                          overcome this difficulty and were first used by Chen and Khalil, who used backprop with
                                                                            59
                          deadzone weight tuning, and Lewis et al., who used Narendra’s e-modification term in the
                                                          60
                          backprop. In nonlinear-in-the-parameters NNs, the basis is automatically selected online by
                          tuning the first-layer weights and thresholds. Multilayer NNs were rigorously used for
                                                                61
                                                                          62
                          discrete-time control by Jagannathan and Lewis. Polycarpou derived NN controllers that
                          do not assume known bounds on the ideal weights. Dynamic/recurrent NNs were used for
                          control by Rovithakis and Christodoulou, 63  Poznyak, 64  Rovithakis, 65  who considered multi-
                          plicative disturbances; Zhang and Wang, 66  and others.
                             Most stability results on NN control have been local in nature, and global stability has
                                               67
                          been treated by Kwan et al., and others. Recently, NN control has been used in conjunction
                          with other control approaches to extend the class of systems that yields to nonparametric
                          control. Calise and coworkers 30,68,69  used NNs in conjunction with dynamic inversion to
   822   823   824   825   826   827   828   829   830   831   832