Page 141 - Rapid Learning in Robotics
P. 141

9.2 “Investment Learning” or “Mixture-of-Expertise” Architecture                        127


                 9.2 “Investment Learning” or “Mixture-of-Expertise”

                         Architecture


                 Here, we approach a solution in a modular way and suggest to split learn-
                 ing structurally and temporally: the structural split is implemented at the
                 level of the learning moduls:

                       the T-BOX;


                       the META-BOX, which has the responsibility for providing the map-
                       ping between the context information  c to the weight or parameter
                       set  .

                 The temporal split is implemented at the learning itself:


                       The first, the investment learning stage may be slow and has the task
                       to pre-structure the system for

                       the one-shot adaptation phase, in which the specialization to a par-
                       ticular solution (within the chosen domain) can be achieved extremely
                       rapidly.

                 These two stages are described next.



                 9.2.1 Investment Learning Phase



                                Prototypical
                                Context                     (2)  ω
                                              Meta-Box
                                                                 parameters
                                 c    (2)                        or  weights
                                                                              X
                                                                               2
                                      X 1                    T-Box
                                                (1)                        (1)


                                    Figure 9.2: The Investment Learning Phase.



                 In the investment learning phase a set of prototypical context situations is ex-
                 perienced: in each context j the T-BOX is trained and the appropriate set of
   136   137   138   139   140   141   142   143   144   145   146