Page 43 - Rapid Learning in Robotics
P. 43

3.3 Learning as Approximation Problem                                                    29


                     Usually, the process of learning is based on a certain amount of apriori
                 knowledge and a set of training examples. Its goal is two-fold:


                       the learner should be able to recognize the re-occurance of a previ-
                       ously seen situation (stimuli or input) and associate the correct an-
                       swer (response or output) as learned before;


                       in new, previously un-experienced situations, the learner should gen-
                       eralize its knowledge and infer the answer appropriately.

                     The primary problem of learning is to find an appropriate representa-
                 tion of the learned knowledge or skill: its input (stimuli) and output (re-
                 sponses). The reason is rather simple and fundamental: no system can
                 learn, what it cannot represent.
                     We can distinguish three basic types of task describing variables x:

                    1. the orderable, continuous valued representations (e.g. length, speed,
                       temperature, force) with a defined order relation (x i      j ) and an ex-      x
                       isting distance metric jx i   x j j;

                    2. a periodic or circular variable representation (e.g. azimuth angles,
                       hour of the day) with defined distance, but without a clear order
                       relation (“wrap-around”; is Monday before or after Saturday?);


                    3. the symbolic and categorical representation x           c        k g, like           c           f
                       cities, attributes and generally names. Here, nor an order, neither a
                       distance relation between x i and x j does exist, just the binary equiv-
                       alence relation is defined (x i   x j ,or x i    x j ).


                 Sometimes variables are represented depending on the context, e.g. “red”
                 color may get coded as category, as circular hue value, or as orderable
                 location in the color triangle.
                     A desired skill can be modeled as the mapping between an input space
                                                                              in
                 and an output space. The input space (later notated X ) captures all
                 relevant system (observable and non-observed) variables. The problem of
                 learning such a mapping is then equivalent to the problem of synthesizing
                 an associative memory (Poggio and Girosi 1990). The appropriate output
                 is retrieved when the input is presented and the system generalizes when
                 a new input is presented. Different frameworks of this problem depend
                 strongly on the type of input and output variables.
   38   39   40   41   42   43   44   45   46   47   48