Page 41 - Human Inspired Dexterity in Robotic Manipulation
P. 41

Sensorimotor Learning of Dexterous Manipulation  37


              To generate motor output in trial n, the sensorimotor system had to first
              generate a plan based on visual context cues:
                                                 T
                                      x c nðÞ ¼ x nðÞ c nðÞ               (3.4)
                                                                 T
                                                                          T
              where c(n) is a selection vector that takes the value of [1,0] or [0,1] for
              context A and B. The final motor output, however, is biased due to a non-
              linear interaction between two internal states x c and u:

                                     ynðÞ ¼ x c nðÞ + bias nðÞ            (3.5)
                         bias nðÞ ¼ D•unðÞ= 1 + exp sign unðÞ•E•x c nðÞððð  ÞÞÞ  (3.6)
              where sign(u) is 1 and  1if u is positive and negative, respectively. Although
              Eq. (3.6) seems to be arbitrary, it effectively captures the nonlinear combi-
              nation of two sensorimotor processes with only two parameters (D and E).
              Note that two parameters are necessary to account for such a context-
              dependent combination of the two processes. In fact, Eq. (3.6) is essentially
              a sigmoid function whose shape (both magnitude and direction) is modu-
              lated by u. A positive u generates a small- and large-positive bias to a positive
              and negative x c , respectively, whereas a negative u generates a small- and
              large-negative bias to negative and positive x c , respectively.
                 Finally, to model the effect of the break duration, we again assume that
              subjects can well retain the context-dependent memory component, and
              that the context-independent and use-dependent memory decays exponen-
              tially, such that the half-life of the decay is F•ln(2)

                                     un + tÞ ¼ unðÞ•e  t=F                (3.7)
                                      ð
                 We used a nonlinear optimization procedure in Matlab (“fmincon”) to
              estimate the six parameters (A, B, C, D, E, F) of our DPNI model. This
              procedure minimizes a mean-squared error between the output of the
              model and experimental data (T com ) from selected trials in multiple groups
              (Rndm, Ctrl, and IF). The mean T com averaged within each of these trials
              was used because the data from individual subjects was too noisy to obtain
              reliable fits. Confidence intervals for parameter estimates were calculated
              using a boot-strap procedure [5,13] that resampled the experimental data
              with replacement data to obtain 1000 boot-strap data sets. The model
              was fitted separately to the mean T com of each of these data sets. The
              95% confidence intervals were calculated as the 2.5 and 97.5 percentile
              values from the distribution for each parameter obtained across the 1000
              individual fits.
   36   37   38   39   40   41   42   43   44   45   46