Page 40 - Human Inspired Dexterity in Robotic Manipulation
P. 40

36    Human Inspired Dexterity in Robotic Manipulation


          the current trial. Specifically, A ¼ [A, 0; 0, 1] and B ¼ [B, 0] for context A,
          or A ¼ [1, 0; 0, A] and B ¼ [0, B] for context B. Eq. (3.2) assumes that neg-
          ligible generalization occurs across contexts during learning, which has been
          demonstrated in the TF group. It assumes that the “retention rate” A is also
          context dependent. That is, no decay of the inactive context should occur,
          which is a phenomenon that is supported by our experiment showing long-
          term recall of the learned context (RT groups). Both of these assumptions
          are consistent with the finding that the error-based update of the internal
          model is context dependent [37]. Similar to the DRMC model, we also
                                                T
          set the initial value of x(1) to (180,  180) to match the data-revealing
          subjects’ ability to visually initialize the internal representation.
             The novel feature of our DPNI model is the use-dependent memory
          that generates a nonlinear bias to the motor output in the subsequent trial.
          Specifically, we defined the use-dependent memory u(n) as a simple aver-
          aging process of previous manipulations:


                                    ð
                          un +1Þ ¼ 1 CÞ•un ðÞ + C•T target n ðÞ        (3.3)
                           ð
          where C denotes the contribution of the previous trial in the averaging pro-
          cess. This equation is conceptually similar to the use-dependent learning
          model [16]. However, unlike reaching movements, which are single ballistic
          motor actions, our task (and other object-manipulation tasks) essentially
          consists of two phases: a lifting phase and a 2-s holding phase. While the
          maximum error occurred during lifting, subjects had to consistently exert
          the compensatory torque corresponding to the T target during the holding
          phase to maintain zero-object roll based on real-time sensory feedback.
          It has been demonstrated that grip-force bias can be generated with only
          squeezing objects without lifting them [25]. Therefore, we use the T target
          to build up the use-dependent memory. The same study also demonstrated
          that such use-dependent memory was possibly independent from the con-
          text of the task. Therefore, we assume use-dependent memory to be the
          source of context-independent bias that generates the interference in our
          experimental paradigm [31].
             Most importantly, we think that the bias caused by use-dependent mem-
          ory is also a function of the internal state x that is actively engaged for the
          context in the current trial. Specifically, this bias is small when the direction
          of the use-dependent memory of the torque is the same as the engaged
          internal state, whereas the bias is large when they have opposite directions.
   35   36   37   38   39   40   41   42   43   44   45