Page 48 - Human Inspired Dexterity in Robotic Manipulation
P. 48

44    Human Inspired Dexterity in Robotic Manipulation


          process quickly generates a short-term “bias” through the performance of
          manipulation within only a few trials (one to three) that competes with
          the internal representation.
             We first rule out models that suggest that memory of manipulation in the
          first context is fragile and learning of the second context would overwrite
          the previously learned manipulation in the first context (i.e., unlearned).
          Such unlearning of the first context during learning of the second context
          has been proposed to be due to either a single learning process [13],orto
          be insensitive to contextual cues [5,34]. Specifically, these models predict
          that a break given after the AB paradigm would not reduce the interference
          on the recall of context A. We then considered the dual-rate multiple
          context model [14], which supports the protection of a learned context
          by assuming a context-independent fast process and a context-dependent
          slow process. This model can successfully explain anterograde interference
          as the fast process cannot be switched to the opposite direction on the first
          trial following a context switch, therefore competing with the slow process
          which is sensitive to contextual cues. However, we found that it is challeng-
          ing for the DRMC model to account for our data from both the IF and RT
          groups, although the DRMC model could reasonably predict the behavior
          of Ctrl and Rndm groups. Specifically, the output of the DRMC model
          results from the linear summation of the slow and fast processes, therefore
          the decay of the fast component would cause a decrease of the interference
          for the IF group, as well as a decrease of retention for the RT groups in a
          similar fashion. This contradicts our data showing that the longer break
          duration only significantly reduced the interference, but not the retention.
          In fact, other models, such as the use-dependent learning model from [16],
          that use linear combination of multiple sensorimotor processes when gen-
          erating total motor output would have come to the same conclusion.
             To better account for our data, here we propose a new model that
          combines an error-based learning process and a use-dependent memory
          (Eqs. 3.1–3.7) with six free parameters (A, B, C, D, E, F), which were deter-
          mined using nonlinear optimization with data from the Rndm group, Ctrl
          group, as well as the IF groups. By implementing the boot-strap technique,
          we estimated parameters with 95% confidence intervals as follows:
          A ¼ 0.9771 (0.9321, 0.9891), B ¼ 0.2613 (0.1848, 0.4435), C ¼ 0.4865
          (0.2746, 0.7212), D ¼ 0.5513 (0.4619, 0.6410), E ¼ 0.0057 (0.0031,
          0.0097), and F ¼ 13.37 (7.21, 28.19).
             The new model also correctly predicted the data from the RT and TF
          groups which were not used in the parameter search. Specifically, the new
   43   44   45   46   47   48   49   50   51   52   53