Page 255 - Mechatronics for Safety, Security and Dependability in a New Era

P. 255

Ch48-I044963.fm Page 239 Tuesday, August 1, 2006 4:04 PM
Tuesday, August
Page 239
1, 2006
4:04 PM
Ch48-I044963.fm
239
239

Temporal filter Fm
(a) Feature extractors (b) Tasks in which the robot learned feature extractors

Figure 2: Feature extractors which are generated in the past tasks

EXPERIMENT
Experimental Setting

We used a small mobile robot which is about 40 cm high and has a camera with a fixed orientation to
look ahead at the floor. The task is to move along a given path to a destination. The size of I o and I f
in pixels is 64 x 54 and that of I c is 8 x 6. We defined the dimension of a substate as m = 1. The
robot can move at a translational speed v and a steering speed a> independently. To reduce the
computation cost, we discretized the state and action space and calculated the probabilities. We set the
histoiy length to h = 10 . The thresholds are set to H lh = 0.4 and C lh = 0.8h.

The robot was given three feature extractors shown in Figure 2. Fs,Fc, and Fm are generated in
tasks A, B, and C, respectively. The feature extractors have the following characteristics.
• 3 x 3 spatial filter Fs: This type of filter calculates sum of weighted brightness values of nine
neighbouring pixels. The generated filter emphasizes and inhibits horizontal edge.
• Color filter Fc : This type of filter calculates sum of weighted red, green, and blue. The generated
filter inhibits red.
• Spatial filter Fm : This type of filter calculates sum of weighted past five images. The generated
filter emphasizes the current image and inhibits the past image.

Feature Extractor Selection
The task given to the robot is shown in Figure 3 (a). The robot moves to the front of the door and waits
for it to open. It moves to the destination after the door opens. The environment is the same as that of
tasks A, B, and C. We gave three episodes of successful instances (L = 234,254,233). After learning,
the robot divided all instances into 13 subsets using ISODATA algorithm.

Figure 3(b) shows the learned behavior and Figure 3(c) shows the selected feature extractors at each
time step. The selected feature extractors differ depending on the situation. The average number of
selected feature extractors per step is 1.57. Figure 3(d) shows the selected subsets of instances at each
step. When the robot could not choose an action from the selected subsets because of low reliability, it
used all instances to decide again. 9) in the figure shows the step when the robot could choose an
action from the selected subsets. It is verified that the robot accomplishes the task while selecting

250 251 252 253 254 255 256 257 258 259 260