Page 172 - Socially Intelligent Agents Creating Relationships with Computers and Robots
P. 172

Designing Sociable Machines                                      155

                              quality of the interaction improves. Furthermore, many of these social cues
                              will eventually be offered in the context of teaching the robot. To be able to
                              take advantage of this scaffolding, the robot must be able to correctly interpret
                              and react to these social cues. There are two cases where the robot can read
                              the human’s social cues.
                                The first is the ability to recognize praise, prohibition, soothing, and atten-
                              tional bids from robot-directed speech [9, 2]. This could serve as an important
                              teaching cue for reinforcing and shaping the robot’s behavior. Several inter-
                              esting interactions have been witnessed between Kismet and human subjects
                              when Kismet recognizes and expressively responds to their tone of voice. They
                              use Kismet’s facial expression and body posture to determine when Kismet
                              “understood” their intent. The video of these interactions suggests evidence of
                              affective feedback where the subject might issue an intent (say, an attentional
                              bid), the robot responds expressively (perking its ears, leaning forward, and
                              rounding its lips), and then the subject immediately responds in kind (perhaps
                              by saying, “Oh!” or, “Ah!”). Several subjects appeared to empathize with
                              the robot after issuing a prohibition—often reporting feeling guilty or bad for
                              scolding the robot and making it “sad.”
                                The second is the ability of humans to direct Kismet’s attention using natural
                              cues [1]. This could play an important role in socially situated learning by
                              giving the caregiver a way of showing Kismet what is important for the task,
                              and for establishing a shared reference. We have found that it is important for
                              the robot’s attention system to be tuned to the attention system of humans. It is
                              important that both human and robot find the same types of stimuli salient in
                              similar conditions. Kismet has a set of perceptual biases based on the human
                              pre-attentive visual system. In this way, both robot and humans are more likely
                              to find the same sorts of things interesting or attention-grabbing. As a result,
                              people can very naturally and quickly direct the robot’s attention by bringing
                              the target close and in front of the robot’s face, shaking the object of interest, or
                              moving it slowly across the centerline of the robot’s face. Each of these cues
                              increases the saliency of a stimulus by making it appear larger in the visual
                              field, or by supplementing the color or skin-tone cue with motion. Kismet’s
                              attention system coupled with gaze direction provides people with a powerful
                              and intuitive social cue for when they have succeeded in steering the robot’s
                              interest.

                              3.     Summary

                                In this chapter, we have outlined a set of four core design issues that have
                              guided our work in building Kismet. When engaging another socially, humans
                              bring a complex set of well-established social machinery to the interaction.
                              Our aim is not a matter of re-engineering the human side of the equation to suit
   167   168   169   170   171   172   173   174   175   176   177