Page 166 - Robots Androids and Animatrons : 12 Incredible Projects You Can Build
P. 166

The operating distance one can speak from the microphone on the
                                       SRC without shouting is about 1 foot (ft). Because of this, when
                                       using  the  SRC  on  a  mobile  robot  platform,  we  incorporate  two
                                       small walkie-talkies. The output of one walkie-talkie is connected
                                       to the speech input of the SRC. The other walkie-talkie is used to
                                       speak  to  the  robot  via  the  SRC.  This  setup  eliminates  distance
                                       problems and extraneous noise.
                                       Speech  recognition  is  not  speech  understanding.  Just  because  a
                                       computer can respond to a vocal command does not mean it under-
                                       stands the command spoken. Future voice-recognition systems will
                                       have the ability to distinguish nuances and meaning of words, to
                                       “Do what I mean, not what I say!” However, those systems are still
                                       years away.

                               Speaker-dependent and speaker-independent speech recognition

                                       Speech recognition is classified into two categories, speaker de-
                                       pendent  and  speaker  independent.  Speaker-dependent  systems
                                       are trained by the individual who will be using the system. These
                                       systems are capable of achieving a high command count and bet-
                                       ter than 95 percent accuracy for word recognition. The drawback
                                       to this approach is that the system responds accurately only to the  145
                                       individual who trained the system. This is the most common ap-
                                       proach employed in software for personal computers.
                                       A speaker-independent system is trained to respond to a word
                                       regardless of who speaks. Therefore the system must be able to
                                       respond  to  a  large  variety  of  speech  patterns,  inflections,  and
                                       enunciations of the target word. The number of command words is
                                       usually less than for the speaker-dependent system; however, high
                                       accuracy can still be maintained within processing limits. Indus-
                                       trial  requirements  more  often  need  speaker-independent  voice
                                       systems.

                                       Our  SRC  will  be  speaker  dependent.  We  can  build  in  a  little
                                       speaker independency by allocating more than one word space to
                                       a target word and then programming different word enunciations
                                       in the allocated spaces. Each of these word spaces would trigger
                                       the same command.

                               Recognition style
                                       Speech-recognition systems have another constraint concerning
                                       the style of speech they can recognize. There are assumed to be
                                       three styles of speech: isolated, connected, and continuous.


                                                       Team LRN                     Speech-controlled mobile robot
   161   162   163   164   165   166   167   168   169   170   171