Page 131 - Dynamic Vision for Perception and Control of Motion
P. 131

4.1  Structuring of Application Domains      115


            bases and algorithms for recognizing these items; they have to be swapped when
            entering a zone with new regulations.
              In section Appendix A.3 the different types of vehicles are listed. They have to
            be recognized and treated according to their form (shape), appearance and function
            of the vehicle (Appendix A.4). This type of structuring may not seem systematic at
            first glance. There is, of course, one column like A.4 for each type of vehicle under
            A.3. Since this book concentrates on the most common wheeled vehicles (cars and
            trucks), only these types are discussed in more detail here. Geometric size and 3-D
            shape (Appendix A.4.1) have been treated to some extent in Section 2.2.3 and will
            be revisited for recognition in Chapters 7 to 10.
              Subpart hierarchies (Appendix A.4.2) are  only partially  needed  for vehicles
            driving, but  when  standing, open doors  and hoods may yield  quite different ap-
            pearances of the same vehicle. The property of glass with respect to mirroring of
            light rays has a fundamental effect on features detected in these regions. Driving
            through an environment with tall buildings and trees at the side or with branches
            partially over the road may lead to strongly varying features on the glass surfaces
            of the  vehicle, which have nothing to  do  with the vehicle itself. These regions
            should, therefore, be discarded  for vehicle recognition,  in general. On the other
            hand, with low light levels in the environment, the glass surfaces of the lighting
            elements on the front and rear of the vehicle (or even highlights on windscreens)
            may be the only parts discernible well and moving in conjunction; under these en-
            vironmental conditions, these groups are sufficient indication for assuming a vehi-
            cle at the location observed.
              Variability of image shape over time depending on the 3-D aspect conditions of
            the 3-D object “vehicle” (Appendix A.3) is important knowledge for recognizing
            and tracking vehicles. When machine vision was started in the second half of the
            last century, some researchers called the appearance or disappearance of features
            due to self-occlusion a “catastrophic event” because the structure of their (insuffi-
            cient) algorithm with fixed  feature arrangements changed.  In the 4-D  approach
            where objects and aspect conditions are represented as in reality and where tempo-
            ral changes also are systematically represented by motion models, there is nothing
            exciting with the appearance of new or disappearance of previously stable features.
            It has been found rather early that whenever the aspect conditions bring two fea-
            tures close to each other so that they may be confused (wrong feature correspon-
            dence), it is better to discard these features altogether and to try to find unambigu-
            ous ones  [Wünsche 1987]. The recursive estimation process to be  discussed in
            Chapter 6 will be perturbed by wrong feature correspondence to a larger extent
            than by using slightly less  well-suited, but unambiguous features.  Grouping re-
            gimes of aspect conditions with the same highly recognizable set of features into
            classes is important knowledge for hypothesis generation and tracking of objects.
            When  detecting  new feature sets in a task domain, it  may be  necessary to start
            more than one object hypothesis for fast recognition of the object observed. Such
            4-D object hypotheses allow predicting other features which should be easily visi-
            ble; in case they cannot be found in the next few images, the hypothesis can be dis-
            carded immediately. An early jump to several 4-D hypotheses thus has advantages
            over too many feature combinations before daring an object hypothesis (known as
            a combinatorial explosion in the vision literature).
   126   127   128   129   130   131   132   133   134   135   136