Page 148 - Introduction to Autonomous Mobile Robots
P. 148

Perception

                                  x l   y l            y r                                   133
                                 r ---- +  r ---- +  r 23   z' +  r 02  =  ----z' r        (4.33)
                                                 l
                                       22
                                  21
                                         f
                                   f
                                                         f
                                  x l   y l  
                                 r ---- +  r ---- +  r 33   z' +  r 03  =  z' r            (4.34)
                                  31
                                       32
                                                 l
                                   f
                                         f
                                                                            y'
                                                                      x'
                             The same process can be used to identify values for   and  , yielding complete infor-
                           mation about the position of point  . However, using the above equations requires us to
                                                       P
                           have identified conjugate pairs in the left and right camera images: image points that orig-
                                                   P
                           inate at the same object point   in the scene. This fundamental challenge, identifying the
                           conjugate pairs and thereby recovering disparity, is the correspondence problem. Intu-
                           itively, the problem is, given two images of the same scene from different perspectives,
                           how can we identify the same object points in both images? For every such identified object
                           point, we will be able to recover its 3D position in the scene.
                             The correspondence problem, or the problem of matching the same object in two differ-
                           ent inputs, has been one of the most challenging problems in the computer vision field and
                           the artificial intelligence fields. The basic approach in nearly all proposed solutions
                           involves converting each image in order to create more stable and more information-rich
                           data. With more reliable data in hand, stereo algorithms search for the best conjugate pairs
                           representing as many of the images’ pixels as possible.
                             The search process is well understood, but the quality of the resulting depth maps
                           depends heavily upon the way in which images are treated to reduce noise and improve sta-
                           bility. This has been the chief technology driver in stereo vision algorithms, and one par-
                           ticular method has become widely used in commercially available systems.

                           The zero crossings of Laplacian of Gaussian (ZLoG). ZLoG is a strategy for identify-
                           ing features in the left and right camera images that are stable and will match well, yielding
                           high-quality stereo depth recovery. This approach has seen tremendous success in the field
                           of stereo vision, having been implemented commercially in both software and hardware
                           with good results. It has led to several commercial stereo vision systems and yet it is
                           extremely simple. Here we summarize the approach and explain some of its advantages.
                             The core of ZLoG is the Laplacian transformation of an image. Intuitively, this is noth-
                           ing more than the second derivative. Formally, the Laplacian  Lx y,(  )   of an image with
                           intensities Ix y,(  )   is defined as

                                          2    2
                                 (
                                   ,
                                Lxy) =  ∂ I  +  ∂ I                                          (4.35)
                                         x ∂  2  ∂ y 2
   143   144   145   146   147   148   149   150   151   152   153