Page 256 - Innovations in Intelligent Machines
P. 256

Toward Robot Perception through Omnidirectional Vision  249

                                                                D ij T ij
                                                              i,j
                                                    d (D, T)=                              (19)
                                                                 T ij
                                                               i,j
                           where T is a template edge-image and D is the distance transform of the edges
                           of the current run-time image. As weaker edges (small gradient magnitudes)
                           are more susceptible to noise, we set T ij to the gradient magnitudes of the
                           template images, instead of binary edges. Hence, we give more weight to the
                           strongest edges.
                              Equation 19 says that the chamfer distance is an average of the distances
                           between corresponding points of the shapes. In a strict sense, it is an approx-
                           imation as the underlying chamfer distance transform is itself an approxima-
                           tion to the Euclidean distance. In practice this difference is not relevant as
                           typically the shapes to compare are at similar poses and the distances between
                           the points are small enough to make negligible the difference of the chamfer
                           and the Euclidean distances.
                              In the topological localisation application, we want to find the database
                           image corresponding to the current run-time image. In order to find the best
                           matching we search the database using the chamfer distance as the comparison
                           measure. The comparison of images is done from an edge-image to a distance
                           transformed edge-image. The distance transformation may be applied either to
                           the run time or to the database images [36]. We apply the distance transform
                           to the run time edge-images and leave to the template edge-images the role
                           of selecting the relevant edge locations.
                              The distance as defined by Eq. (19) is zero for perfectly matching images.
                           Therefore we search for the image matching the current image I m in a set
                           T 1 ...T n by minimizing Eq. (19),
                                                 ˆ n =arg min d (D(I m ),T n ) .           (20)
                                                       n
                           Notice that, unlike recognition applications such as pedestrian and sign detec-
                           tion in an image [36], in the localisation application the template and run-time
                           images have equal sizes. The search parameter is an image index instead of
                           translation, rotation and scaling. The range of the index is the size of the
                           database.
                              Usually there are a large number of database images, and thus under-
                           taking localisation, as in Eq. (20), is computationally expensive. However it
                           only needs to be performed once, when the robot is dropped-in-scene. During
                           normal operation there is a causality constraint along the consecutive loca-
                           tions. We reduce the search range to a window around the last location, typ-
                           ically ±5 images.

                           Eigenspace Approximation to the Hausdorff Fraction

                           The Hausdorff distance [73] (of which the Hausdorff fraction is a subset) is a
                           technique whereby one can measure the distance between two sets of points,
   251   252   253   254   255   256   257   258   259   260   261