Page 256 - Innovations in Intelligent Machines

P. 256

Toward Robot Perception through Omnidirectional Vision 249

D ij T ij
i,j
d (D, T)= (19)
T ij
i,j
where T is a template edge-image and D is the distance transform of the edges
of the current run-time image. As weaker edges (small gradient magnitudes)
are more susceptible to noise, we set T ij to the gradient magnitudes of the
template images, instead of binary edges. Hence, we give more weight to the
strongest edges.
Equation 19 says that the chamfer distance is an average of the distances
between corresponding points of the shapes. In a strict sense, it is an approx-
imation as the underlying chamfer distance transform is itself an approxima-
tion to the Euclidean distance. In practice this diﬀerence is not relevant as
typically the shapes to compare are at similar poses and the distances between
the points are small enough to make negligible the diﬀerence of the chamfer
and the Euclidean distances.
In the topological localisation application, we want to ﬁnd the database
image corresponding to the current run-time image. In order to ﬁnd the best
matching we search the database using the chamfer distance as the comparison
measure. The comparison of images is done from an edge-image to a distance
transformed edge-image. The distance transformation may be applied either to
the run time or to the database images [36]. We apply the distance transform
to the run time edge-images and leave to the template edge-images the role
of selecting the relevant edge locations.
The distance as deﬁned by Eq. (19) is zero for perfectly matching images.
Therefore we search for the image matching the current image I m in a set
T 1 ...T n by minimizing Eq. (19),
ˆ n =arg min d (D(I m ),T n ) . (20)
n
Notice that, unlike recognition applications such as pedestrian and sign detec-
tion in an image [36], in the localisation application the template and run-time
images have equal sizes. The search parameter is an image index instead of
translation, rotation and scaling. The range of the index is the size of the
database.
Usually there are a large number of database images, and thus under-
taking localisation, as in Eq. (20), is computationally expensive. However it
only needs to be performed once, when the robot is dropped-in-scene. During
normal operation there is a causality constraint along the consecutive loca-
tions. We reduce the search range to a window around the last location, typ-
ically ±5 images.

Eigenspace Approximation to the Hausdorﬀ Fraction

The Hausdorﬀ distance [73] (of which the Hausdorﬀ fraction is a subset) is a
technique whereby one can measure the distance between two sets of points,

251 252 253 254 255 256 257 258 259 260 261