Page 403 -

P. 403

Section 12.1 Registering Rigid Objects 371

Transformation Frame-bearing groups
Three points, or
Rigid (Euclidean) one line and one point oﬀ the line, or
two intersecting lines
Three points, or
Rigid and scale one line and one point oﬀ the line, or
two intersecting lines and a point oﬀ their plane.
Aﬃne Four points, no two co-planar

TABLE 12.2: Some frame-bearing groups for estimating transformations from 3D to 3D.
Assume we have one such group in the source, another in the target, and a correspondence
between the items in the group; then, we can estimate the transformation uniquely.

FIGURE 12.1: On the left, frames from a video taken by an aircraft overﬂying an airport.
These frames are rectiﬁed to one another to form a mosaic on the right, which reveals (a)
the overall structure of what was seen and (b) the ﬂight path of the aircraft. This ﬁgure
was originally published as Figure 1 of “Video Indexing Based on Mosaic Representations,”
by M. Irani and P. Anandan, Proc. IEEE, v86 n5, 1998, c IEEE, 1998.

One application is building larger images. There are several other important appli-
cations. For example, imagine we have image frames taken by, say, an orthographic
camera attached to an aircraft; then, if we register the frames to one another, we
see not only the pictures taken by the aircraft in a form that exposes all that it
saw, but also a representation of the ﬂight path, and so of what it could have seen
(Figure 12.1). As another example, imagine we have a ﬁxed camera, that collects
video. By registering the frames with one another, we can make estimates of (a)
the moving objects and (b) the background, and expose this information to viewers
in a novel way (Figure 12.2). As yet another example, we could build either a
cylindrical panorama, a set of pixel samples that mimic the image produced by a
cylindrical camera, or even a spherical panorama, a set of pixel samples that mimic
the image produced by a spherical camera. One feature of these panoramas is that
it is easy to query them for a set of pixels that looks like a perspective image. In
particular, it is easy to use these panoramas to imitate what one would see if a
perspective camera were to rotate about its focal point.
Building mosaics is a useful application of registration. In the simplest case,
we wish to register two images to one another. We do so by ﬁnding tokens, deciding
which ones should match, and then choosing the transformation that minimizes

398 399 400 401 402 403 404 405 406 407 408