Page 264 -

P. 264

5.1 Active contours 243

form
1 T 2
E shape = b diag(λ 0 ...λ M−1 ) b = b /2λ m . (5.17)
m
2
m
Alternatively, the range of allowable b m values can be limited to some range, e.g., |b m |≤
√
3 λ m (Cootes, Cooper, Taylor et al. 1995). Alternative approaches for deriving a set of
shape vectors are reviewed by Isard and Blake (1998).
√ √
Varying the individual shape parameters b m over the range −2 λ m ≤ 2 λ m can give
a good indication of the expected variation in appearance, as shown in Figure 5.4d. Another
example, this time related to face contours, is shown in Figure 5.5a.
In order to align a point distribution model with an image, each control point searches
in a direction normal to the contour to ﬁnd the most likely corresponding image edge point
(Figure 5.5b). These individual measurements can be combined with priors on the shape
parameters (and, if desired, position, scale, and orientation parameters) to estimate a new set
of parameters. The resulting Active Shape Model (ASM) can be iteratively minimized to ﬁt
images to non-rigidly deforming objects such as medical images or body parts such as hands
(Cootes, Cooper, Taylor et al. 1995). The ASM can also be combined with a PCA analysis of
the underlying gray-level distribution to create an Active Appearance Model (AAM) (Cootes,
Edwards, and Taylor 2001), which we discuss in more detail in Section 14.2.2.

5.1.2 Dynamic snakes and CONDENSATION

In many applications of active contours, the object of interest is being tracked from frame
to frame as it deforms and evolves. In this case, it makes sense to use estimates from the
previous frame to predict and constrain the new estimates.
One way to do this is to use Kalman ﬁltering, which results in a formulation called Kalman
snakes (Terzopoulos and Szeliski 1992; Blake, Curwen, and Zisserman 1993). The Kalman
ﬁlter is based on a linear dynamic model of shape parameter evolution,
x t = Ax t−1 + w t , (5.18)

where x t and x t−1 are the current and previous state variables, A is the linear transition
matrix, and w is a noise (perturbation) vector, which is often modeled as a Gaussian (Gelb
1974). The matrices A and the noise covariance can be learned ahead of time by observing
typical sequences of the object being tracked (Blake and Isard 1998).
The qualitative behavior of the Kalman ﬁlter can be seen in Figure 5.6a. The linear dy-
namic model causes a deterministic change (drift) in the previous estimate, while the process
noise (perturbation) causes a stochastic diffusion that increases the system entropy (lack of
certainty). New measurements from the current frame restore some of the certainty (peaked-
ness) in the updated estimate.
In many situations, however, such as when tracking in clutter, a better estimate for the
contour can be obtained if we remove the assumptions that the distribution are Gaussian,
which is what the Kalman ﬁlter requires. In this case, a general multi-modal distribution is
propagated, as shown in Figure 5.6b. In order to model such multi-modal distributions, Isard
and Blake (1998) introduced the use of particle ﬁltering to the computer vision community. 5

5 Alternatives to modeling multi-modal distributions include mixtures of Gaussians (Bishop 2006) and multiple
hypothesis tracking (Bar-Shalom and Fortmann 1988; Cham and Rehg 1999).

259 260 261 262 263 264 265 266 267 268 269