Page 133 -

P. 133

112 3 Image processing

1998; Weickert 1998). It has also been shown to be closely related to other adaptive smooth-
ing techniques (Saint-Marc, Chen, and Medioni 1991; Barash 2002; Barash and Comaniciu
2004) as well as Bayesian regularization with a non-linear smoothness term that can be de-
rived from image statistics (Scharr, Black, and Haussecker 2003).
In its general form, the range kernel r(i, j, k, l)= r( f(i, j)−f(k, l) ), which is usually
called the gain or edge-stopping function, or diffusion coefﬁcient, can be any monotonically

increasing function with r (x) → 0 as x →∞. Black, Sapiro, Marimont et al. (1998) show
how anisotropic diffusion is equivalent to minimizing a robust penalty function on the image
gradients, which we discuss in Sections 3.7.1 and 3.7.2). Scharr, Black, and Haussecker
(2003) show how the edge-stopping function can be derived in a principled manner from
local image statistics. They also extend the diffusion neighborhood from N 4 to N 8 , which
allows them to create a diffusion operator that is both rotationally invariant and incorporates
information about the eigenvalues of the local structure tensor.
Note that, without a bias term towards the original image, anisotropic diffusion and itera-
tive adaptive smoothing converge to a constant image. Unless a small number of iterations is
used (e.g., for speed), it is usually preferable to formulate the smoothing problem as a joint
minimization of a smoothness term and a data ﬁdelity term, as discussed in Sections 3.7.1
and 3.7.2 and by Scharr, Black, and Haussecker (2003), which introduce such a bias in a
principled manner.

3.3.2 Morphology

While non-linear ﬁlters are often used to enhance grayscale and color images, they are also
used extensively to process binary images. Such images often occur after a thresholding
operation,

1if f ≥ t,
θ(f, t)= (3.41)
0 else,
e.g., converting a scanned grayscale document into a binary image for further processing such
as optical character recognition.
The most common binary image operations are called morphological operations, since
they change the shape of the underlying binary objects (Ritter and Wilson 2000, Chapter 7).
To perform such an operation, we ﬁrst convolve the binary image with a binary structuring
element and then select a binary output value depending on the thresholded result of the
convolution. (This is not the usual way in which these operations are described, but I ﬁnd it
a nice simple way to unify the processes.) The structuring element can be any shape, from
a simple 3 × 3 box ﬁlter, to more complicated disc structures. It can even correspond to a
particular shape that is being sought for in the image.
Figure 3.21 shows a close-up of the convolution of a binary image f with a 3 × 3 struc-
turing element s and the resulting images for the operations described below. Let
c = f ⊗ s (3.42)

be the integer-valued count of the number of 1s inside each structuring element as it is scanned
over the image and S be the size of the structuring element (number of pixels). The standard
operations used in binary morphology include:

• dilation: dilate(f, s)= θ(c, 1);

128 129 130 131 132 133 134 135 136 137 138