Page 133 -
P. 133

112                                                                       3 Image processing


                                1998; Weickert 1998). It has also been shown to be closely related to other adaptive smooth-
                                ing techniques (Saint-Marc, Chen, and Medioni 1991; Barash 2002; Barash and Comaniciu
                                2004) as well as Bayesian regularization with a non-linear smoothness term that can be de-
                                rived from image statistics (Scharr, Black, and Haussecker 2003).
                                   In its general form, the range kernel r(i, j, k, l)= r( f(i, j)−f(k, l) ), which is usually
                                called the gain or edge-stopping function, or diffusion coefficient, can be any monotonically

                                increasing function with r (x) → 0 as x →∞. Black, Sapiro, Marimont et al. (1998) show
                                how anisotropic diffusion is equivalent to minimizing a robust penalty function on the image
                                gradients, which we discuss in Sections 3.7.1 and 3.7.2). Scharr, Black, and Haussecker
                                (2003) show how the edge-stopping function can be derived in a principled manner from
                                local image statistics. They also extend the diffusion neighborhood from N 4 to N 8 , which
                                allows them to create a diffusion operator that is both rotationally invariant and incorporates
                                information about the eigenvalues of the local structure tensor.
                                   Note that, without a bias term towards the original image, anisotropic diffusion and itera-
                                tive adaptive smoothing converge to a constant image. Unless a small number of iterations is
                                used (e.g., for speed), it is usually preferable to formulate the smoothing problem as a joint
                                minimization of a smoothness term and a data fidelity term, as discussed in Sections 3.7.1
                                and 3.7.2 and by Scharr, Black, and Haussecker (2003), which introduce such a bias in a
                                principled manner.

                                3.3.2 Morphology

                                While non-linear filters are often used to enhance grayscale and color images, they are also
                                used extensively to process binary images. Such images often occur after a thresholding
                                operation,

                                                                    1if f ≥ t,
                                                          θ(f, t)=                                   (3.41)
                                                                    0  else,
                                e.g., converting a scanned grayscale document into a binary image for further processing such
                                as optical character recognition.
                                   The most common binary image operations are called morphological operations, since
                                they change the shape of the underlying binary objects (Ritter and Wilson 2000, Chapter 7).
                                To perform such an operation, we first convolve the binary image with a binary structuring
                                element and then select a binary output value depending on the thresholded result of the
                                convolution. (This is not the usual way in which these operations are described, but I find it
                                a nice simple way to unify the processes.) The structuring element can be any shape, from
                                a simple 3 × 3 box filter, to more complicated disc structures. It can even correspond to a
                                particular shape that is being sought for in the image.
                                   Figure 3.21 shows a close-up of the convolution of a binary image f with a 3 × 3 struc-
                                turing element s and the resulting images for the operations described below. Let
                                                                c = f ⊗ s                            (3.42)

                                be the integer-valued count of the number of 1s inside each structuring element as it is scanned
                                over the image and S be the size of the structuring element (number of pixels). The standard
                                operations used in binary morphology include:

                                   • dilation: dilate(f, s)= θ(c, 1);
   128   129   130   131   132   133   134   135   136   137   138