Page 157 -
P. 157

136                                                                       3 Image processing





                  coarse                              l = 4     coarse                             l = 2
                                                      l = 3
                  medium                              l = 2     medium                             l = 1


                                                      l = 1


                  fine                                l = 0     fine                               l = 0



                                     (a)                                           (b)

                Figure 3.36 Multiresolution pyramids: (a) pyramid with half-octave (quincunx) sampling (odd levels are colored
                gray for clarity). (b) wavelet pyramid—each wavelet level stores 3/4 of the original pixels (usually the horizontal,
                vertical, and mixed gradients), so that the total number of wavelet coefficients and original pixels is the same.



                                cent levels, the authors claim that coarse-to-fine algorithms perform better. In the image-
                                processing community, half-octave pyramids combined with checkerboard sampling grids
                                are known as quincunx sampling (Feilner, Van De Ville, and Unser 2005). In detecting multi-
                                scale features (Section 4.1.1), it is often common to use half-octave or even quarter-octave
                                pyramids (Lowe 2004; Triggs 2004). However, in this case, the subsampling only occurs
                                at every octave level, i.e., the image is repeatedly blurred with wider Gaussians until a full
                                octave of resolution change has been achieved (Figure 4.11).


                                3.5.4 Wavelets

                                While pyramids are used extensively in computer vision applications, some people use wavelet
                                decompositions as an alternative. Wavelets are filters that localize a signal in both space
                                and frequency (like the Gabor filter in Table 3.2) and are defined over a hierarchy of scales.
                                Wavelets provide a smooth way to decompose a signal into frequency components without
                                blocking and are closely related to pyramids.
                                   Wavelets were originally developed in the applied math and signal processing communi-
                                ties and were introduced to the computer vision community by Mallat (1989). Strang (1989);
                                Simoncelli and Adelson (1990b); Rioul and Vetterli (1991); Chui (1992); Meyer (1993) all
                                provide nice introductions to the subject along with historical reviews, while Chui (1992) pro-
                                vides a more comprehensive review and survey of applications. Sweldens (1997) describes
                                the more recent lifting approach to wavelets that we discuss shortly.
                                   Wavelets are widely used in the computer graphics community to perform multi-resolution
                                geometric processing (Stollnitz, DeRose, and Salesin 1996) and have also been used in com-
                                puter vision for similar applications (Szeliski 1990b; Pentland 1994; Gortler and Cohen 1995;
                                Yaou and Chang 1994; Lai and Vemuri 1997; Szeliski 2006b), as well as for multi-scale ori-
                                ented filtering (Simoncelli, Freeman, Adelson et al. 1992) and denoising (Portilla, Strela,
                                Wainwright et al. 2003).
   152   153   154   155   156   157   158   159   160   161   162