Page 258 -
P. 258

5.1 Active contours                                                                    237


               Image segmentation is the task of finding groups of pixels that “go together”. In statistics, this
               problem is known as cluster analysis and is a widely studied area with hundreds of different
               algorithms (Jain and Dubes 1988; Kaufman and Rousseeuw 1990; Jain, Duin, and Mao 2000;
               Jain, Topchy, Law et al. 2004).
                  In computer vision, image segmentation is one of the oldest and most widely studied prob-
               lems (Brice and Fennema 1970; Pavlidis 1977; Riseman and Arbib 1977; Ohlander, Price,
               and Reddy 1978; Rosenfeld and Davis 1979; Haralick and Shapiro 1985). Early techniques
               tend to use region splitting or merging (Brice and Fennema 1970; Horowitz and Pavlidis 1976;
               Ohlander, Price, and Reddy 1978; Pavlidis and Liow 1990), which correspond to divisive and
               agglomerative algorithms in the clustering literature (Jain, Topchy, Law et al. 2004). More
               recent algorithms often optimize some global criterion, such as intra-region consistency and
               inter-region boundary lengths or dissimilarity (Leclerc 1989; Mumford and Shah 1989; Shi
               and Malik 2000; Comaniciu and Meer 2002; Felzenszwalb and Huttenlocher 2004b; Cremers,
               Rousson, and Deriche 2007).
                  We have already seen examples of image segmentation in Sections 3.3.2 and 3.7.2.In
               this chapter, we review some additional techniques that have been developed for image seg-
               mentation. These include algorithms based on active contours (Section 5.1) and level sets
               (Section 5.1.4), region splitting and merging (Section 5.2), mean shift (mode finding) (Sec-
               tion 5.3), normalized cuts (splitting based on pixel similarity metrics) (Section 5.4), and bi-
               nary Markov random fields solved using graph cuts (Section 5.5). Figure 5.1 shows some
               examples of these techniques applied to different images.
                  Since the literature on image segmentation is so vast, a good way to get a handle on some
               of the better performing algorithms is to look at experimental comparisons on human-labeled
               databases (Arbel´ aez, Maire, Fowlkes et al. 2010). The best known of these is the Berkeley
                                               1
               Segmentation Dataset and Benchmark (Martin, Fowlkes, Tal et al. 2001), which consists
               of 1000 images from a Corel image dataset that were hand-labeled by 30 human subjects.
               Many of the more recent image segmentation algorithms report comparative results on this
               database. For example, Unnikrishnan, Pantofaru, and Hebert (2007) propose new metrics
               for comparing such algorithms. Estrada and Jepson (2009) compare four well-known seg-
               mentation algorithms on the Berkeley data set and conclude that while their own SE-MinCut
               algorithm (Estrada, Jepson, and Chennubhotla 2004) algorithm outperforms the others by a
               small margin, there still exists a wide gap between automated and human segmentation per-
                       2
               formance. A new database of foreground and background segmentations, used by Alpert,
               Galun, Basri et al. (2007), is also available. 3


               5.1 Active contours


               While lines, vanishing points, and rectangles are commonplace in the man-made world,
               curves corresponding to object boundaries are even more common, especially in the natural
               environment. In this section, we describe three related approaches to locating such boundary
               curves in images.

                  1  http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/segbench/
                  2  An interesting observation about their ROC plots is that automated techniques cluster tightly along similar
               curves, but human performance is all over the map.
                  3  http://www.wisdom.weizmann.ac.il/ ∼ vision/Seg Evaluation DB/index.html
   253   254   255   256   257   258   259   260   261   262   263