Page 20 -
P. 20

Preface  xviii


                               • We have written a comprehensive treatment of the modern features, par-
                                 ticularly HOG and SIFT (both in Chapter 5), that drive applications ranging
                                 from building image mosaics to object recognition.
                               • We give a detailed treatment of modern image editing techniques,in-
                                 cluding removing shadows (Section 3.5), filling holes in images (Section 6.3),
                                 noise removal (Section 6.4), and interactive image segmentation (Section 9.2).
                               • We give a comprehensive treatment of modern object recognition tech-
                                 niques. We start with a practical discussion of classifiers (Chapter 15); we
                                 then describe standard methods for image classification techniques (Chapter
                                 16), and object detection (Chapter 17). Finally, Chapter 18 reviews a wide
                                 range of recent topics in object recognition.

                               • Finally, this book has a very detailed index, and a bibliography that is as
                                 comprehensive and up-to-date as we could make it.

                            WHY STUDY VISION?

                            Computer vision’s great trick is extracting descriptions of the world from pictures
                            or sequences of pictures. This is unequivocally useful. Taking pictures is usually
                            nondestructive and sometimes discreet. It is also easy and (now) cheap. The de-
                            scriptions that users seek can differ widely between applications. For example, a
                            technique known as structure from motion makes it possible to extract a representa-
                            tion of what is depicted and how the camera moved from a series of pictures. People
                            in the entertainment industry use these techniques to build three-dimensional (3D)
                            computer models of buildings, typically keeping the structure and throwing away
                            the motion. These models are used where real buildings cannot be; they are set fire
                            to, blown up, etc. Good, simple, accurate, and convincing models can be built from
                            quite small sets of photographs. People who wish to control mobile robots usually
                            keep the motion and throw away the structure. This is because they generally know
                            something about the area where the robot is working, but usually don’t know the
                            precise robot location in that area. They can determine it from information about
                            how a camera bolted to the robot is moving.
                                 There are a number of other, important applications of computer vision. One
                            is in medical imaging: one builds software systems that can enhance imagery, or
                            identify important phenomena or events, or visualize information obtained by imag-
                            ing. Another is in inspection: one takes pictures of objects to determine whether
                            they are within specification. A third is in interpreting satellite images, both for
                            military purposes (a program might be required to determine what militarily inter-
                            esting phenomena have occurred in a given region recently; or what damage was
                            caused by a bombing) and for civilian purposes (what will this year’s maize crop
                            be? How much rainforest is left?) A fourth is in organizing and structuring collec-
                            tions of pictures. We know how to search and browse text libraries (though this is
                            a subject that still has difficult open questions) but don’t really know what to do
                            with image or video libraries.
                                 Computer vision is at an extraordinary point in its development. The subject
                            itself has been around since the 1960s, but only recently has it been possible to
                            build useful computer systems using ideas from computer vision. This flourishing
   15   16   17   18   19   20   21   22   23   24   25