Page 19 -
P. 19

Preface






                                 Computer vision as a field is an intellectual frontier. Like any frontier, it is
                            exciting and disorganized, and there is often no reliable authority to appeal to.
                            Many useful ideas have no theoretical grounding, and some theories are useless
                            in practice; developed areas are widely scattered, and often one looks completely
                            inaccessible from the other. Nevertheless, we have attempted in this book to present
                            a fairly orderly picture of the field.
                                 We see computer vision—or just “vision”; apologies to those who study human
                            or animal vision—as an enterprise that uses statistical methods to disentangle data
                            using models constructed with the aid of geometry, physics, and learning theory.
                            Thus, in our view, vision relies on a solid understanding of cameras and of the
                            physical process of image formation (Part I of this book) to obtain simple inferences
                            from individual pixel values (Part II), combine the information available in multiple
                            images into a coherent whole (Part III), impose some order on groups of pixels to
                            separate them from each other or infer shape information (Part IV), and recognize
                            objects using geometric information or probabilistic techniques (Part V). Computer
                            vision has a wide variety of applications, both old (e.g., mobile robot navigation,
                            industrial inspection, and military intelligence) and new (e.g., human computer
                            interaction, image retrieval in digital libraries, medical image analysis, and the
                            realistic rendering of synthetic scenes in computer graphics). We discuss some of
                            these applications in part VII.

                            IN THE SECOND EDITION
                            We have made a variety of changes since the first edition, which we hope have
                            improved the usefulness of this book. Perhaps the most important change follows
                            a big change in the discipline since the last edition. Code and data are now widely
                            published over the Internet. It is now quite usual to build systems out of other
                            people’s published code, at least in the first instance, and to evaluate them on
                            other people’s datasets. In the chapters, we have provided guides to experimental
                            resources available online. As is the nature of the Internet, not all of these URL’s
                            will work all the time; we have tried to give enough information so that searching
                            Google with the authors’ names or the name of the dataset or codes will get the
                            right result.
                                 Other changes include:

                               • We have simplified. We give a simpler, clearer treatment of mathematical
                                 topics. We have particularly simplified our treatment of cameras (Chapter
                                 1), shading (Chapter 2), and reconstruction from two views (Chapter 7) and
                                 from multiple views (Chapter 8)
                               • We describe a broad range of applications, including image-based mod-
                                 elling and rendering (Chapter 19), image search (Chapter 22), building image
                                 mosaics (Section 12.1), medical image registration (Section 12.3), interpreting
                                 range data (Chapter 14), and understanding human activity (Chapter 21).

                                                                                                 xvii
   14   15   16   17   18   19   20   21   22   23   24