Page 309 -
P. 309

288                                                                6 Feature-based alignment













                             (a)                 (b)                     (c)                (d)
                Figure 6.6 The VideoMouse can sense six degrees of freedom relative to a specially printed mouse pad using
                its embedded camera (Hinckley, Sinclair, Hanson et al. 1999) c   1999 ACM: (a) top view of the mouse; (b) view
                of the mouse showing the curved base for rocking; (c) moving the mouse pad with the other hand extends the
                interaction capabilities; (d) the resulting movement seen on the screen.



                                outdoor locations, such as film sets, it is more common to place special markers such as
                                brightly colored balls in the scene to make it easier to find and track them (Bogart 1991). In
                                older applications, surveying techniques were used to determine the locations of these balls
                                before filming. Today, it is more common to apply structure-from-motion directly to the film
                                footage itself (Section 7.4.2).
                                   Rapid pose estimation is also central to tracking the position and orientation of the hand-
                                held remote controls used in Nintendo’s Wii game systems. A high-speed camera embedded
                                in the remote control is used to track the locations of the infrared (IR) LEDs in the bar that
                                is mounted on the TV monitor. Pose estimation is then used to infer the remote control’s
                                location and orientation at very high frame rates. The Wii system can be extended to a variety
                                of other user interaction applications by mounting the bar on a hand-held device, as described
                                by Johnny Lee. 11
                                   Exercises 6.4 and 6.5 have you implement two different tracking and pose estimation sys-
                                tems for augmented-reality applications. The first system tracks the outline of a rectangular
                                object, such as a book cover or magazine page, and the second has you track the pose of a
                                hand-held Rubik’s cube.



                                6.3 Geometric intrinsic calibration

                                As described above in Equations (6.42–6.43), the computation of the internal (intrinsic) cam-
                                era calibration parameters can occur simultaneously with the estimation of the (extrinsic)
                                pose of the camera with respect to a known calibration target. This, indeed, is the “classic”
                                approach to camera calibration used in both the photogrammetry (Slama 1980) and the com-
                                puter vision (Tsai 1987) communities. In this section, we look at alternative formulations
                                (which may not involve the full solution of a non-linear regression problem), the use of alter-
                                native calibration targets, and the estimation of the non-linear part of camera optics such as
                                radial distortion. 12


                                 11  http://johnnylee.net/projects/wii/.
                                 12  In some applications, you can use the EXIF tags associated with a JPEG image to obtain a rough estimate of a
                                camera’s focal length but this technique should be used with caution as the results are often inaccurate.
   304   305   306   307   308   309   310   311   312   313   314