Page 46 - Dynamic Vision for Perception and Control of Motion

P. 46

30 2 Basic Relations: Image Sequences – “the World”

rows of the HTMs in Equation 2.8. It has not been done here because the direction
of the local road tangents would not be the same for a curved road. Therefore, be-
fore performing the second translation to the origin of the camera CS, a rotation
around the vertical axis by the difference F in local road direction would have to be
inserted (yielding another rotation matrix R Fco between T Rc and T Ro.
Now the two rotation-angles \ c and 4 c for the optical axis of the camera have to
be applied. Since \ c according to the definition has to be applied first, R \c has to
stand to the right because this matrix will be encountered first when the column-
vector x o is multiplied from the right. This finally yields the state vector for the
point T in camera coordinates:

x R \ c R T c T Rc T Ro R \ o x . (2.9)
co
o
Applying perspective projection (Equation (2.4) to this 3-D-point x co yields the
“homogeneous” feature data for the image coordinates e according to Equation 2.5;
note that the five HTMs contain the unknown variables of the vision task written
below each matrix:
e [ PR p T c R \ c T Rc T Ro R \ o ] x o T tot x .
o
(2.10)
unknowns: ș , ȥ , y (x ,y ) ȥ !
c c gc co oR o
T
The explicitly written down (transposed) form (e) = (e 1, e 2, e 3, e 4) then yields
the image coordinates
e
e
y i / ; e 2 e 4 z i / . (2.11)
3
4
The expression in square brackets is the same for any point to be transformed
from object- into camera- coordinates. Therefore, in computer graphics, where all
elements entering the HTMs are known beforehand, the so-called concatenated
transformation matrix T tot is computed as the product of all single HTMs once for
each object and aspect condition. A single matrix-vector multiplication T tot · x o then
yields the position in homogeneous feature coordinates for the image of a point on
the object at x o in object-centered coordinates. Equation 2.11 finally gives the im-
age coordinates.
Note that these coordinates are real numbers, which means that the positions of
the points mapped into the image are known to subpixel accuracy. If measurements
of image features (Chapter 5) can be done to subpixel accuracy, too, the methods
applied in recursive estimation (see Chapter 6) yield improved results by not
rounding off feature coordinates to integer numbers (as is often done in a naive ap-
proach).

2.1.1.6 General Concatenations of HCTs; the Scene Tree
While the use of these transformation matrices in computer graphics is common-
place as a flexible tool for adaptation to new or modified tasks, in machine vision,
until very recently, they have been exploited only during the formulation phase of
the problem. Then, to be numerically more efficient on general-purpose processors,
the resulting expressions of the matrix product T tot have been hand-coded, initially.
With the processing power available now, more easily adaptable codes become
preferable; this is achieved by keeping each HTMs separate until numerical evalua-
tion because in each matrix variables to be iterated may appear.

41 42 43 44 45 46 47 48 49 50 51