Page 31 - Dynamic Vision for Perception and Control of Motion

P. 31

1.5 What Type of Vision System Is Most Adequate? 15

A few years ago, many systems were investigated with single processors de-
voted to single pixels (Connection Machine [Hillis 1985, 1992], Content-
Addressable Associative Parallel Processors (CAAPP) [Scudder, Weems 1990] and
others). The trend now clearly is toward more coarsely granulated parallel architec-
tures. Since a single microprocessor on the market at the turn of the century is ca-
9
pable of performing about 10 instructions per second, this means in excess of
2000 instructions per pixel of a 770 × 525 pixel image. Of course, this should not
be confused with information processing operations. For the year 2010, general-
11
purpose PC processors are expected to have a performance level of about 10 in-
structions per second.
On the other hand, the communication bandwidths of single channels will be so
high, that several image matrices may be transferred at a sufficiently high rate to
allow smooth recognition and control of motion processes. (One should refrain
from video norms, presently dominating the discussion, once imaging sensors with
digital output are in wide use.) Therefore, there is no need for more elaborate data
processing on the imaging chip except for ensuring sufficiently high intensity dy-
namics. Technical systems do not have the bandwidth problems, which may have
forced biological systems to do extensive data preprocessing near the retina (from
120 million light sensitive elements in the retina to 1.2 million nerves leading to
the lateral geniculate nucleus in humans).
Interesting studies have been made at several research institutions which tried to
exploit analog data processing on silicon chips [Koch 1995]; future comparisons of
results will have to show whether the space needed on the chip for this purpose can
be justified by the advantages claimed.
The mainstream development today is driven by commercial TV for the sensors
and by personal computers and games for the processors. With an expected in-
crease in computing power of one order of magnitude every 4 to 5 years over the
next decade, real-time machine vision will be ready for a wide range of applica-
tions using conventional engineering methods as represented by the 4-D approach.
A few (maybe a dozen) of these processors will be sufficient for solving even
rather complex tasks like ground and air vehicle guidance; dual processors on a
single chip are just entering the market. It is the goal of this monograph to make
the basic methods needed available to a wide public for efficient information ex-
traction from huge data streams.

1.7 What Is Intelligence? A Practical (Ecological)
Definition

The sensors of complex autonomous biological or technical systems yield an enor-
mous data rate containing information about both the state of the vehicle body rela-
tive to the environment and about other objects or subjects in the environment. It is
the task of an intelligent information extraction (data interpretation) system to
quickly get rid of as many data as possible, however simultaneously, to retain all of
the essential information for the task to be solved. Essential information is geared
to task domains; however, complex systems like animals and autonomous vehicles

26 27 28 29 30 31 32 33 34 35 36