Page 321 -

P. 321

Chapter 8 ■ Classiﬁcation 295

system for visual objects. The system learns — that is, establishes ranges for
the features that were selected for use — by being given a known object and
then identifying some pattern among the feature values. This works better if
a large number of objects and images are used in the training process, and
that means having a large set of classiﬁed objects on hand before the system is
even completed. This is called training data. It may turn out that some of the
selected features are not useful and will need to be discarded or replaced, and
this will require modiﬁcations to the system.
So, for each object in each test image, all the proposed features are measured
and stored. A classiﬁer is built that uses these features to determine the class
of the objects as well as can be done. Rarely will this be perfect, but it could
be perfect for the training data. Finding out the actual rate of successful
classiﬁcation must be done using a different set of data, not the training data,
because the system has been tuned speciﬁcally to recognize the training data
set. We must know the actual classiﬁcations for the test data, too, since we
need to determine how often the system returns the correct class. If we have
100 objects that are of known classes, then this set of data needs to be split
into two sets: one for training, one for testing. For the time being, they should
be split into two equal parts, but alternatives will be described starting in the
next section and in the remainder of the chapter.

8.1.3 Variation: In-Class and Out-Class
Part of the problem with visual classiﬁcation is that objects do not look the
same in different images, in different orientations, and when seen through
different cameras. Examining the data for the carrot problem, it is easy to see
that tomatoes have a variety of different values for each of the features we have
measured so far: color (red, green, blue) and area. Indeed, no two tomatoes
have the same values for these four features. Consider the green component
of tomato regions: the values in each row of the following table are an average
of the green components of all pixels in the region, meaning that even within
each region they are not all the same. The means over the various regions are
not the same either.

GREEN AREA
MEAN OVER ALL TOMATOES 48.80 1756.00
STANDARD DEVIATION 4.95 299.79
MEAN OVER ALL CARROTS 106.0 2520.40
STANDARD DEVIATION 2.28 173.16

Samples of features such as these usually follow a statistical normal distri-
bution. This is the famous bell-shaped curve, where the mean is in the center
and the standard deviation speciﬁes the width of the bell. The variation of
the measurements is greater if the bell is wide, of course. A narrow range of

316 317 318 319 320 321 322 323 324 325 326