Page 151 - Classification Parameter Estimation & State Estimation An Engg Approach Using MATLAB

P. 151

140 SUPERVISED LEARNING

(a) (b)

bolts 1
nuts
1 scrap 0.8
measure of eccentricity 0.8 measure of eccentricity 0.6
rings

0.6
0.4
0.4
0.2
0.2
0
0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
measure of six-fold rotational symmetry measure of six-fold rotational symmetry

Figure 5.1 Training sets. (a) Labelled. (b) Unlabelled

The chapter starts with a section on the representation of training sets.
In Sections 5.2 and 5.3 two approaches to supervised learning are dis-
cussed: parametric and nonparametric learning. Section 5.4 addresses
the problem of how to evaluate a classifier empirically. The discussion
here is restricted to classification problems only. However, many tech-
niques that are useful for classification problems are also useful for
estimation problems. Especially Section 5.2 (parametric learning) is
useful for estimation problems too.

5.1 TRAINING SETS

The set of samples is usually called the training set (or: learning data or
design set). The selection of samples should occur randomly from the
population. In almost all cases it is assumed that the samples are i.i.d.,
independent and identically distributed. This means that all samples are
selected from the same population of objects (in the simplest case, with
equal probability). Furthermore, the probability of one member of the
population being selected is not allowed to depend on the selection of
other members of the population.
Figure 5.1 shows scatter diagrams of the mechanical parts application
of Chapter 2. In Figure 5.1(a) the samples are provided with a label
carrying the information of the true class of the corresponding object.
There are several methods to find the true class of a sample, e.g. manual
inspection, additional measurements, destructive analysis, etc. Often,

146 147 148 149 150 151 152 153 154 155 156