Page 55 -

P. 55

2.4 Principal Components 41

variance of one feature if the total variance was equally distributed. For the cork
stoppers data this would correspond to retaining only the first 2 eigenvalues. It can
also be useful to inspect a plot of the eigenvalues as illustrated in Figure 2.17. A
criterion based on such a plot (called scree lest) suggests discarding the
eigenvalues starting where the plot levels off, which, in this case, would amount to
retaining only the first 4 eigenvalues.
Principal components analysis as a dimensionality reduction method must be
applied with caution, for the follow~ng reasons:

- Principal components are linear trunsfbrmations of the original features,
therefore, reduction to significant principal components may not appropriately
reflect non-linearities that may be present in the data (see e.g. chapter eight of
Bishop, 1995).
- Principal components with negligible contribution to the overall variance may
nevertheless provide a crucial contribution to pattern discrimination. By
discarding such components we may inadvertently impair the classification or
regression performance (idem).
- It is usually difficult to attach any semantic meaning to principal components.
Such semantic meaning, provided by the original features, is often useful when
developing and applying classification or regression solutions.

Although principal components analysis suffers from these shortcomings there
is still something to be gained from it since it provides the designer with desirable
low dimensional representations of the original data, to be explored further in the
next chapter. It also provides meaningful estimates of the intrinsic data
dimensionality, which can sonlehow serve as reference for more sophisticated
feature selection methods, to be explained in chapter four.

2.5 Feature Assessment

Assessing the discriminative capability of features is an important task either in the
initial phase of a PR project or in a more advanced phase, for example, when
choosing between alternative feature sets. This assessment is usually of great help
in guiding subsequent project phases by giving some insight in what is going on in
the feature space, e.g., concerning class separability.
The feature assessment task is usually conducted in the following sequence of
subtasks:

1. Graphic inspection
2. Distribution model assessment
3. Statistic inference tests

We will illustrate now these subtasks using the cork stoppers data. We assume
that the reader is familiar with the statistical techniques used, which are described

50 51 52 53 54 55 56 57 58 59 60