Page 194 - Classification Parameter Estimation & State Estimation An Engg Approach Using MATLAB
P. 194

6





            Feature Extraction and


            Selection










            In some cases, the dimension N of a measurement vector z, i.e. the
            number of sensors, can be very high. In image processing, when raw
            image data is used directly as the input for a classification, the dimen-
                                          4
            sion can easily attain values of 10 (a 100   100 image) or more. Many
            elements of z can be redundant or even irrelevant with respect to the
            classification process.
              For two reasons, the dimension of the measurement vector cannot be
            taken arbitrarily large. The first reason is that the computational com-
            plexity becomes too large. A linear classification machine requires in the
            order of KN operations (K is the number of classes; see Chapter 2).
                                              2
            A quadratic machine needs about KN operations. For a machine acting
            on binary measurements the memory requirement is on the order of
              N
            K2 . This, together with the required throughput (number of classifica-
            tions per second), the state of the art in computer technology and the
            available budget define an upper bound to N.
              A second reason is that an increase of the dimension ultimately causes
            a decrease of performance. Figure 6.1 illustrates this. Here, we have a
            measurement space with the dimension N varying between 1 and 13.
            There are two classes (K ¼ 2) with equal prior probabilities. The (true)
            minimum error rate E min is the one which would be obtained if all class
            densities of the problem were fully known. Clearly, the minimum error



            Classification, Parameter Estimation and State Estimation: An Engineering Approach using MATLAB
            F. van der Heijden, R.P.W. Duin, D. de Ridder and D.M.J. Tax
            Ó 2004 John Wiley & Sons, Ltd  ISBN: 0-470-09013-8
   189   190   191   192   193   194   195   196   197   198   199