Page 211 - Classification Parameter Estimation & State Estimation An Engg Approach Using MATLAB
P. 211

200                         FEATURE EXTRACTION AND SELECTION

            included in the selected sets, it cannot be removed, even if at a higher
            level in the tree, when more features are added, this feature would be less
            useful.
              The SFS adds one feature at a time to the current set. An improvement
            is to add more than one, say l, features at a time. Suppose that at a given
            stage of the selection process we have a set F j (n) consisting of n features.
            Then, the next step is to expand this set with l features taken from the
            remaining  N   n   measurements.  For  that  we   have  (N   n)!=
            ((N   n   l)! l!) combinations which all must be checked to see which
            one is most profitable. This strategy is called the generalized sequential
            forward selection (GSFS(l)).
              Both the SFS and the GSFS(l) lack a backtracking mechanism. Once a
            feature has been added, it cannot be undone. Therefore, a further
            improvement is to add l features at a time, and after that to dispose of
            some of the features from the set obtained so far. Hence, starting with a
            set of F j (n) features we select the combination of l remaining measure-
            ments that when added to F j (n) yields the best performance. From this
            expanded set, say F i (n þ l), we select a subset of r features and remove
            it from F i (n þ l) to obtain a set, say F k (n þ l   r). The subset of r features
            is selected such that F k (n þ l   r) has the best performance. This strategy
            is called ‘Plus l – take away r selection’.

              Example 6.2   Character classification for license plate recognition
              In traffic management, automated license plate recognition is useful
              for a number of tasks, e.g. speed checking and automated toll collec-
              tion. The functions in a system for license plate recognition include
              image acquisition, vehicle detection, license plate localization, etc.
              Once the license plate has been localized in the image, it is partitioned
              such that each part of the image – a bitmap – holds exactly one
              character. The classes include all alphanumerical values. Therefore,
              the number of classes is 36. Figure 6.6(a) provides some examples of
              bitmaps obtained in this way.
                The size of the bitmaps ranges from 15   6upto30   15. In order
              to fix the number of pixels, and also to normalize the position, scale
              and contrasts of the character within the bitmap, all characters are
              normalized to 15   11.
                Using the raw image data as measurements, the number of mea-
              surements per object is 15   11 ¼ 165. A labelled training set con-
              sisting of about 70 000 samples, i.e. about 2000 samples/class, is
              available. Comparing the number of measurements against the num-
              ber of samples/class we conclude that overfitting is likely to occur.
   206   207   208   209   210   211   212   213   214   215   216