Page 20 -
P. 20

6       1 Basic Notions

       outside its scope. As for the  greenish orange its  feature vector  is nearly  at equal
       distance  from  both  prototypes  and  its  classification  is  problematic.  If  we  use,
       instead  of  the  Euclidian  distance,  another  distance  measure  that  weighs  more
       heavily  vertical  deviations than  horizontal  deviations,  the greenish  orange would
       also be wrongly classified.
         In general practice pattern  classification  systems are  not  flawless and we  may
       expect errors due to several causes:
       - The features used  are inadequate or insufficient.  For instance, the classification
         of  the  problematic  greenish  orange  would  probably  improve  by  using  an
         additional texture feature measuring the degree of surface roughness.
       - The  pattern  samples  used  to  design  the  classifier  are  not  sufficiently
         representative.  For  instance,  if  our  intention  is  to  discriminate  apples  from
         oranges we should have to include in the apples sample a representative variety
         of apples, including the red ones as well.
       - The classifier is not efficient enough in separating the classes. For instance, an
         inefficient distance measure or inadequate prototypes are being used.
       - There is an intrinsic overlap of the classes that no classifier can resolve.

         In this book we will focus our attention on the aspects that relate to the selection
       of adequate features and to the design of efficient classifiers. Concerning the initial
       choice of features  it is worth noting that  this is more an art than a science and, as
       with  any  art,  it  is  improved  by  experimentation  and  practice.  Besides  the
       appropriate choice of features and similarity measures, there are also other aspects
       responsible for the high degree of classifying accuracy in humans. Aspects such as
       the use of contextual information and advanced knowledge structures fall mainly in
       the  domain  of  an  artificial  intelligence course and  will  be  not  dealt  with  in  this
       book. Even the human recognition  of objects is not always flawless and contextual
       information risks classifying a greenish orange as a lemon if  it lies in a basket with
       lemons.



       1.2.2  Regression Tasks

       We consider  now  another type of task, directly  related  to  the cognitive  inference
       process.  We  observe  such  a  process  when  animals  start  a  migration  based  on
       climate  changes  and  physiological  changes  of  their  internal  biological  cycles. In
       daily life, inference is an important tool since it guides decision optimisation. Well-
       known  examples  are,  for  instance,  keeping  the  right  distance  from  the  vehicle
       driving ahead in a road, forecasting weather conditions, predicting firm revenue of
       investment and assessing loan granting based on economic variables.
          Let  us consider an example consisting of forecasting firm A  share value in  the
       stock exchange market, based on past information about: the share values of firm A
       and of other firms; the currency exchange rates; the interest rate. In  this situation
       we want to predict the value of a variable based on a sequence of past values of the
       same and other variables, which in the one-day forecast situation of Figure  1.5 are:
       r,, r~, rc, Euro-USD rate, Interest rate for 6 months.
   15   16   17   18   19   20   21   22   23   24   25