Page 289 -
P. 289

272                                                              Chapter 8



               techniques, and database technology, data mining detects hidden patterns and subtle
               relationships in data and infers rules that allow the prediction of future results. Raw
               data are analyzed to put forth a model that attempts to explain the observed patterns.
               This model can then be used to predict future occurrences, and to forecast expected
               outcomes (see   fi gure 8.2 ).
                    A large number of inputs are required, usually over a signifi cant period of time, and
               the types of models produced range from easy to almost impossible to understand.
               Easy to understand models are decision trees, for example. Regression analyses are
               moderately easy to understand and neural networks remain black boxes. The major
               drawback of the black box models is that it becomes very diffi cult to hypothesize about
               causal relationships (see   fi gure 8.3 ).






                 Historical
                   data


                                           If                Then xxxx


                               Data
                              mining



                                       If                      Then yyyy




                 Figure 8.2
                 Predictive models


                      Age

                                                    How well will the
                 Education
                                                    student perform on
                                                    the entrance exam?
               Eye color
                                 Model
                 Figure 8.3
                 Black box models
   284   285   286   287   288   289   290   291   292   293   294