Page 395 -
P. 395

Chapter 9  Business Intelligence Systems
                394
























                Figure 9-22
                Credit Score Decision Tree
                Source: Used with permission of
                TIBCO Software Inc. Copyright ©
                1999-2005 TIBCO Software Inc. All
                rights reserved.



                                            Of course, the financial institution will need to combine this risk data with an economic analysis
                                            of the value of each loan to determine which loans to take.
                                               Decision  trees are easy  to understand and, even better, easy  to implement using decision
                                            rules. They also can work with many types of variables, and they deal well with partial data.
                                            Organizations can use decision trees by themselves or combine them with other techniques. In
                                            some cases, organizations use decision trees to select variables that are then used by other types of
                                            data mining tools. For example, decision trees can be used to identify good predictor variables for
                                            neural networks.



                         Q9-6               How Do Organizations Use BigData Applications?



                                            BigData (also spelled Big Data) is a term used to describe data collections that are characterized
                                            by huge volume, rapid velocity, and great variety. In general, the following statements are true of
                                            BigData:
                                               •  BigData data sets are at least a petabyte in size, and usually larger.
                                               •  BigData is generated rapidly.
                                               •  BigData has structured data, free-form text, log files, possibly graphics, audio, and video.


                                            MapReduce

                                            Because BigData is huge, fast, and varied, it cannot be processed using traditional techniques.
                                            MapReduce is a  technique for harnessing  the  power of  thousands of computers working in
                                              parallel. The basic idea is that the BigData collection is broken into pieces, and hundreds or thou-
                                            sands of independent processors search these pieces for something of interest. That process is
   390   391   392   393   394   395   396   397   398   399   400