Page 382 -
P. 382

Q9-3  How Do Organizations Use Data Warehouses and Data Marts to Acquire Data?

                                                                                   Data             Data                381
                                                 Operational                     Warehouse
                                                 Databases                       Metadata         Warehouse
                                                                                                  Database


                                                                        Data
                                                   Other              Extraction/           Data               Business
                                                  Internal            Cleaning/           Warehouse           Intelligence
                                                   Data               Preparation          DBMS                 Tools
                                                                      Programs




                                                  External
                                                   Data
                    Figure 9-12
                    Components of a Data                                                                   Business Intelligence
                    Warehouse                                                                                   Users



                                                   Metadata concerning the data—its source, its format, its assumptions and constraints, and
                                               other facts about the data—is kept in a data warehouse metadata database. The data warehouse
                                               DBMS extracts and provides data to BI applications.
                                                   The term business intelligence users is different from knowledge workers in Figure 9-1. BI users
                                               are generally specialists in data analysis, whereas knowledge workers are often nonspecialist users
                                               of BI results. A loan approval officer at a bank is a knowledge worker, but not a BI user.
                                               Problems with Operational Data

                                               Most operational and purchased data has problems that inhibit its usefulness for business intel-
                                               ligence. Figure 9-14 lists the major problem categories. First, although data that is critical for
                                               successful operations must be complete and accurate, marginally necessary data need not be. For
                                               example, some systems gather demographic data in the ordering process. But, because such data
                                               is not needed to fill, ship, and bill orders, its quality suffers.
                    Security concerns about access   Problematic data is termed dirty data. Examples are a value of B for customer gender and of
                    to data are problematic. See the   213 for customer age. Other examples are a value of 999–999–9999 for a U.S. phone number, a
                    Security Guide on pages 406–407    part color of “gren,” and an email address of WhyMe@GuessWhoIAM.org. The value of zero for
                    for more information.
                                               Units in Figure 9-6 is dirty data. All of these values can be problematic for BI purposes.
                                                   Purchased data often contains missing elements. The contact data in Figure 9-6 is a typical
                                               example; orders can be shipped without contact data, so its quality is spotty and has many miss-
                                               ing values. Most data vendors state the percentage of missing values for each attribute in the data




                                                                   •  Name, address, phone  •  Magazine subscriptions
                                                                   •  Age             •  Hobbies
                                                                   •  Gender          •  Catalog orders
                                                                   •  Ethnicity       •  Marital status, life stage
                                                                   •  Religion        •  Height, weight, hair and
                                                                   •  Income               eye color
                                                                   •  Education       •  Spouse name, birth date
                                                                   •   Voter registration  •   Children‘s names and
                    Figure 9-13                                    •  Home ownership       birth dates
                    Examples of Consumer Data                      •  Vehicles
                    That Can Be Purchased
   377   378   379   380   381   382   383   384   385   386   387