Page 118 -
P. 118

Chapter 3  •  Data Warehousing  117

                    ineffective use. Mehra (2005) indicated that few organizations really understand metadata,
                    and fewer understand how to design and implement a metadata strategy. Metadata are
                    generally defined in terms of usage as technical or business metadata. Pattern is another
                    way to view metadata. According to the pattern view, we can differentiate between syn-
                    tactic metadata (i.e., data describing the syntax of data), structural metadata (i.e., data
                    describing the structure of the data), and semantic metadata (i.e., data describing the
                    meaning of the data in a specific domain).
                        We next explain traditional metadata patterns and insights into how to implement
                    an  effective metadata strategy via a holistic approach to enterprise metadata integration.
                    The approach includes ontology and metadata registries; enterprise information integration
                    (EII); extraction, transformation, and load (ETL); and service-oriented  architectures (SOA).
                    Effectiveness, extensibility, reusability, interoperability, efficiency and performance,  evolution,
                    entitlement, flexibility, segregation, user interface, versioning, versatility, and low maintenance
                    cost are some of the key requirements for building a successful  metadata-driven enterprise.
                        According to Kassam (2002), business metadata comprise information that increases
                    our understanding of traditional (i.e., structured) data. The primary purpose of metadata
                    should be to provide context to the reported data; that is, it provides enriching informa-
                    tion that leads to the creation of knowledge. Business metadata, though difficult to pro-
                    vide efficiently, release more of the potential of structured data. The context need not
                    be the same for all users. In many ways, metadata assist in the conversion of data and
                    information into knowledge. Metadata form a foundation for a metabusiness architecture
                    (see Bell, 2001). Tannenbaum (2002) described how to identify metadata requirements.
                    Vaduva and Vetterli (2001) provided an overview of metadata management for data ware-
                    housing. Zhao (2005) described five levels of metadata management maturity: (1)  ad
                    hoc, (2) discovered, (3) managed, (4) optimized, and (5) automated. These levels help in
                    understanding where an organization is in terms of how and how well it uses its metadata.
                        The design, creation, and use of metadata—descriptive or summary data about
                    data—and its accompanying standards may involve ethical issues. There are ethical
                    considerations involved in the collection and ownership of the information contained
                    in  metadata, including privacy and intellectual property issues that arise in the design,
                      collection, and dissemination stages (for more, see Brody, 2003).

                    sectiOn 3.2 revieW QuestiOns
                      1. What is a data warehouse?
                      2. How does a data warehouse differ from a database?
                      3. What is an ODS?
                      4. Differentiate among a data mart, an ODS, and an EDW.
                      5. Explain the importance of metadata.


                    3.3  Data Warehousing proCess overvieW
                    Organizations, private and public, continuously collect data, information, and knowledge
                    at an increasingly accelerated rate and store them in computerized systems. Maintaining
                    and using these data and information becomes extremely complex, especially as
                      scalability issues arise. In addition, the number of users needing to access the informa-
                    tion continues to increase as a result of improved reliability and availability of network
                    access, especially the Internet. Working with multiple databases, either integrated in a
                    data warehouse or not, has become an extremely difficult task requiring considerable
                    expertise, but it can provide immense benefits far exceeding its cost. As an illustrative
                    example, Figure 3.2 shows business benefits of the enterprise data warehouse built by
                    Teradata for a major automobile manufacturer.







           M03_SHAR9209_10_PIE_C03.indd   117                                                                     1/25/14   7:35 AM
   113   114   115   116   117   118   119   120   121   122   123