Page 160 - Building Big Data Applications

Chapter 9   Governance 159


                   The value of having metadata can be easily established in these situations by
                 measuring the cost impact with and without metadata:
                   Cost of commissioning new applications
                   Learning curve for new employees
                   Troubleshooting application problems
                   Creating new business intelligence and analytics applications
                   Data auditing
                   Compliance auditing
                   Traditionally, in the world of data management, metadata has often been ignored and
                 implemented only as a post-implementation process. When you start looking at big data,
                 you need to create a strong metadata library, because you will have no prior knowledge of
                 the content or format of the data that you need to process. Remember that in the big data
                 world we ingest and process data, then tag it, and only after these steps consume it.
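
                   The ingest, tag, and consume sequence described above can be sketched as follows.
                 This is a minimal illustration, not a reference design: the function names
                 (`infer_schema`, `ingest_and_tag`, `consume`), the in-memory dictionary that stands in
                 for a metadata library, and the sample JSON records are all assumptions made for the
                 example.

```python
import json
from datetime import datetime, timezone


def infer_schema(record):
    """Map each field name to the Python type name of its value."""
    return {key: type(value).__name__ for key, value in record.items()}


def ingest_and_tag(raw_lines, source, library):
    """Parse raw JSON lines, then record what was learned about them."""
    # Ingest and process: the format is only discovered while parsing.
    records = [json.loads(line) for line in raw_lines]

    # Build a field-to-type map; if a field changes type, the last one seen wins.
    fields = {}
    for record in records:
        fields.update(infer_schema(record))

    # Tag: register the discovered metadata under the source name.
    library[source] = {
        "source": source,
        "record_count": len(records),
        "fields": fields,
        "ingested_at": datetime.now(timezone.utc).isoformat(),
    }
    return records


def consume(source, records, library, field):
    """Read a field only if the metadata library describes it."""
    if field not in library[source]["fields"]:
        raise KeyError(f"{field!r} is not described for source {source!r}")
    return [record.get(field) for record in records]


# Hypothetical sample data; the second record carries an extra field.
metadata_library = {}
raw = [
    '{"user": "a1", "clicks": 3}',
    '{"user": "b2", "clicks": 5, "region": "EU"}',
]
records = ingest_and_tag(raw, "clickstream", metadata_library)
clicks = consume("clickstream", records, metadata_library, "clicks")
```

                   The point of the sketch is the ordering: the consumer consults the metadata library
                 rather than guessing at the structure of the data it receives.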
                   Let us revisit the metadata subject area first and then understand how it integrates
                 into the world of big data.
                   Sources of metadata include the following:

                   Metadata generated automatically for data and business rules
                   Metadata created by the designer of the systems
                   Metadata procured from third-party sources
                   There are fundamentally five types of metadata that are useful for information
                 technology and data management across the enterprise, from transaction processing to
                 analytical and reporting platforms. These include the following:
                   Technical metadata: consists of metadata that is associated with data
                   transformation rules, data storage structures, semantic layers, and interface layers.
                     Metadata for the data model and physical database includes the length of a
                      field, the shape of a data structure, the name of a table, the physical
                      characteristics of a field, the number of bytes in a table, the indexes on a
                      table, and the DDL for the table.
                     Business processing metadata includes information about:
                      - The system of record for a specific piece of data,
                      - Which transformations were performed on which source data to produce data
                         in the data warehouse/data mart,
                      - Tables and columns used in a particular process in the data warehouse/data
                         mart, and what the transformations mean
                     Administrative metadata.
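
                   As an illustration of the data model and physical database metadata listed above
                 (table name, field definitions, indexes, and DDL), the following sketch reads that
                 information from a database catalog. It assumes SQLite through Python's standard
                 `sqlite3` module purely for convenience; the `technical_metadata` helper and the
                 `customer` table are hypothetical, and other database engines expose the same kind of
                 information through their own catalogs.

```python
import sqlite3


def technical_metadata(conn, table):
    """Collect table name, column definitions, indexes, and DDL for one table."""
    # PRAGMA statements do not accept bound parameters, so the table name is
    # interpolated directly; only pass trusted names to this helper.
    columns = [
        {
            "name": row[1],            # column name
            "declared_type": row[2],   # type as declared, including any length
            "not_null": bool(row[3]),
            "primary_key": bool(row[5]),
        }
        for row in conn.execute(f"PRAGMA table_info({table})")
    ]
    indexes = [row[1] for row in conn.execute(f"PRAGMA index_list({table})")]
    ddl = conn.execute(
        "SELECT sql FROM sqlite_master WHERE type = 'table' AND name = ?",
        (table,),
    ).fetchone()[0]
    return {"table": table, "columns": columns, "indexes": indexes, "ddl": ddl}


# Hypothetical table used only to have something to describe.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (id INTEGER PRIMARY KEY, name VARCHAR(40) NOT NULL)"
)
conn.execute("CREATE INDEX idx_customer_name ON customer (name)")

meta = technical_metadata(conn, "customer")
for column in meta["columns"]:
    print(column["name"], column["declared_type"])
print(meta["indexes"])
print(meta["ddl"])
```

                   The sketch does not cover the number of bytes in a table, which depends on
                 engine-specific storage statistics.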
                   Business metadata: refers to the data describing the content available in the
                   data warehouse/data mart, and describes the following:
                     The structure of the data
                     The values that are stored within the attributes