Page 73 - Data Architecture
P. 73

Chapter 1.6: The Life Cycle of Data: Understanding Data Over Time
           The life cycle of data shows that raw data enter the corporate information systems. The
           entry of raw data can be made in many ways. The customer may do a transaction, and
           the data are captured as a by-product of the transaction. An analog computer may make a
           reading, and the data are entered as part of the analog processing. A customer may

           initiate an activity (such as make a phone call), and a computer captures that information.
           There are many ways that data can enter the information systems of the corporation.


           After the raw detailed data have entered the system, the next step is that the raw detailed
           data pass through a capture/edit process. In the capture/edit process, the raw detailed
           data pass through a basic edit process. In the edit process, the raw detailed data can be
           adjusted (or even rejected). In general, the data that enter the information systems of the
           corporation are at the most detailed level.


           After the raw detailed data have passed through the edit/capture process, the raw detailed
           data then go through an organization process. The organization process can be as simple
           as simple indexing the data. Or the raw detailed data may be subjected to an elaborate
           filtering/calculation/merging process. At this point, the raw detailed data are like putty
           that can be shaped in many ways by the system designer.


           Once the raw detailed data have passed through the organization process, the data are
           then fit to be stored. The data can be stored in a standard DBMS or in big data (or in
           other forms of storage). After the data are stored, before the data are fit for analysis, it
           typically passes through an integration process. The purpose of the integration process is
           to restructure the data so that they are fit to be combined with other types of data.


           It is at this point that the data enter the cycle of usefulness. The cycle of usefulness will
           be discussed at length later. After the data have fulfilled its usefulness, the data can be

           either archived or discarded.

           The life cycle of data that has been described is for raw detailed data. There is a slightly

           different life cycle of data for summarized or aggregated data.

           The life cycle of summarized or aggregated data is seen in Fig. 1.6.3.














                                                                                                                73
   68   69   70   71   72   73   74   75   76   77   78