Page 32 - Data Architecture
P. 32

Chapter 1.2: The Data Infrastructure
           Chapter 1.2



           The Data Infrastructure



           Abstract



           Corporate data include everything found in the corporation in the way of data. The most
           basic division of corporate data is by structured data and unstructured data. As a rule,
           there are much more unstructured data than structured data. Unstructured data have two
           basic divisions—repetitive data and nonrepetitive data. Big data is made up of

           unstructured data. Nonrepetitive big data has a fundamentally different form than
           repetitive unstructured big data. In fact, the differences between nonrepetitive big data
           and repetitive big data are so large that they can be called the boundaries of the “great
           divide.” The divide is so large; many professionals are not even aware that there is this
           divide. As a rule, nonrepetitive big data has MUCH greater business value than repetitive
           big data.



           Keywords


           Structured data; Unstructured data; Corporate data; Repetitive data; Nonrepetitive data;
           Business value; The great divide of data; Big data


           If there is any secret to data management and data architecture, it is understanding data
           in terms of its infrastructure. Stated differently, trying to understand the larger
           architecture under which data are managed and operate is almost impossible without
           understanding the underlying infrastructure, which surrounds data. Therefore, we shall
           spend some time understanding infrastructure.



           Two Types of Repetitive Data



           A good starting point for understanding infrastructure is to start with the observation that
           there are two types of repetitive data found in corporate data. In the structured side of
           corporate data, repetitive data are found. In the unstructured big data side of corporate
           data, repetitive data are also found. Despite the fact that the types of data sound the
           same, there are significant differences between the different types of repetitive data.
           When it comes to structured repetitive data, it is normal to have transactions as part of


                                                                                                                32
   27   28   29   30   31   32   33   34   35   36   37