Page 41 - Data Architecture
P. 41

Chapter 1.3: The “Great Divide”
           Chapter 1.3



           The “Great Divide”



           Abstract



           Corporate data include everything found in the corporation in the way of data. The most
           basic division of corporate data is by structured data and unstructured data. As a rule,
           there are much more unstructured data than structured data. Unstructured data have two
           basic divisions—repetitive data and nonrepetitive data. Big data is made up of

           unstructured data. Nonrepetitive big data has a fundamentally different form than
           repetitive unstructured big data. In fact, the differences between nonrepetitive big data
           and repetitive big data are so large that they can be called the boundaries of the “great
           divide.” The divide is so large that many professionals are not even aware that there is
           this divide. As a rule, nonrepetitive big data has MUCH greater business value than
           repetitive big data.



           Keywords


           Structured data; Unstructured data; Corporate data; Repetitive data; Nonrepetitive data;
           Business value; The great divide of data; Big data



           Classifying Corporate Data



           Corporate data can be classified in many different ways. One of the major classifications
           is by structured versus unstructured data. And unstructured data can be further broken
           into two categories—repetitive unstructured data and nonrepetitive unstructured data.
           This division of data is shown in Fig. 1.3.1.


















                                                                                                                41
   36   37   38   39   40   41   42   43   44   45   46