Page 323 - Data Architecture
P. 323

Chapter 8.2: Big Data/Existing System Interface



































               Fig. 8.2.6 Nonrepetitive data can contain raw data and context enriched data.


           In Fig. 8.2.6, it is seen that there is the division of big data into the repetitive and
           nonrepetitive sections. However, in the repetitive section, it is seen that when content-
           enriched big data is added to the big data environment, those content-enriched data
           simply become another type of repetitive data. Stated differently, there are two types of

           repetitive data in big data—simple repetitive data and content-enriched repetitive data.

           This division becomes important when doing analytic processing. Repetitive data are

           analyzed in a completely different fashion than content-enriched repetitive data.


           Analyzing Structured Data/Unstructured Data Together



           The final interface of interest in the big data environment is those data that have come
           from big data either through the distillation process or textual disambiguation. The data
           that arrive here can be placed into a standard DBMS.


           Fig. 8.2.7 shows the database that has been created from unstructured data being placed
           in the same environment as the classical data warehouse. Of course, the data in the

           classical data warehouse have been created from structured data entirely.



                                                                                                               323
   318   319   320   321   322   323   324   325   326   327   328