Page 50 - Data Architecture
P. 50

Chapter 1.3: The “Great Divide”
               Fig. 1.3.9 Some of the services needed to turn unstructured into structured data.


           There is a concern regarding the volume of data that is managed by textual
           disambiguation. But the volume of data that can be processed is secondary to the
           transformation of data that occurs during the transformation process. Simply stated, it
           doesn’t matter how fast you can process data if you cannot understand what it is that you
           are processing. The fact that textual disambiguation is dominated by transformation is

           depicted in Fig. 1.3.10.





























               Fig. 1.3.10 Transformation.


           There is then a completely different emphasis on the processing that occurs in the
           repetitive unstructured world versus the processing that occurs in the nonrepetitive

           unstructured world.


           Different Worlds



           This difference is seen in Fig. 1.3.11.















                                                                                                                50
   45   46   47   48   49   50   51   52   53   54   55