Page 46 - Data Architecture
P. 46

Chapter 1.3: The “Great Divide”
































               Fig. 1.3.5 Services needed by big data.


           The services that surround Hadoop/big data are familiar to anyone that has ever used a
           standard DBMS. The difference is that in a standard DBMS, the services are found in the
           DBMS itself, while in Hadoop, many of the services have to be done externally. A

           second major difference is that throughout the Hadoop/big data environment, there is the
           need to service huge volumes of data. The developer in the Hadoop/big data environment
           must be prepared to manage and handle extremely large volumes of data. This means that
           many infrastructure tasks can be handled only in the Hadoop/big data environment itself.


           Indeed, the Hadoop environment is permeated by the need to be able to handle
           extraordinarily large amounts of data. The need to handle large amounts of data—indeed,
           almost unlimited amounts of data—is seen in Fig. 1.3.6






















                                                                                                                46
   41   42   43   44   45   46   47   48   49   50   51