Page 44 - Data Architecture
P. 44

Chapter 1.3: The “Great Divide”























               Fig. 1.3.3 Hadoop centric unstructured data.


           The center of the Hadoop environment naturally enough is Hadoop. Hadoop is one of the
           technologies by which data can be managed over very large amounts of data. Hadoop/big
           data is at the center of what is known as “big data.” Hadoop is one of the primary storage
           mechanism for big data. The essential characteristics of Hadoop are that Hadoop


               - is capable of managing very large volumes of data,
               - manages data on less expensive storage,
               - manages data by the “Roman census” method,
               - stores data in an unstructured manner.


           Because of these operating characteristics of Hadoop, very large volumes of data can be
           managed. Hadoop is capable of managing volumes of data significantly larger than
           standard relational database management systems.


           The big data technology of Hadoop is depicted in Fig. 1.3.4.

























                                                                                                                44
   39   40   41   42   43   44   45   46   47   48   49