Page 12 - Big Data Analytics for Intelligent Healthcare Management
P. 12

2       CHAPTER 1 BIO-INSPIRED ALGORITHMS FOR BIG DATA ANALYTICS




                                             Dimensions of data management


                        Volume        Variety       Velocity       Veracity    Variability

             FIG. 1.1
             Dimensions of data management.



                                                   Variety of data

                                                                        Cloud   Machine to
                       Text  Audio    Video   Social  Transactional  Operational  service  machine data

             FIG. 1.2
             Variety of data.




             1.1.1 DIMENSIONS OF DATA MANAGEMENT
             As identified from existing literature [1–6], there are five kinds of dimensions of data, which are re-
             quired for effective management. Fig. 1.1 shows the dimensions of data management for big data an-
             alytics: (1) volume, (2) variety, (3) velocity, (4) veracity, and (5) variability.
                The Volume represents the magnitude of data in terms of data sizes (terabytes or petabytes). For
             example, Facebook processes a large amount of data such as millions of photographs and videos. Va-
             riety refers to heterogeneity in a dataset, which can be different types of data. Fig. 1.2 shows the variety
             of data, which can be text, audio, video, social, transactional, operational, cloud service, or machine to
             machine data (M2M data).
                Velocity refers to the rate of production of data and analysis for processing a huge amount of data.
             For example, velocity can be 250MB/minute or more [3]. Veracity refers to abnormality, noise, and
             biases in data, while variability refers to the change in the rate of flow of data for generation and
             analysis.
                The rest of the chapter is organized as follows. In Section 1.2, we present the big data analytical
             model. In Section 1.3, we propose the taxonomy of bio-inspired algorithms for big data analytics. In
             Section 1.4, we analyze research gaps and present some promising directions toward future research in
             this area. Finally, we summarize the findings and conclude the chapter in Section 1.5.





             1.2 BIG DATA ANALYTICAL MODEL
             Big data analytics is a term, which is a combination of “big data” and “deep analysis” as shown in
             Fig. 1.3. Every minute, a large amount of user data is being transferred from one device to another
             device, which needs high processing power to perform data mining for the extraction of useful infor-
             mation from the database. Fig. 1.3 shows the model for big data analytics, which shows that an OLTP
             (on-line transaction processing) system creates data (txn data). A data cube represents a big data, out of
   7   8   9   10   11   12   13   14   15   16   17