Page 19 - Big Data Analytics for Intelligent Healthcare Management
P. 19

1.3 BIO-INSPIRED ALGORITHMS FOR BIG DATA ANALYTICS: A TAXONOMY                 9




               two types: large-vocabulary continuous speech recognition (LVCSR) and phonetic-based technique.
               LVCSR performs indexing (to transliterate the speech content of audio) followed by searching (to find
               an index-term). The phonetic-based technique deals with phonemes or sounds and performs phonetic
               indexing and searching.
                  Video analytics visualize, examine, and extract meaningful information from video streams such as
               CCTV footage, live streaming of sport matches etc. Video analytics can be performed at end devices
               (edge) or centralized systems (server).
                  Social media analytics examines the unstructured or structured data of social media websites
               (a platform that enables an exchange of information among users) such as Facebook, Twitter etc. There
               are two kinds of social media analytics: (1) content-based (data posted by users) or (2) structure-based
               (synthesizing the structural attributes). Predictive analytics is a method that uses historical and current
               data to predict future outcomes, which can be done based on: heterogeneity (data coming from different
               sources), noise accumulation (an estimation error during interpretation of data), spurious correlation
               (uncorrelated variable due to huge size of dataset), or incidental endogeneity (predictors or explanatory
               variables, which are independent of the residual term).
                  Fig. 1.9 shows the different parameters that are considered in different bio-inspired algorithms for
               big data analytics.
                  There are four types of data mining techniques as studied from literature: classification, prediction,
               clustering, or association. In classification, model attributes are used to arrange the data in a different
               set of categories. The prediction technique is used to find out the unknown values. Clustering is an


                                                                                 Hbase
                                                      Scalability
                                                                                Cassandra
                                                    NoSQL server                MongodB

                                                       Storage                  Couchbase
                                                    Fault tolerance              Neo4J

                                                       Agility
                                                                               Classification
                          Parameters for big         Virtualization
                           data analytics                                       Prediction
                                                   Analytical technique
                                                                                Clustering
                                                       Cost
                                                                                Association
                                                     Ease of use
                                                                                Reactive
                                                      Mechanism
                                                                                Proactive
                                                    Data management

               FIG. 1.9
               Parameters of different bio-inspired algorithms for big data analytics.
   14   15   16   17   18   19   20   21   22   23   24