Page 145 - Handbook of Deep Learning in Biomedical Engineering Techniques and Applications
P. 145

134   Chapter 5 Depression discovery in cancer communities using deep learning




                                    They further show that incorporating the effect of adjacent words
                                    increases the efficiency of SA. There are several limitations of this
                                    work. Firstly, current window size for context is the whole docu-
                                    ment, which needs refinement. Second, there needs to be a strat-
                                    egy for smart filtering of useless context terms to reduce the size of
                                    the contextualized lexicon and improve its quality. Thirdly, this
                                    approach takes into account context only for ambiguous terms
                                    and not for all the terms in the document.
                                       In Ref. [52], the authors carry out SA on Twitter using POS-
                                    specific prior polarity attributes. They investigate using a tree
                                    kernel for an efficient tree representation of tweets to enable the
                                    amalgamation of several types of attributes appropriately. The
                                    major drawbacks of their approach are that parsing, SA, and topic
                                    modeling are not considered.
                                       Authors in Ref. [53] carry out SA on tweets using the Weka data
                                    mining APIs. First, they use a sentiment lexicon and then research
                                    in what manner to train the classifiers and define which is better
                                    for tweet classification. The main limitations of their approach are
                                    no enrichment of classification rules and ignoring of the SVM clas-
                                    sifier, which is generally very effective in text processing and SA
                                    applications.
                                       In Ref. [74], the author has proposed an approach that solves
                                    the problem of k-mean clustering and supervised ML algorithm
                                    for the classification problem. The main problem of k-means is
                                    to identify the value of “k,” which the author has sorted with lex-
                                    ical chain approach that helped in finding the total number of
                                    broad categories found in the given dataset. Based on the value
                                    of “k,” the clustered data set is passed for the classification using
                                    supervised algorithm that actually helps in improving the super-
                                    vised algorithms in terms of time efficiency and accuracy both.
                                    Such model falls under the category of semisupervised models.

                                    2.4Other techniques
                                       These constitute of approaches which are neither strictly ML
                                    based nor lexicon based. Formal concept analysis (FCA) is one
                                    such methodology. Fuzzy formal concept analysis (FFCA) is an
                                    FCA variant that deals with ambiguity and vague facts and fig-
                                    ures. FFCA has been effectively used in the information field in
                                    the past [54].
                                       Authors in Ref. [55] propose the use of FCA and FFCA for SA of
                                    movies and e-book reviews corpora. They propose an approach
                                    that utilizes FFCA to visualize the reviews as concepts. They use
                                    training instances to decrease the random results triggered by
                                    vague and confusing phrases. Using FFCA, they train a classifier
   140   141   142   143   144   145   146   147   148   149   150