Page 145 - Handbook of Deep Learning in Biomedical Engineering Techniques and Applications

P. 145

134 Chapter 5 Depression discovery in cancer communities using deep learning

They further show that incorporating the effect of adjacent words
increases the efﬁciency of SA. There are several limitations of this
work. Firstly, current window size for context is the whole docu-
ment, which needs reﬁnement. Second, there needs to be a strat-
egy for smart ﬁltering of useless context terms to reduce the size of
the contextualized lexicon and improve its quality. Thirdly, this
approach takes into account context only for ambiguous terms
and not for all the terms in the document.
In Ref. [52], the authors carry out SA on Twitter using POS-
speciﬁc prior polarity attributes. They investigate using a tree
kernel for an efﬁcient tree representation of tweets to enable the
amalgamation of several types of attributes appropriately. The
major drawbacks of their approach are that parsing, SA, and topic
modeling are not considered.
Authors in Ref. [53] carry out SA on tweets using the Weka data
mining APIs. First, they use a sentiment lexicon and then research
in what manner to train the classiﬁers and deﬁne which is better
for tweet classiﬁcation. The main limitations of their approach are
no enrichment of classiﬁcation rules and ignoring of the SVM clas-
siﬁer, which is generally very effective in text processing and SA
applications.
In Ref. [74], the author has proposed an approach that solves
the problem of k-mean clustering and supervised ML algorithm
for the classiﬁcation problem. The main problem of k-means is
to identify the value of “k,” which the author has sorted with lex-
ical chain approach that helped in ﬁnding the total number of
broad categories found in the given dataset. Based on the value
of “k,” the clustered data set is passed for the classiﬁcation using
supervised algorithm that actually helps in improving the super-
vised algorithms in terms of time efﬁciency and accuracy both.
Such model falls under the category of semisupervised models.

2.4Other techniques
These constitute of approaches which are neither strictly ML
based nor lexicon based. Formal concept analysis (FCA) is one
such methodology. Fuzzy formal concept analysis (FFCA) is an
FCA variant that deals with ambiguity and vague facts and ﬁg-
ures. FFCA has been effectively used in the information ﬁeld in
the past [54].
Authors in Ref. [55] propose the use of FCA and FFCA for SA of
movies and e-book reviews corpora. They propose an approach
that utilizes FFCA to visualize the reviews as concepts. They use
training instances to decrease the random results triggered by
vague and confusing phrases. Using FFCA, they train a classiﬁer

140 141 142 143 144 145 146 147 148 149 150