Page 138 - Handbook of Deep Learning in Biomedical Engineering Techniques and Applications
Chapter 5: Depression discovery in cancer communities using deep learning
Authors in Ref. [28] detect favorable and unfavorable opinions
in web pages and news articles using a sentiment lexicon. They
first identify the subjective or sentiment-bearing expressions and
then determine the relationship of these expressions to their
subject. Finally, they compute document sentiment by evaluating the
polarity of the subjective expressions. They tag verbs as sentiment
transmitters that transfer sentiment from one argument to
the other. For example, in the sentence "this book is good," the
verb "is" transmits the sentiment extracted from "good" to the subject
"book" and associates the two. Terms with part-of-speech tags
other than verb are dealt with in a simpler way: they directly
transfer their sentiment to the related argument. Their approach
has certain weaknesses. It cannot resolve coreferences, that is,
expressions that refer to the same entity in the text. Moreover,
it has no mechanism in place for dealing with negations.
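The verb-as-transmitter idea can be illustrated with a toy sketch. This is not the actual system of Ref. [28]; the lexicon, the set of transmitter verbs, and the `subject_polarity` helper are simplified assumptions covering only flat "subject, verb, adjective" clauses.

```python
# Toy sketch of lexicon-based sentiment transfer (illustrative only,
# not the system of Ref. [28]). A copular verb such as "is" passes
# the polarity of the predicate adjective back to the subject noun.
SENTIMENT_LEXICON = {"good": 1, "great": 1, "bad": -1, "poor": -1}
TRANSMITTER_VERBS = {"is", "was", "seems"}

def subject_polarity(tokens):
    """Return (subject, polarity) for a simple 'SUBJ VERB ADJ' clause."""
    for i, tok in enumerate(tokens):
        if tok in TRANSMITTER_VERBS and i > 0 and i + 1 < len(tokens):
            adjective = tokens[i + 1]
            if adjective in SENTIMENT_LEXICON:
                # The verb transmits the adjective's sentiment to the
                # word immediately before it (the subject).
                return tokens[i - 1], SENTIMENT_LEXICON[adjective]
    return None, 0

print(subject_polarity(["this", "book", "is", "good"]))  # ('book', 1)
```

Note how brittle this is: it handles neither coreference ("it is good") nor negation ("this book is not good"), the same weaknesses the text identifies in the original approach.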
2.2 Machine learning-based approaches
ML approaches employ well-known ML algorithms that
treat SA as a regular text classification problem making use
of linguistic features. Most ML-based methods use a supervised
ML classifier.
2.2.1 Supervised machine learning
Probabilistic or generative classifiers: This set of classifiers uses
mixture models for classification and works on the assumption that
each of the categories is a component of the mixture. Each
mixture component is a generative model that gives the likelihood
of a specific word being selected for that component. The three
most popular generative classifiers are the Bayesian network (BN),
the naïve Bayes (NB) classifier, and the maximum entropy classifier
(MaxEnt).
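As a concrete illustration of the generative view, here is a minimal multinomial naïve Bayes classifier written from scratch. The `train_nb`/`classify_nb` helpers and the two-document toy corpus are illustrative assumptions; a real system would use an established library implementation.

```python
import math
from collections import Counter, defaultdict

def train_nb(docs):
    """docs: list of (tokens, label) pairs. Returns the NB model:
    class priors, per-class word counts, per-class totals, vocabulary."""
    priors, counts, totals = Counter(), defaultdict(Counter), Counter()
    for tokens, label in docs:
        priors[label] += 1
        counts[label].update(tokens)
        totals[label] += len(tokens)
    vocab = {w for c in counts.values() for w in c}
    return priors, counts, totals, vocab

def classify_nb(tokens, model):
    """Pick the class maximizing log P(class) + sum log P(word|class)."""
    priors, counts, totals, vocab = model
    n_docs = sum(priors.values())
    best, best_lp = None, float("-inf")
    for label in priors:
        lp = math.log(priors[label] / n_docs)
        for w in tokens:
            # Laplace (add-one) smoothing over the vocabulary
            lp += math.log((counts[label][w] + 1) / (totals[label] + len(vocab)))
        if lp > best_lp:
            best, best_lp = label, lp
    return best

docs = [(["good", "great"], "pos"), (["bad", "poor"], "neg")]
model = train_nb(docs)
print(classify_nb(["good"], model))  # pos
```

Each class acts as one mixture component: training estimates the probability of each word being generated by that component, and classification asks which component most plausibly generated the document.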
Linear classifiers: Given that X = {x1, ..., xn} is the normalized
document word-frequency vector, A = {a1, ..., an} is a vector of linear
coefficients with the same dimensionality as the feature space, and b
is a scalar, the output of the linear predictor is defined as
p = A · X + b, which is the output of the linear classifier. The
predictor p defines a separating hyperplane between the different
classes. Two of the most popular linear classifiers are support
vector machines (SVMs) [29,30] and neural networks (NNs).
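The decision rule p = A · X + b can be sketched in a few lines. The coefficient values and the `linear_predict` helper below are toy assumptions chosen for illustration; in practice A and b are learned by an SVM or NN training procedure.

```python
# Sketch of a linear classifier's decision rule: p = A . X + b.
# The sign of p indicates which side of the separating hyperplane
# the document falls on.
def linear_predict(X, A, b):
    """Compute the linear score and map its sign to a class label."""
    p = sum(a * x for a, x in zip(A, X)) + b
    return "positive" if p > 0 else "negative"

X = [0.5, 0.0, 0.5]   # normalized word frequencies of a document
A = [2.0, -1.0, 1.0]  # linear coefficients (toy values)
print(linear_predict(X, A, b=-0.5))  # positive
```

SVMs and NNs differ mainly in how A and b are obtained: an SVM chooses the hyperplane maximizing the margin between classes, while a single-layer NN adjusts the weights by gradient descent on a loss function.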
Decision tree classifiers: A decision tree classifier provides a
hierarchical decomposition of the training data space, in which a
condition on a feature value splits the data [31]. The condition is the