TY - JOUR
T1 - Comparison of feature-level learning methods for mining online consumer reviews
AU - Chen, Li
AU - Qi, Luole
AU - Wang, Feng
N1 - Funding Information:
This research has been supported by Hong Kong Baptist University Faculty Research Grant ( HKBU/FRG2/10-11/041 ).
PY - 2012/8
Y1 - 2012/8
N2 - The tasks of feature-level opinion mining usually include the extraction of product entities from consumer reviews, the identification of opinion words that are associated with the entities, and the determining of these opinions' polarities (e.g.; positive, negative, or neutral). In recent years, two major approaches have been proposed to determine opinions at the feature level: model based methods such as the one based on lexicalized Hidden Markov Model (L-HMMs), and statistical methods like the association rule mining based technique. However, little work has compared these algorithms regarding their practical abilities in identifying various types of review elements, such as features, opinions, intensifiers, entity phrases and infrequent entities. On the other hand, little attentions has been paid to applying more discriminative learning models to accomplish these opinion mining tasks. In this paper, we not only experimentally compared these methods based on a real-world review dataset, but also in particular adopted the Conditional Random Fields (CRFs) model and evaluated its performance in comparison with related algorithms. Moreover, for CRFs-based mining algorithm, we tested the role of a self-tagging process in two automatic training conditions, and further identified the ideal combination of learning functions to optimize its learning performance. The comparative experiment eventually revealed the CRFs-based method's outperforming accuracy in terms of mining multiple review elements, relative to other methods.
AB - The tasks of feature-level opinion mining usually include the extraction of product entities from consumer reviews, the identification of opinion words that are associated with the entities, and the determining of these opinions' polarities (e.g.; positive, negative, or neutral). In recent years, two major approaches have been proposed to determine opinions at the feature level: model based methods such as the one based on lexicalized Hidden Markov Model (L-HMMs), and statistical methods like the association rule mining based technique. However, little work has compared these algorithms regarding their practical abilities in identifying various types of review elements, such as features, opinions, intensifiers, entity phrases and infrequent entities. On the other hand, little attentions has been paid to applying more discriminative learning models to accomplish these opinion mining tasks. In this paper, we not only experimentally compared these methods based on a real-world review dataset, but also in particular adopted the Conditional Random Fields (CRFs) model and evaluated its performance in comparison with related algorithms. Moreover, for CRFs-based mining algorithm, we tested the role of a self-tagging process in two automatic training conditions, and further identified the ideal combination of learning functions to optimize its learning performance. The comparative experiment eventually revealed the CRFs-based method's outperforming accuracy in terms of mining multiple review elements, relative to other methods.
KW - Association rule mining
KW - Conditional Random Fields (CRFs)
KW - Consumer reviews
KW - E-commerce
KW - Feature-level opinion mining
KW - Lexicalized Hidden Markov Model (L-HMMs)
UR - http://www.scopus.com/inward/record.url?scp=84862783728&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2012.02.158
DO - 10.1016/j.eswa.2012.02.158
M3 - Journal article
AN - SCOPUS:84862783728
SN - 0957-4174
VL - 39
SP - 9588
EP - 9601
JO - Expert Systems with Applications
JF - Expert Systems with Applications
IS - 10
ER -