TY - GEN
T1 - Semantic Dependent Word Pairs Generative Model for Fine-Grained Product Feature Mining
AU - Zhan, Tian Jie
AU - Li, Chun Hung
N1 - This work is partially supported by HKBU research grant FRG2/09-10/052.
Publisher copyright:
© Springer-Verlag Berlin Heidelberg 2011
PY - 2011/5/9
Y1 - 2011/5/9
N2 - In the field of opinion mining, extraction of fine-grained product feature is a challenging problem. Noun is the most important features to represent product features. Generative model such as the latent Dirichlet allocation (LDA) has been used for detecting keyword clusters in document corpus. As adjectives often dominate review corpus, they are often excluded from the vocabulary in such generative model for opinion sentiment analysis. On the other hand, adjectives provide useful context for noun features as they are often semantically related to the nouns. To take advantage of such semantic relations, dependency tree is constructed to extract pairs of noun and adjective with semantic dependency relation. We propose a semantic dependent word pairs generative model for pairs of noun and adjective for each sentence. Product features and their corresponding adjectives are simultaneously clustered into distinct groups which enable improved accuracy of product features as well as providing clustered adjectives. Experimental results demonstrated the advantage of our models with lower perplexity, average cluster entropies, compared to baseline models based on LDA. Highly semantic cohesive, descriptive and discriminative fine-grained product features are obtained automatically.
AB - In the field of opinion mining, extraction of fine-grained product feature is a challenging problem. Noun is the most important features to represent product features. Generative model such as the latent Dirichlet allocation (LDA) has been used for detecting keyword clusters in document corpus. As adjectives often dominate review corpus, they are often excluded from the vocabulary in such generative model for opinion sentiment analysis. On the other hand, adjectives provide useful context for noun features as they are often semantically related to the nouns. To take advantage of such semantic relations, dependency tree is constructed to extract pairs of noun and adjective with semantic dependency relation. We propose a semantic dependent word pairs generative model for pairs of noun and adjective for each sentence. Product features and their corresponding adjectives are simultaneously clustered into distinct groups which enable improved accuracy of product features as well as providing clustered adjectives. Experimental results demonstrated the advantage of our models with lower perplexity, average cluster entropies, compared to baseline models based on LDA. Highly semantic cohesive, descriptive and discriminative fine-grained product features are obtained automatically.
KW - generative model
KW - Product feature mining
KW - semantic dependency
UR - http://www.scopus.com/inward/record.url?scp=79957939459&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-20841-6_38
DO - 10.1007/978-3-642-20841-6_38
M3 - Conference proceeding
AN - SCOPUS:79957939459
SN - 9783642208409
T3 - Lecture Notes in Computer Science
SP - 460
EP - 475
BT - Advances in Knowledge Discovery and Data Mining
A2 - Huang, Joshua Zhexue
A2 - Cao, Longbing
A2 - Srivastava, Jaideep
PB - Springer Berlin Heidelberg
T2 - 15th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2011
Y2 - 24 May 2011 through 27 May 2011
ER -