Semantic Dependent Word Pairs Generative Model for Fine-Grained Product Feature Mining

Tian Jie Zhan*, Chun Hung Li

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

16 Citations (Scopus)

Abstract

In the field of opinion mining, extraction of fine-grained product feature is a challenging problem. Noun is the most important features to represent product features. Generative model such as the latent Dirichlet allocation (LDA) has been used for detecting keyword clusters in document corpus. As adjectives often dominate review corpus, they are often excluded from the vocabulary in such generative model for opinion sentiment analysis. On the other hand, adjectives provide useful context for noun features as they are often semantically related to the nouns. To take advantage of such semantic relations, dependency tree is constructed to extract pairs of noun and adjective with semantic dependency relation. We propose a semantic dependent word pairs generative model for pairs of noun and adjective for each sentence. Product features and their corresponding adjectives are simultaneously clustered into distinct groups which enable improved accuracy of product features as well as providing clustered adjectives. Experimental results demonstrated the advantage of our models with lower perplexity, average cluster entropies, compared to baseline models based on LDA. Highly semantic cohesive, descriptive and discriminative fine-grained product features are obtained automatically.

Original languageEnglish
Title of host publicationAdvances in Knowledge Discovery and Data Mining
Subtitle of host publication15th Pacific-Asia Conference, PAKDD 2011, Shenzhen, China, May 24-27, 2011, Proceedings, Part I
EditorsJoshua Zhexue Huang, Longbing Cao, Jaideep Srivastava
PublisherSpringer Berlin Heidelberg
Pages460-475
Number of pages16
Edition1st
ISBN (Electronic)9783642208416
ISBN (Print)9783642208409
DOIs
Publication statusPublished - 9 May 2011
Event15th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2011 - Shenzhen, China
Duration: 24 May 201127 May 2011

Publication series

NameLecture Notes in Computer Science
Volume6634
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349
NameLecture Notes in Artificial Intelligence
NamePAKDD: Pacific-Asia Conference on Knowledge Discovery and Data Mining

Conference

Conference15th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2011
Country/TerritoryChina
CityShenzhen
Period24/05/1127/05/11

User-Defined Keywords

  • generative model
  • Product feature mining
  • semantic dependency

Fingerprint

Dive into the research topics of 'Semantic Dependent Word Pairs Generative Model for Fine-Grained Product Feature Mining'. Together they form a unique fingerprint.

Cite this