Sense prediction study: Two corpus-driven linguistic approaches

Jia-Fei Hong, Sue-Jin Ker, Kathleen Virginia Ahrens, Chu-Ren Huang

Research output: Chapter in book/report/conference proceedingConference proceeding

10 Downloads (Pure)


In this study, we propose to use two corpus-driven linguistic approaches for a sense prediction study. We will concentrate on the character similarity clustering approach and concept similarity clustering approach to predict the senses of non-assigned words by using corpora and tools, such as Chinese Gigaword Corpus, and HowNet. In this study, we would then like to evaluate their predictions via the sense divisions of Chinese Wordnet (CWN) and Xiandai Hanyu Cidian (Xian Han). Using these corpora, we will determine their clusters of our four target words ---- chi1 “eat”, wan2 “play”, huan4 “change” and shao1 “burn” in order to predict their all possible senses and evaluate them. This requirement will demonstrate the visibility of the corpus-based approaches.

Original languageEnglish
Title of host publicationProceedings of the 11th Chinese Lexical Semantic Workshop, CLSW 2010
PublisherSoochow University
Number of pages8
Publication statusPublished - May 2010
EventThe 11th Chinese Lexical Semantic Workshop, CLSW 2010 - Suzhou, China
Duration: 21 May 201023 May 2010


ConferenceThe 11th Chinese Lexical Semantic Workshop, CLSW 2010

User-Defined Keywords

  • Lexical ambiguity
  • Sense prediction
  • Corpus-based approach
  • Character similarity clustering approach
  • Concept similarity clustering approach
  • Evaluation


Dive into the research topics of 'Sense prediction study: Two corpus-driven linguistic approaches'. Together they form a unique fingerprint.

Cite this