Abstract
Sentiment analysis is an important and challenging task in natural language processing. It has been studied for a few decades. Recently, Bidirectional Encoder Representations from Transformer (BERT) model has been introduced to tackle this task and gain very promising results. However, most existing studies on fine-tuning BERT models for sentiment analysis focus on high-resource language (e.g., En-glish or Mandarin). This paper studies the sentiment analysis of Cantonese political posts on Hong Kong local forums. We first collected and labeled posts related to Anti-Extradition Law Amendment Bill (Anti-ELAB) movement in Hong Kong discussion forums. We then examined the performance of dictionary-based sentiment analysis, traditional machine learning-based, fine-tuned BERT and fine-tuned multilingual BERT (mBERT) models. Our results show that fine-tuned mBERT model achieves the best performance on our collected and labeled Cantonese dataset.
Original language | English |
---|---|
Title of host publication | Proceedings - 2022 IEEE International Conference on Big Data, Big Data 2022 |
Editors | Shusaku Tsumoto, Yukio Ohsawa, Lei Chen, Dirk Van den Poel, Xiaohua Hu, Yoichi Motomura, Takuya Takagi, Lingfei Wu, Ying Xie, Akihiro Abe, Vijay Raghavan |
Publisher | IEEE |
Pages | 6763-6765 |
Number of pages | 3 |
ISBN (Electronic) | 9781665480451 |
ISBN (Print) | 9781665480468 |
DOIs | |
Publication status | Published - Dec 2022 |
Event | 2022 IEEE International Conference on Big Data, Big Data 2022 - Osaka, Japan Duration: 17 Dec 2022 → 20 Dec 2022 https://ieeexplore.ieee.org/xpl/conhome/10020192/proceeding |
Publication series
Name | Proceedings - IEEE International Conference on Big Data |
---|
Conference
Conference | 2022 IEEE International Conference on Big Data, Big Data 2022 |
---|---|
Country/Territory | Japan |
City | Osaka |
Period | 17/12/22 → 20/12/22 |
Internet address |
Scopus Subject Areas
- Modelling and Simulation
- Computer Networks and Communications
- Information Systems
- Information Systems and Management
- Safety, Risk, Reliability and Quality
- Control and Optimization
User-Defined Keywords
- BERT
- Fine-tuned BERT
- multilingual BERT
- Sentiment Analysis