Abstract
Recent analyses have disclosed that existing rumor detection techniques, despite playing a pivotal role in countering the dissemination of misinformation on social media, are vulnerable to both white-box and surrogate-based black-box adversarial attacks. However, such attacks depend heavily on unrealistic assumptions, e.g., modifiable user data and white-box access to the rumor detection models, or appropriate selections of surrogate models, which are impractical in the real world. Thus, existing analyses fail to uncover the robustness of rumor detectors in practice. In this work, we take a further step towards the investigation about the robustness of existing rumor detection solutions. Specifically, we focus on the state-of-the-art rumor detectors, which leverage graph neural network based models to predict whether a post is rumor based on the Message Propagation Tree (MPT), a conversation tree with the post as its root and the replies to the post as the descendants of the root. We propose a novel black-box attack method, HMIA-LLM, against these rumor detectors, which uses the Large Language Model to generate malicious messages and inject them into the targeted MPTs. Our extensive evaluation conducted across three rumor detection datasets, four target rumor detectors, and three baselines for comparison demonstrates the effectiveness of our proposed attack method in compromising the performance of the state-of-the-art rumor detectors.
| Original language | English |
|---|---|
| Title of host publication | WWW '24: Proceedings of the ACM Web Conference 2024 |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 4512-4522 |
| Number of pages | 11 |
| ISBN (Electronic) | 9798400701719 |
| DOIs | |
| Publication status | Published - 13 May 2024 |
| Event | 33rd ACM Web Conference, WWW 2024 - Singapore, Singapore Duration: 13 May 2024 → 17 May 2024 https://dl.acm.org/doi/proceedings/10.1145/3589334 https://dl.acm.org/doi/proceedings/10.1145/3589335 |
Publication series
| Name | WWW 2024 - Proceedings of the ACM Web Conference |
|---|
Conference
| Conference | 33rd ACM Web Conference, WWW 2024 |
|---|---|
| Country/Territory | Singapore |
| City | Singapore |
| Period | 13/05/24 → 17/05/24 |
| Internet address |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
User-Defined Keywords
- black-box evasion setting
- graph neural networks
- large language model
- message injection attack
- rumor detection
Fingerprint
Dive into the research topics of 'Message Injection Attack on Rumor Detection under the Black-Box Evasion Setting Using Large Language Model'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver