Abstract
Large Reasoning Models (LRMs) have exhibited extraordinary prowess in tasks like mathematics and coding, leveraging their advanced reasoning capabilities. Nevertheless, as these capabilities progress, significant concerns regarding their vulnerabilities and safety have arisen, which can pose challenges to their deployment and application in real-world settings. This paper presents the first comprehensive survey of LRMs, meticulously exploring and summarizing the newly emerged safety risks, attacks, and defense strategies specific to these powerful reasoning-enhanced models. By organizing these elements into a detailed taxonomy, this work aims to offer a clear and structured understanding of the current safety landscape of LRMs, facilitating future research and development to enhance the security and reliability of these powerful models.
| Original language | English |
|---|---|
| Title of host publication | EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2025 |
| Editors | Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 3468-3482 |
| Number of pages | 15 |
| ISBN (Electronic) | 9798891763357 |
| DOIs | |
| Publication status | Published - Nov 2025 |
| Event | 30th Conference on Empirical Methods in Natural Language Processing - Suzhou, China Duration: 4 Nov 2025 → 9 Nov 2025 https://aclanthology.org/volumes/2025.findings-emnlp/ (Conference Proceedings) https://underline.io/events/502/reception (Conference website) |
Publication series
| Name | EMNLP - Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP |
|---|
Conference
| Conference | 30th Conference on Empirical Methods in Natural Language Processing |
|---|---|
| Abbreviated title | EMNLP 2025 |
| Country/Territory | China |
| City | Suzhou |
| Period | 4/11/25 → 9/11/25 |
| Internet address |
|
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
Fingerprint
Dive into the research topics of 'Safety in Large Reasoning Models: A Survey'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver