Abstract
Physics problems constitute a significant aspect of reasoning, requiring both complex reasoning ability and extensive physics knowledge. However, existing large language models (LLMs) frequently fail on them because they lack the relevant knowledge or apply it incorrectly. To mitigate these issues, we propose Physics Reasoner, a knowledge-augmented framework for solving physics problems with LLMs. Specifically, the framework constructs a comprehensive formula set to provide explicit physics knowledge and uses checklists containing detailed instructions to guide effective knowledge application. Given a physics problem, Physics Reasoner solves it in three stages: problem analysis, formula retrieval, and guided reasoning. Throughout the process, checklists are employed to enhance LLMs' self-improvement in the analysis and reasoning stages. Empirically, Physics Reasoner mitigates the issues of insufficient knowledge and incorrect application, achieving state-of-the-art performance on SciBench with an average accuracy improvement of 5.8%.
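The three stages named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the real framework prompts an LLM at each stage, whereas here every stage is replaced by plain Python so the control flow is visible. The names `Formula`, `FORMULA_SET`, `analyze_problem`, `retrieve_formulas`, `guided_reasoning`, and both checklist functions are hypothetical, and the three-entry formula set is an assumed stand-in for the paper's comprehensive one.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Formula:
    name: str
    target: str                     # variable the formula solves for
    inputs: List[str]               # variables the formula needs
    compute: Callable[..., float]   # evaluates the formula


# Hypothetical miniature formula set (assumption, not taken from the paper).
FORMULA_SET = [
    Formula("kinetic_energy", "E_k", ["m", "v"], lambda m, v: 0.5 * m * v ** 2),
    Formula("momentum", "p", ["m", "v"], lambda m, v: m * v),
    Formula("newton_second_law", "F", ["m", "a"], lambda m, a: m * a),
]


def analyze_problem(problem: Dict) -> Dict:
    """Stage 1 stand-in: an LLM would extract knowns and the target from text."""
    return {"known": dict(problem["known"]), "target": problem["target"]}


def analysis_checklist(analysis: Dict) -> List[str]:
    """Assumed checklist items for the analysis stage; returns found issues."""
    issues = []
    if not analysis["known"]:
        issues.append("no known variables extracted")
    if analysis["target"] in analysis["known"]:
        issues.append("target variable is already known")
    return issues


def retrieve_formulas(analysis: Dict, formula_set: List[Formula]) -> List[Formula]:
    """Stage 2: keep formulas that yield the target from the known variables."""
    known = analysis["known"]
    return [
        f for f in formula_set
        if f.target == analysis["target"] and all(v in known for v in f.inputs)
    ]


def reasoning_checklist(value: float) -> List[str]:
    """Assumed checklist item for the reasoning stage."""
    return [] if math.isfinite(value) else ["result is not a finite number"]


def guided_reasoning(analysis: Dict, formulas: List[Formula]) -> float:
    """Stage 3: apply the first retrieved formula, then run the checklist."""
    if not formulas:
        raise ValueError("no applicable formula retrieved")
    formula = formulas[0]
    value = formula.compute(*(analysis["known"][v] for v in formula.inputs))
    issues = reasoning_checklist(value)
    if issues:
        raise ValueError("; ".join(issues))
    return value


def solve(problem: Dict) -> float:
    analysis = analyze_problem(problem)
    issues = analysis_checklist(analysis)
    if issues:
        raise ValueError("; ".join(issues))
    return guided_reasoning(analysis, retrieve_formulas(analysis, FORMULA_SET))
```

For example, `solve({"known": {"m": 2.0, "v": 3.0}, "target": "E_k"})` retrieves the kinetic-energy entry and returns `9.0`. In this sketch a failed checklist simply raises an error; the abstract instead describes checklists as driving the LLM's self-improvement, which would correspond to revising the analysis or the reasoning rather than aborting.
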
Original language | English |
---|---|
Title of host publication | Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025 |
Editors | Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 11274-11289 |
Number of pages | 16 |
ISBN (Electronic) | 9798891761964 |
Publication status | Published - Jan 2025 |
Event | 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, United Arab Emirates. Duration: 19 Jan 2025 → 24 Jan 2025. Conference proceedings: https://aclanthology.org/volumes/2025.coling-main/ |
Publication series
Name | Proceedings - International Conference on Computational Linguistics, COLING |
---|---|
Volume | Part F206484-1 |
ISSN (Print) | 2951-2093 |
Conference
Conference | 31st International Conference on Computational Linguistics, COLING 2025 |
---|---|
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 19/01/25 → 24/01/25 |