Abstract
Circuit discovery has recently attracted attention as a potential research direction to explain the non-trivial behaviors of language models. It aims to find the computational subgraphs, also known as circuits, within the model that are responsible for solving specific tasks. However, most existing studies overlook the holistic nature of these circuits and require designing specific corrupted activations for different tasks, which is inaccurate and inefficient. In this work, we propose an end-to-end approach based on the principle of Information Bottleneck, called IBCircuit, to holistically identify informative circuits. In contrast to traditional causal interventions, IBCircuit is an optimization framework for holistic circuit discovery and can be applied to any given task without the tedious design of corrupted activations. In both the Indirect Object Identification (IOI) and Greater-Than tasks, IBCircuit identifies more faithful and minimal circuits in terms of critical node components and edge components compared to recent related work.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 42nd International Conference on Machine Learning, ICML 2025 |
| Editors | Aarti Singh, Maryam Fazel, Daniel Hsu, Simon Lacoste-Julien, Felix Berkenkamp, Tegan Maharaj, Kiri Wagstaff, Jerry Zhu |
| Publisher | ML Research Press |
| Pages | 4289-4302 |
| Number of pages | 14 |
| Publication status | Published - 13 Jul 2025 |
| Event | 42nd International Conference on Machine Learning, ICML 2025 - Vancouver Convention Center, Vancouver, Canada. Duration: 13 Jul 2025 → 19 Jul 2025. Conference website: https://icml.cc/Conferences/2025. Conference calendar: https://icml.cc/virtual/2025/calendar. Conference proceedings: https://proceedings.mlr.press/v267/ |
Publication series
| Name | Proceedings of the International Conference on Machine Learning |
|---|---|
| Name | Proceedings of Machine Learning Research |
| Volume | 267 |
| ISSN (Print) | 2640-3498 |
Conference
| Conference | 42nd International Conference on Machine Learning, ICML 2025 |
|---|---|
| Country/Territory | Canada |
| City | Vancouver |
| Period | 13/07/25 → 19/07/25 |
User-Defined Keywords
- Circuit Discovery
Fingerprint
Dive into the research topics of 'IBCircuit: Towards Holistic Circuit Discovery with Information Bottleneck'. Together they form a unique fingerprint.