IBCircuit: Towards Holistic Circuit Discovery with Information Bottleneck

Tian Bian (Co-first author), Yifan Niu (Co-first author), Chaohao Yuan, Chengzhi Piao, Bingzhe Wu, Long-Kai Huang, Yu Rong, Tingyang Xu*, Hong Cheng, Jia Li*

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Circuit discovery has recently attracted attention as a potential research direction to explain the non-trivial behaviors of language models. It aims to find the computational subgraphs, also known as circuits, within the model that are responsible for solving specific tasks. However, most existing studies overlook the holistic nature of these circuits and require designing specific corrupted activations for different tasks, which is inaccurate and inefficient. In this work, we propose an end-to-end approach based on the principle of Information Bottleneck, called IBCircuit, to holistically identify informative circuits. In contrast to traditional causal interventions, IBCircuit is an optimization framework for holistic circuit discovery and can be applied to any given task without tediously corrupted activation design. In both the Indirect Object Identification (IOI) and Greater-Than tasks, IBCircuit identifies more faithful and minimal circuits in terms of critical node components and edge components compared to recent related work.
Original languageEnglish
Title of host publicationProceedings of the 42nd International Conference on Machine Learning, ICML 2025
PublisherML Research Press
Number of pages14
Publication statusE-pub ahead of print - 18 Jun 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver Convention Center, Vancouver, Canada
Duration: 13 Jul 202519 Jul 2025
https://icml.cc/Conferences/2025

Publication series

NameProceedings of the International Conference on Machine Learning
NameProceedings of Machine Learning Research

Conference

Conference42nd International Conference on Machine Learning, ICML 2025
Abbreviated titleICML 2025
Country/TerritoryCanada
CityVancouver
Period13/07/2519/07/25
Internet address

User-Defined Keywords

  • Circuit Discovery

Fingerprint

Dive into the research topics of 'IBCircuit: Towards Holistic Circuit Discovery with Information Bottleneck'. Together they form a unique fingerprint.

Cite this