Skip to main navigation Skip to search Skip to main content

IBCircuit: Towards Holistic Circuit Discovery with Information Bottleneck

  • Tian Bian (Co-first author)
  • , Yifan Niu (Co-first author)
  • , Chaohao Yuan
  • , Chengzhi Piao
  • , Bingzhe Wu
  • , Long-Kai Huang
  • , Yu Rong
  • , Tingyang Xu*
  • , Hong Cheng
  • , Jia Li*
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Circuit discovery has recently attracted attention as a potential research direction to explain the non-trivial behaviors of language models. It aims to find the computational subgraphs, also known as circuits, within the model that are responsible for solving specific tasks. However, most existing studies overlook the holistic nature of these circuits and require designing specific corrupted activations for different tasks, which is inaccurate and inefficient. In this work, we propose an end-to-end approach based on the principle of Information Bottleneck, called IBCircuit, to holistically identify informative circuits. In contrast to traditional causal interventions, IBCircuit is an optimization framework for holistic circuit discovery and can be applied to any given task without tediously corrupted activation design. In both the Indirect Object Identification (IOI) and Greater-Than tasks, IBCircuit identifies more faithful and minimal circuits in terms of critical node components and edge components compared to recent related work.
Original languageEnglish
Title of host publicationProceedings of the 42nd International Conference on Machine Learning, ICML 2025
EditorsAarti Singh, Maryam Fazel, Daniel Hsu, Simon Lacoste-Julien, Felix Berkenkamp, Tegan Maharaj, Kiri Wagstaff, Jerry Zhu
PublisherML Research Press
Pages4289-4302
Number of pages14
Publication statusPublished - 13 Jul 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver Convention Center, Vancouver, Canada
Duration: 13 Jul 202519 Jul 2025
https://icml.cc/Conferences/2025 (Conference Website)
https://icml.cc/virtual/2025/calendar (Conference Calendar)
https://proceedings.mlr.press/v267/ (Conference Proceedings)

Publication series

NameProceedings of the International Conference on Machine Learning
NameProceedings of Machine Learning Research
Volume267
ISSN (Print)2640-3498

Conference

Conference42nd International Conference on Machine Learning, ICML 2025
Country/TerritoryCanada
CityVancouver
Period13/07/2519/07/25
Internet address

User-Defined Keywords

  • Circuit Discovery

Fingerprint

Dive into the research topics of 'IBCircuit: Towards Holistic Circuit Discovery with Information Bottleneck'. Together they form a unique fingerprint.

Cite this