Boosting with Fewer Tokens: Multi-Query Optimization for LLMs Using Node Text and Neighbor Cues

Yujie Fang, Xin LI, Yuangang Pan, Xin Huang, Ivor W. Tsang

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Recent studies have explored querying large language models (LLMs) to serve as predictors for graph mining tasks on text-attributed graphs (TAGs), establishing a promising paradigm that surpasses Graph Neural Networks (GNNs) in scalability and generalization. However, the high token costs of LLMs make this approach prohibitively expensive for large-scale node queries, and effective multi-query optimization solutions are currently lacking. By conducting information theory analysis at the single query level, we have gained insights that enabled the development of two multi-query optimization strategies: token pruning and query boosting. The token pruning strategy is designed to reduce token usage without compromising task performance by identifying saturated node queries and pruning tokens for these queries. Meanwhile, the query boosting strategy is designed to enhance task performance by enriching the context of unexecuted queries with pseudo-labels derived from previous queries through strategic scheduling, thereby maximizing the utility of these pseudo-labels. Extensive experiments applying these two strategies, either jointly or individually, to various existing methods demonstrate that the proposed approach serves our intentions well. Besides, this paper offers a fresh methodology for optimizing LLM processing of graph tasks, demonstrating great potential. For most natural graph data benchmarks in the field, it can save tokens by several orders of magnitude. For example, on the Ogbn-Products dataset, it could theoretically save up to 2 × 10^9 tokens.
Original languageEnglish
Title of host publicationProceedings of the 41st IEEE International Conference on Data Engineering, ICDE 2025
PublisherIEEE
Pages2684-2697
Number of pages13
DOIs
Publication statusPublished - 20 May 2025
Event41st IEEE International Conference on Data Engineering, ICDE 2025 - The Hong Kong Polytechnic University, Hong Kong, China
Duration: 19 May 202523 May 2025
https://ieee-icde.org/2025/
https://ieee-icde.org/2025/research-papers/
https://www.computer.org/csdl/proceedings/icde/2025/26FZy3xczFS

Publication series

NameInternational Conference on Data Engineering

Conference

Conference41st IEEE International Conference on Data Engineering, ICDE 2025
Abbreviated titleICDE 2025
Country/TerritoryChina
CityHong Kong
Period19/05/2523/05/25
Internet address

Fingerprint

Dive into the research topics of 'Boosting with Fewer Tokens: Multi-Query Optimization for LLMs Using Node Text and Neighbor Cues'. Together they form a unique fingerprint.

Cite this