TRIM: Accelerating High-Dimensional Vector Similarity Search with Enhanced Triangle-Inequality-Based Pruning

  • Yitong Song
  • , Pengcheng Zhang
  • , Chao Gao
  • , Bin Yao*
  • , Kai Wang
  • , Zongyuan Wu
  • , Lin Qu
  • *Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

High-dimensional vector similarity search (HVSS) is critical for many data processing and AI applications. However, traditional HVSS methods often require extensive data access for distance calculations, leading to inefficiencies. Triangle-inequality-based lower bound pruning is a widely used technique to reduce the number of data access in low-dimensional spaces but becomes less effective in high-dimensional settings. This is attributed to the ''distance concentration'' phenomenon, where the lower bounds derived from the triangle inequality become too small to be useful. To address this, we propose TRIM, which enhances the effectiveness of traditional triangle-inequality-based pruning in high-dimensional vector similarity search using two key ways: (1) optimizing landmark vectors used to form the triangles, and (2) relaxing the lower bounds derived from the triangle inequality, with the relaxation degree adjustable according to user's needs. TRIM is a versatile operation that can be seamlessly integrated into both memory-based (e.g., HNSW, IVFPQ) and disk-based (e.g., DiskANN) HVSS methods, reducing distance calculations and disk access. Extensive experiments show that TRIM enhances memory-based methods, improving graph-based search by up to 90% and quantization-based search by up to 200%, while achieving a pruning ratio of up to 99%. It also reduces I/O costs by up to 58% and improves efficiency by 102% for disk-based methods, while preserving high query accuracy. Our source code is available at https://github.com/petrizhang/TRIM.
Original languageEnglish
Article number373
Pages (from-to)1-26
Number of pages26
JournalProceedings of the ACM on Management of Data
Volume3
Issue number6
DOIs
Publication statusPublished - 5 Dec 2025

User-Defined Keywords

  • High-dimensional vector similarity search
  • Triangle inequality
  • Approximate 𝑘 nearest neighbor search
  • Approximate range search

Fingerprint

Dive into the research topics of 'TRIM: Accelerating High-Dimensional Vector Similarity Search with Enhanced Triangle-Inequality-Based Pruning'. Together they form a unique fingerprint.

Cite this