leSAX Index: A Learned SAX Representation Index for Time Series Similarity Search

Guozhong Li*, Byron Choi, Rundong Zuo, Sourav S. Bhowmick, Jianliang Xu

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Time series similarity search (TSSS) is a fundamental task across various applications, including classification, motif discovery, and anomaly detection. However, existing iSAXbased index methods, while known for their efficiency, often rely on hand-crafted techniques (e.g., PAA and SAX) for znormalized time series data. However, these techniques do not fully exploit the full representation space and pose challenges to indexing. In this paper, we propose a learned index approach for TSSS. Specifically, we introduce SAXNET, a novel twostage neural network that generates the learned SAX representation (LESAX representation) for both z-normalized and nonz- normalized time series data. The benefits of SAXNET are threefold: 1 full exploitation of latent space, 2 preservation of time series shapes and global information for indexing, and 3 elimination of the need for hand-crafted techniques. We then propose LESAX index, a novel learned SAX representation index, which consists of a LESAX tree and a learned index. The distribution of the LESAX representations in the LESAX tree is adjusted to achieve a near-uniform distribution for index efficiency. Furthermore, we propose a learned index structure that works alongside the LESAX tree, applied recursively in case of large index leaf nodes. We have conducted comprehensive experiments on exact similarity search using our SAXNET and LESAX index on both real and synthetic time series datasets. The results demonstrate that our LESAX method outperforms state-of-the-art methods in efficiency, achieving performance improvements ranging from 3.6× to 17×.
Original languageEnglish
Title of host publicationProceedings of the 41st IEEE International Conference on Data Engineering, ICDE 2025
Place of PublicationHong Kong
PublisherIEEE
Publication statusPublished - 20 May 2025
Event41st IEEE International Conference on Data Engineering, ICDE 2025 - The Hong Kong Polytechnic University, Hong Kong, China
Duration: 19 May 202523 May 2025
https://ieee-icde.org/2025/
https://ieee-icde.org/2025/research-papers/
https://www.computer.org/csdl/proceedings/icde/2025/26FZy3xczFS

Publication series

NameInternational Conference on Data Engineering

Conference

Conference41st IEEE International Conference on Data Engineering, ICDE 2025
Abbreviated titleICDE 2025
Country/TerritoryChina
CityHong Kong
Period19/05/2523/05/25
Internet address

Fingerprint

Dive into the research topics of 'leSAX Index: A Learned SAX Representation Index for Time Series Similarity Search'. Together they form a unique fingerprint.

Cite this