COLE: A Column-based Learned Storage for Blockchain Systems

Ce Zhang, Cheng Xu, Haibo Hu, Jianliang Xu*

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

1 Citation (Scopus)

Abstract

Blockchain systems suffer from high storage costs as every node needs to store and maintain the entire blockchain data. After investigating Ethereum’s storage, we find that the storage cost mostly comes from the index, i.e., Merkle Patricia Trie (MPT). To support provenance queries, MPT persists the index nodes during the data update, which adds too much storage overhead. To reduce the storage size, an initial idea is to leverage the emerging learned index technique, which has been shown to have a smaller index size and more efficient query performance. However, directly applying it to the blockchain storage results in even higher overhead owing to the requirement of persisting index nodes and the learned index’s large node size. To tackle this, we propose COLE, a novel column-based learned storage for blockchain systems. We follow the column-based database design to contiguously store each state’s historical values, which are indexed by learned models to facilitate efficient data retrieval and provenance queries. We develop a series of write-optimized strategies to realize COLE in disk environments. Extensive experiments are conducted to validate the performance of the proposed COLE system. Compared with MPT, COLE reduces the storage size by up to 94% while improving the system throughput by 1.4×-5.4×.

Original languageEnglish
Title of host publicationProceedings of the 22nd USENIX Conference on File and Storage Technologies (FAST ’24)
PublisherUSENIX Association
Pages329-345
Number of pages17
ISBN (Electronic)9781939133380
Publication statusPublished - 29 Feb 2024
Event22nd USENIX Conference on File and Storage Technologies, FAST 2024 - Hyatt Regency Santa Clara, Santa Clara, United States
Duration: 27 Feb 202429 Feb 2024
https://www.usenix.org/conference/fast24 (Conference website)
https://www.usenix.org/system/files/fast24-full_proceedings.pdf (Conference proceeding)

Conference

Conference22nd USENIX Conference on File and Storage Technologies, FAST 2024
Country/TerritoryUnited States
CitySanta Clara
Period27/02/2429/02/24
Internet address

Scopus Subject Areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'COLE: A Column-based Learned Storage for Blockchain Systems'. Together they form a unique fingerprint.

Cite this