XBlock-ETH: Extracting and Exploring Blockchain Data From Ethereum

Peilin Zheng, Zibin Zheng, Jiajing Wu, Hong-Ning Dai

Research output: Contribution to journal, Journal article, peer-reviewed

125 Citations (Scopus)


Blockchain-based cryptocurrencies have received extensive attention recently, and massive amounts of data are now stored on permission-less blockchains. Analyzing this data can bring substantial business value. However, the absence of well-processed, up-to-date blockchain datasets impedes big data analytics on blockchain data. To fill this gap, we collect and process up-to-date on-chain data from Ethereum, one of the most popular permission-less blockchains. We name the resulting well-processed Ethereum data XBlock-ETH; it consists of transactions, smart contracts, and cryptocurrencies (i.e., tokens). Partitioning and categorizing the collected raw Ethereum data into well-processed datasets is non-trivial, since the whole procedure requires sophisticated knowledge of software engineering as well as big data analytics. We also present basic statistics and exploration for each of the well-processed datasets. Finally, we outline possible research opportunities based on XBlock-ETH, with the data and code released online.
Original language: English
Pages (from-to): 95-106
Number of pages: 12
Journal: IEEE Open Journal of the Computer Society
Publication status: Published - 5 May 2020

