Creating a Dataset for Characterizing Smart Contract Code on the Ethereum Network

  • João Ricardo Pereira Marques UTFPR
  • João Fabrício Filho UTFPR
  • Rogério Aparecido Gonçalves UTFPR

Abstract


This article presents an approach to creating a dataset containing contract codes deployed on the Ethereum network. The selection criterion covers the three types of transactions: regular, creation, and execution of contract codes. The ethereumetl and DuckDB tools are used for data extraction and database creation. The database gathers block information, transactions, contract codes, and information about the instructions used in each contract such as how many times each contract was executed, and in which versions of the Ethereum Virtual Machine (EVM).

References

Day, A. and Medvedev, E. (2018). Ethereum in BigQuery: a Public Dataset for Smart Contract Analytics.

Medvedev, E. and the D5 team (2023). Ethereum ETL.

Szabo, N. (1997). Formalizing and securing relationships on public networks. First Monday, 2(9).

Wood, G. et al. (2014). Ethereum: A Secure Decentralised Generalised Transaction Ledger. Ethereum project yellow paper, 151(2014):1–32. [link].
Published
2024-05-16
MARQUES, João Ricardo Pereira; FABRÍCIO FILHO, João; GONÇALVES, Rogério Aparecido. Creating a Dataset for Characterizing Smart Contract Code on the Ethereum Network. In: REGIONAL SCHOOL OF HIGH PERFORMANCE COMPUTING FROM SÃO PAULO (ERAD-SP), 15. , 2024, Rio Claro/SP. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 45-48. DOI: https://doi.org/10.5753/eradsp.2024.239862.

Most read articles by the same author(s)

1 2 > >>