There is a newer version of the record available.

Published April 25, 2024 | Version v1
Dataset Open

Comprehensive Ethereum Execution Data for Object-Centric Process Mining of Decentralized Applications (DApps)

Creators

Description

The dataset pertains to the collection and analysis of blockchain execution data, particularly from Ethereum-based Decentralized Applications (DApps). This data includes transactions, transaction receipts, and detailed transaction traces, documenting the execution steps performed by the Ethereum Virtual Machine (EVM). Such traces are essential for understanding the interaction between smart contracts and accounts, including Contract Accounts (CAs) and Externally Owned Accounts (EOAs).

A blockchain is an append-only ledger that chronologically records data in blocks. Each block contains transactions that signify state transitions, and transaction receipts that provide a hashed result of these transitions to ensure uniform results across different executions. The dataset includes a classification of Ethereum accounts, detailing the functions and interactions between EOAs and CAs, where CAs deploy and execute smart contract code.

The dataset captures the granular operational data of blockchain transactions, such as function calls, contract creations, and log entries generated by smart contracts. These details are crucial for creating object-centric event logs, aiding in process mining and analysis to bridge the gap between theoretical process models and actual execution.

Contract creations and function calls are fundamental components of the dataset. The former documents the deployment of smart contracts, including the mechanics of contract updates and additions through various design patterns. Function calls between accounts are also extensively logged, providing insights into the flow of Ethereum's native token, Ether, and other transactional data within the blockchain.

Delegated calls and log entries represent more specialized interactions within Ethereum, where delegated calls allow contracts to use code from other contracts to manipulate their own state, supporting upgradeable contract designs. Log entries, specified within smart contract code, facilitate the communication of contract execution details to external systems.

To handle the diverse and dynamic nature of blockchain data, the dataset employs the Object-Centric Event Log (OCEL) format. This format accommodates multiple object types in a single log, addressing issues such as event divergence and convergence, typical of traditional single-case logs. The latest version, OCEL 2.0, supports documenting dynamic object roles and relationships, improving the fidelity of logs in capturing blockchain operations.

In summary, the dataset is structured to support a comprehensive analysis of blockchain behaviors, particularly focusing on Ethereum DApps. It is tailored to assist researchers and practitioners in understanding and analyzing the decentralized execution of smart contracts and the associated data flows within the blockchain environment.

Files

calls_dapp.csv

Files (9.6 GB)

Name Size Download all
md5:45c88586b2ac958528a023df1e71235c
1.2 GB Download
md5:4e5a83e090c95112eaa1115b1e56d62d
79.3 MB Preview Download
md5:61c7253dd870c6e3603fa8ed9619a911
5.1 GB Preview Download
md5:3ff61005e99b67971f4bab4b70d62ba6
75.9 MB Preview Download
md5:99976584617c66a224c31e550a0850b3
2.3 GB Preview Download
md5:206b6363f4bf1694e70ede0ded0358cd
765.8 MB Preview Download

Additional details

Dates

Available
2024-04-25