Published January 20, 2021 | Version 1.0.1
Dataset Open

Smart Contract Code Summarization Dataset

Creators

  • 1. City University of Hong Kong

Description

The Paper has been accepted by ICPC'21.

If you find this dataset useful, please cite our paper, here's the link: https://arxiv.org/abs/2103.07164

The whole data includes: 

(1) contracts: 347,410 smart contract <method, comment> pair raw data.

(2)  dataset: 

                  a. dictionaries: the dictionary of each sequence.

                  b. token_idx: each input that has translated to digital index.

                  c. dataset.pkl: 317,680 (SBT sequence, nodes equence, adjacency matrix, comment) tuples.

Files

contracts.zip

Files (904.4 MB)

Name Size Download all
md5:7a76f3c54d186801cd38f2a7f179af44
261.7 MB Preview Download
md5:73164b4246ec0ef85470f3399a65fe3f
642.6 MB Preview Download