Published January 20, 2021 | Version 1.0.1
Dataset
Open
Smart Contract Code Summarization Dataset
Description
The paper has been accepted at ICPC'21.
If you find this dataset useful, please cite our paper: https://arxiv.org/abs/2103.07164
The data consists of:
(1) contracts: raw data comprising 347,410 smart contract <method, comment> pairs.
(2) dataset:
a. dictionaries: the dictionary for each sequence type.
b. token_idx: each input converted to numeric indices.
c. dataset.pkl: 317,680 (SBT sequence, node sequence, adjacency matrix, comment) tuples.
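As a rough illustration of how one might take a first look at dataset.pkl, the sketch below loads a pickle file and reports the top-level type and length without assuming any particular layout. The description above says only that the file holds (SBT sequence, node sequence, adjacency matrix, comment) tuples; whether they are stored as a list, a dict, or something else is not stated, so the helper names `load_pickle` and `summarize` and the example path are hypothetical. Unpickling can execute arbitrary code, so this should only be done with files from a trusted source, and it may require the libraries that were used to create the file.

```python
import pickle


def load_pickle(path):
    """Load and return the object stored in a pickle file.

    Only use this on files you trust: unpickling can run arbitrary code.
    """
    with open(path, "rb") as handle:
        return pickle.load(handle)


def summarize(obj):
    """Return (type name, length or None) without assuming a layout."""
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size


# Hypothetical usage; the path depends on where the archive was extracted:
# data = load_pickle("dataset/dataset.pkl")
# print(summarize(data))
```

If the top-level object turns out to be a sequence, inspecting its first element in the same way is a reasonable next step before writing any unpacking code.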
Files
contracts.zip
Total size: 904.4 MB
Checksum | Size |
---|---|
md5:7a76f3c54d186801cd38f2a7f179af44 | 261.7 MB |
md5:73164b4246ec0ef85470f3399a65fe3f | 642.6 MB |
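The MD5 values listed for the files can be used to check that a download completed intact. The following is a minimal sketch using Python's standard `hashlib`; the function names `md5_of_file` and `matches_checksum` and the example path are assumptions made for illustration, and which checksum belongs to which archive should be confirmed against the record itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with a value written like 'md5:7a76...'."""
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex


# Hypothetical usage; replace the path with the downloaded archive:
# ok = matches_checksum("contracts.zip", "md5:<checksum listed for that file>")
# print("checksum OK" if ok else "checksum mismatch")
```

Reading in chunks keeps memory use flat, which matters for archives of several hundred megabytes.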