Published January 20, 2021
                      
                       | Version 1.0.1
                    
                    
                      
                        
                          Dataset
                        
                      
                      
                        
                          
                        
                        
                          Open
                        
                      
                    
                  Smart Contract Code Summarization Dataset
Description
The Paper has been accepted by ICPC'21.
If you find this dataset useful, please cite our paper, here's the link: https://arxiv.org/abs/2103.07164
The whole data includes:
(1) contracts: 347,410 smart contract <method, comment> pair raw data.
(2) dataset:
a. dictionaries: the dictionary of each sequence.
b. token_idx: each input that has translated to digital index.
c. dataset.pkl: 317,680 (SBT sequence, nodes equence, adjacency matrix, comment) tuples.
Files
      
        contracts.zip
        
      
    
    
      
        Files
         (904.4 MB)
        
      
    
    | Name | Size | Download all | 
|---|---|---|
| md5:7a76f3c54d186801cd38f2a7f179af44 | 261.7 MB | Preview Download | 
| md5:73164b4246ec0ef85470f3399a65fe3f | 642.6 MB | Preview Download |