Published April 9, 2022 | Version v1
Journal article Open

Dataset and Source Code for COSER

Creators

Description

The Dataset.zip contains the training, validation and test samples of different programming languages for our experiments.

The way to transform the stack-based instructions bytecode data of Python to 3-address version and finally hierarchically construct the graph is shown in stack_instruction_unify.py.

The way to construct our semantic graph based on the unified 3-address instructions is shown in three_address_ins2_graph.py.

The source code for our customized hierarchical graph pooling mechanism and the way to train the whole architecture end-to-end are shown in the remaining files.

 

Files

Data.zip

Files (146.7 MB)

Name Size Download all
md5:154a3c4e65b93224e7942f3bc6204205
1.7 kB Download
md5:822ff9a7a34491e2eafd31053a83afdd
4.0 kB Download
md5:d5277000e7b6f9f4ba4d193aafc9e271
3.7 kB Download
md5:d7da347e6191a84aa17f10afbceb2674
146.5 MB Preview Download
md5:12e665a81d56606248a7d41463531f3a
9.1 kB Download
md5:bbdc64ffa83910d8bd64f7bc716abd01
10.4 kB Download
md5:ebda585cf78bbf624d4b7390965ae5cc
46.4 kB Download
md5:8b4d98c17c4656129614ed6be95035f4
54.0 kB Download
md5:2fd7bf6860f605d8e813056dbb7f4058
33.5 kB Download
md5:a8c3e9a245a96aca1925b39aec47d144
2.0 kB Download
md5:1d9fac5951c495a682e01b752664be04
18.7 kB Download