Published February 5, 2023 | Version v2
Journal article
Open
Towards Detection of Semantic Clones Across Code Components in Distributed Systems: Semantic Clones Dataset
Description
This dataset contains 27,221 pairs of control flow graphs from the TrainTicket/v0.1.0 benchmark. Each pair was classified, through manual analysis of the code, as an A clone, a B clone, or a non-clone. The dataset also lists the similarity values calculated between the component attributes, which were used to automate the approach described in our paper.
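As a rough sketch of how the classification could be inspected once the CSV is downloaded, the snippet below tallies rows per class. The description does not state the column headers, so the label column name is passed in as a parameter, and any name used for it (for example `clone_type`) is a hypothetical placeholder to be replaced with the real header. UTF-8 encoding is also an assumption.

```python
import csv
from collections import Counter


def count_labels(csv_path, label_column):
    """Count how many rows carry each value in `label_column`.

    `label_column` is supplied by the caller because the dataset's
    actual header names are not given in the description (assumption).
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        fieldnames = reader.fieldnames or []
        if label_column not in fieldnames:
            raise KeyError(
                f"column {label_column!r} not found; available: {fieldnames}"
            )
        # The Counter consumes the reader while the file is still open.
        return Counter(row[label_column] for row in reader)
```

Calling `count_labels("Sematic-clones-dataset-TrainTicketv0.1.0.csv", "clone_type")` would return a mapping from class value to row count, provided `clone_type` is swapped for the column that actually holds the A, B, and non-clone labels.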
Files
Sematic-clones-dataset-TrainTicketv0.1.0.csv
All files (7.5 MB)
Checksum | Size |
---|---|
md5:e6c7770e45971d2e9732257ad4bda807 | 5.9 MB |
md5:f8f2e98f0a723418aa170a6652802a78 | 1.6 MB |
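The MD5 digests listed for the files can be used to confirm that a download arrived intact. The following is a minimal sketch using Python's standard `hashlib`. The helper names `md5_of_file` and `matches_listed_checksum` are illustrative, and because the listing does not say which digest belongs to which file name, the check only tests membership in the set of listed digests.

```python
import hashlib

# MD5 digests as listed for the files of this record.
LISTED_MD5 = {
    "e6c7770e45971d2e9732257ad4bda807",
    "f8f2e98f0a723418aa170a6652802a78",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path):
    """True if the file's MD5 equals one of the listed digests."""
    return md5_of_file(path) in LISTED_MD5
```

Reading in chunks keeps memory use flat regardless of file size. MD5 is adequate here for detecting accidental corruption, not for guarding against deliberate tampering.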