Published February 5, 2023
| Version V2.0
Journal article
Open
Towards Detection of Semantic Clones Across Code Components in Distributed Systems: Semantic Clones Dataset
Description
This dataset includes 27,221 pairs of control flow graphs from the TrainTicket/v0.1.0 benchmark. They have been classified according to manual analysis of the code as A and B clones, as well as non-clones. The similarity values calculated between the component attributes are listed and were used in automating the approach in our paper.
Files
Sematic-clones-dataset-TrainTicketv0.1.0.csv
Files
(7.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:e6c7770e45971d2e9732257ad4bda807
|
5.9 MB | Preview Download |
|
md5:46d0f41a64782d64f9b096bd7a1446f2
|
1.6 MB | Download |