Published February 5, 2023 | Version V2.0
Journal article Open

Towards Detection of Semantic Clones Across Code Components in Distributed Systems: Semantic Clones Dataset

  • 1. Baylor University

Description

This dataset includes 27,221 pairs of control flow graphs from the TrainTicket/v0.1.0 benchmark. They have been classified according to manual analysis of the code as A and B clones, as well as non-clones. The similarity values calculated between the component attributes are listed and were used in automating the approach in our paper.

Files

Sematic-clones-dataset-TrainTicketv0.1.0.csv

Files (7.5 MB)

Name Size Download all
md5:e6c7770e45971d2e9732257ad4bda807
5.9 MB Preview Download
md5:46d0f41a64782d64f9b096bd7a1446f2
1.6 MB Download