Published March 16, 2022
| Version 1.0.0
Dataset
Open
Data for contrastive learning framework
Description
Data for contrastive learning framework, containing data for training and evaluation in two settings: detection of functionally equivalent programs on the
POJ-104 dataset, and the plagiarism detection task on the dataset of solutions to competitive programming contests held on the Codeforces platform. In both tasks, the datasets contain pairs of programs, labeled whether they are clones or not.
Files
data.zip
Files
(9.2 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:cb4be13b1d5ce5c8c8e6053c1b20175f
|
9.2 GB | Preview Download |