Published March 16, 2022 | Version 1.0.0
Dataset Open

Data for contrastive learning framework

Authors/Creators

  • 1. Anonymous

Description

Data for contrastive learning framework, containing data for training and evaluation in two settings: detection of functionally equivalent programs on the 
POJ-104 dataset, and the plagiarism detection task on the dataset of solutions to competitive programming contests held on the Codeforces platform. In both tasks, the datasets contain pairs of programs, labeled whether they are clones or not.

Files

data.zip

Files (9.2 GB)

Name Size Download all
md5:cb4be13b1d5ce5c8c8e6053c1b20175f
9.2 GB Preview Download