Zenodo.org will be unavailable for 2 hours on September 29th from 06:00-08:00 UTC. See announcement.

Dataset Open Access

SeSaMe: A Data Set of Semantically Similar Java Methods

Kamp, Marius; Kreutzer, Patrick; Philippsen, Michael

This is the data set presented in the paper

Kamp, M., Kreutzer P., Philippsen M.: SeSaMe: A Data Set of Semantically
Similar Java Methods. 16th International Conference on Mining Software
Repositories (MSR 2019), Montreal, QC, Canada. 2019

Files (543.7 MB)
Name Size
dataset-unfiltered.json.xz
md5:21e4d6df9ba1f603b7cd8abde1076bdc
71.3 kB Download
dataset.json.xz
md5:72d445e7edae2331027ab2421632e912
68.3 kB Download
docs.db.xz
md5:85b1fcd342ee47bde3bd6c0be3397bf7
543.3 MB Download
sampled-pairs.csv
md5:647857348a09671aaca586a038165c3f
304.1 kB Download
693
424
views
downloads
All versions This version
Views 693693
Downloads 424424
Data volume 26.2 GB26.2 GB
Unique views 619619
Unique downloads 326326

Share

Cite as