Dataset Open Access

SeSaMe: A Data Set of Semantically Similar Java Methods

Kamp, Marius; Kreutzer, Patrick; Philippsen, Michael

This is the data set presented in the paper

Kamp, M., Kreutzer P., Philippsen M.: SeSaMe: A Data Set of Semantically
Similar Java Methods. 16th International Conference on Mining Software
Repositories (MSR 2019), Montreal, QC, Canada. 2019

Files (543.7 MB)
Name Size
dataset-unfiltered.json.xz
md5:21e4d6df9ba1f603b7cd8abde1076bdc
71.3 kB Download
dataset.json.xz
md5:72d445e7edae2331027ab2421632e912
68.3 kB Download
docs.db.xz
md5:85b1fcd342ee47bde3bd6c0be3397bf7
543.3 MB Download
sampled-pairs.csv
md5:647857348a09671aaca586a038165c3f
304.1 kB Download
225
191
views
downloads
All versions This version
Views 225225
Downloads 191191
Data volume 13.1 GB13.1 GB
Unique views 210210
Unique downloads 147147

Share

Cite as