Published November 13, 2017
| Version v3
Dataset
Open
Dataset with manually validated version histories of Stack Overflow posts
Description
We used this dataset to evaluate different string similarity metrics for SOTorrent (http://sotorrent.org/).
The dataset has been created with this tool: https://github.com/sotorrent/so-posthistory-gt
The dataset has been used in this project: https://github.com/sotorrent/metrics-comparison
Files
LICENSE.txt
Files
(1.5 MB)
Name | Size | Download all |
---|---|---|
md5:2acd14bf8b67dea11530bba3b87d1ee8
|
220 Bytes | Preview Download |
md5:022a15a36d6a261345a249027945b939
|
373.6 kB | Preview Download |
md5:6211c6a08366b96f233a78f6274c4452
|
168.4 kB | Preview Download |
md5:1d3a0f0302de5ff2ef27df33c30c92f0
|
299.9 kB | Preview Download |
md5:1659fe5f452931be75641e41442cb148
|
148.3 kB | Preview Download |
md5:adc696d312a5bf0373a4719e914c203c
|
202.9 kB | Preview Download |
md5:56ff87e7fb6a8268ca0e47b737978c78
|
173.2 kB | Preview Download |
md5:e8fcdbfd3ad5eb22ff0f8b0ad3b3c29b
|
172.8 kB | Preview Download |