There is a newer version of the record available.

Published November 13, 2017 | Version v3
Dataset Open

Dataset with manually validated version histories of Stack Overflow posts

  • 1. University of Trier

Description

We used this dataset to evaluate different string similarity metrics for SOTorrent (http://sotorrent.org/).

The dataset has been created with this tool: https://github.com/sotorrent/so-posthistory-gt

The dataset has been used in this project: https://github.com/sotorrent/metrics-comparison

 

Files

LICENSE.txt

Files (1.5 MB)

Name Size Download all
md5:2acd14bf8b67dea11530bba3b87d1ee8
220 Bytes Preview Download
md5:022a15a36d6a261345a249027945b939
373.6 kB Preview Download
md5:6211c6a08366b96f233a78f6274c4452
168.4 kB Preview Download
md5:1d3a0f0302de5ff2ef27df33c30c92f0
299.9 kB Preview Download
md5:1659fe5f452931be75641e41442cb148
148.3 kB Preview Download
md5:adc696d312a5bf0373a4719e914c203c
202.9 kB Preview Download
md5:56ff87e7fb6a8268ca0e47b737978c78
173.2 kB Preview Download
md5:e8fcdbfd3ad5eb22ff0f8b0ad3b3c29b
172.8 kB Preview Download