Dataset Open Access

Webis Abstractive Snippet Corpus 2020

Chen ,Wei-Fan; Syed, Shahbaz; Potthast, Martin; Hagen, Matthias; Stein, Benno

The Webis Abstractive Snippet 2020 (Webis-Snippete-20) comprises four abstractive snippet dataset from ClueWeb09, Clueweb12, and DMOZ descriptions. More than 10 million <webpage, abstractive snippet> pairs / 3.5 million <query, webpage, abstractive snippet> pairs were collected.

Files (11.2 GB)
Name Size
released-snippet-ac-qb.zip
md5:41696d93df837a53f871c0e402eb0a22
4.3 GB Download
released-snippet-ac.zip
md5:f36a9c50a117d5bee91831b4a23c7bb0
6.6 GB Download
released-snippet-dmoz-qb.zip
md5:ed30087f82080000ac5e67b23a8d8c98
23.2 MB Download
released-snippet-dmoz.zip
md5:511694a7c9e4794364eaff89838e0039
244.4 MB Download
820
2,699
views
downloads
All versions This version
Views 820820
Downloads 2,6992,699
Data volume 10.8 TB10.8 TB
Unique views 735735
Unique downloads 365365

Share

Cite as