Planned intervention: On Wednesday June 26th 05:30 UTC Zenodo will be unavailable for 10-20 minutes to perform a storage cluster upgrade.
Published February 2, 2021 | Version v1
Dataset Open

Webis-Dataset-Reviews-21

  • 1. Bauhaus-Universität Weimar
  • 2. Leipzig University

Description

The Webis-Dataset-Reviews-21 corpus comprises the curated list of 13,372 NLP-related datasets and their 539,411 mentions extracted from all the publications available in ACL Anthology corpus.

Dataset specification

All files are in gzip-compressed JSON Lines format.

  • dataset_mentions.jsonl: contains the extracted dataset mentions
    [dataset, paper, mention]
  • nlp_datasets.jsonl: each record contains the following dataset metadata
    [source, name, doi, decsription, year, creator, corpus_url, paper_title, paper_url, task, language, format, size, ids]

Files

Files (43.5 MB)

Name Size Download all
md5:a7410bdb00644402e50113f8c0e03eef
41.2 MB Download
md5:f9d7b216c37adb828710bf202e82de06
2.3 MB Download