Planned intervention: On Wednesday June 26th 05:30 UTC Zenodo will be unavailable for 10-20 minutes to perform a storage cluster upgrade.
Published February 2, 2021 | Version v1
Dataset Open


  • 1. Bauhaus-Universität Weimar
  • 2. Leipzig University


The Webis-Dataset-Reviews-21 corpus comprises the curated list of 13,372 NLP-related datasets and their 539,411 mentions extracted from all the publications available in ACL Anthology corpus.

Dataset specification

All files are in gzip-compressed JSON Lines format.

  • dataset_mentions.jsonl: contains the extracted dataset mentions
    [dataset, paper, mention]
  • nlp_datasets.jsonl: each record contains the following dataset metadata
    [source, name, doi, decsription, year, creator, corpus_url, paper_title, paper_url, task, language, format, size, ids]


Files (43.5 MB)

Name Size Download all
41.2 MB Download
2.3 MB Download