Published February 2, 2021
| Version v1
Dataset
Open
Webis-Dataset-Reviews-21
- 1. Bauhaus-Universität Weimar
- 2. Leipzig University
Description
The Webis-Dataset-Reviews-21 corpus comprises a curated list of 13,372 NLP-related datasets and 539,411 mentions of them, extracted from all publications available in the ACL Anthology corpus.
Dataset specification
All files are in gzip-compressed JSON Lines format.
- dataset_mentions.jsonl: contains the extracted dataset mentions
[dataset, paper, mention]
- nlp_datasets.jsonl: each record contains the following dataset metadata
[source, name, doi, decsription, year, creator, corpus_url, paper_title, paper_url, task, language, format, size, ids]
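Since both files are gzip-compressed JSON Lines, they can be streamed record by record without decompressing them to disk first. The following is a minimal sketch, not an official loader: the local file name `dataset_mentions.jsonl.gz`, the helper names `read_jsonl_gz` and `count_mentions_per_dataset`, and the assumption that the `dataset` field holds a hashable scalar (a string or number) are illustrative and not confirmed by the record.

```python
import gzip
import json
from collections import Counter
from pathlib import Path
from typing import Iterator


def read_jsonl_gz(path: str) -> Iterator[dict]:
    """Yield one parsed record per non-empty line of a gzip-compressed JSON Lines file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines, e.g. a trailing newline
                yield json.loads(line)


def count_mentions_per_dataset(path: str) -> Counter:
    """Count mention records per value of the `dataset` field.

    Assumes `dataset` is a hashable scalar; records without the field
    are counted under the key None.
    """
    counts: Counter = Counter()
    for record in read_jsonl_gz(path):
        counts[record.get("dataset")] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical local file name; adjust to wherever the download was saved.
    mentions_path = Path("dataset_mentions.jsonl.gz")
    if mentions_path.exists():
        top = count_mentions_per_dataset(str(mentions_path)).most_common(10)
        for dataset, n_mentions in top:
            print(dataset, n_mentions)
```

Streaming keeps memory use low for the larger file; the same `read_jsonl_gz` helper should work for the metadata file, as both share the format.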
Files
(43.5 MB)
Checksum | Size |
---|---|
md5:a7410bdb00644402e50113f8c0e03eef | 41.2 MB |
md5:f9d7b216c37adb828710bf202e82de06 | 2.3 MB |
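The MD5 checksums listed above can be used to confirm that a download completed intact. The sketch below is one way to do that with the Python standard library; the helper names `md5_of_file` and `matches_listed_checksum` and the local file name in the usage section are assumptions, and the record does not state which checksum belongs to which file.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path: str, listed: str) -> bool:
    """Compare a file's MD5 with a listed value such as 'md5:<hex digest>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


if __name__ == "__main__":
    # Hypothetical local file name; try each listed checksum, because the
    # record does not say which one belongs to which file.
    local_file = Path("dataset_mentions.jsonl.gz")
    listed_checksums = [
        "md5:a7410bdb00644402e50113f8c0e03eef",
        "md5:f9d7b216c37adb828710bf202e82de06",
    ]
    if local_file.exists():
        ok = any(matches_listed_checksum(str(local_file), c) for c in listed_checksums)
        print("checksum OK" if ok else "checksum mismatch")
```

MD5 is suitable here only as an integrity check against accidental corruption, not as a security guarantee.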