Published March 10, 2022 | Version v2 | Dataset | Open
Multi-label Datasets used in "Adapting Transformers for Multi-Label Text Classification"
Description
This record contains the three multi-label datasets used in the article "Adapting Transformers for Multi-Label Text Classification":
- AAPD Dataset (ArXiv Academic Paper Dataset) [Yang et al. 2018]¹
- Reuters-21578 Dataset: https://archive.ics.uci.edu/ml/datasets/reuters-21578+text+categorization+collection
- MFHAD (Multilabel French HAL Abstracts Dataset)
¹ Pengcheng Yang, Xu Sun, Wei Li, Shuming Ma, Wei Wu, and Houfeng Wang. 2018. SGM: Sequence Generation Model for Multi-label Classification. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, 3915–3926.