Published September 10, 2018 | Version v1
Dataset
Open
PAN18 Author Profiling
Authors/Creators
- 1. Universität Leipzig
- 2. Bauhaus-Universität Weimar
Description
This training dataset consists of Twitter users labeled with gender. For each author, 100 tweets and 10 images are provided. Authors are grouped by the language of their tweets: English, Arabic, and Spanish.
More information about the task: Link
Files
(11.9 GB)
pan18-author-profiling-test-dataset-2018-03-20.zip
| MD5 checksum | Size |
|---|---|
| 74c2f77989c61af1209b97d7ba82db9f | 4.9 GB |
| 1e4a44a6d63ef9f8737ec107db7694b7 | 7.0 GB |
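Because the archives are several gigabytes each, it can be worth checking a download against the MD5 checksums listed above before unpacking it. The following is a minimal sketch of such a check using only the Python standard library. The helper names `md5_of_file` and `matches_listed_checksum` are illustrative and not part of the dataset, and the local file paths are whatever you pass on the command line, since this record does not state which file name belongs to which checksum.

```python
import hashlib
import sys

# MD5 checksums copied from the file listing on this record.
LISTED_MD5 = {
    "74c2f77989c61af1209b97d7ba82db9f",  # listed as 4.9 GB
    "1e4a44a6d63ef9f8737ec107db7694b7",  # listed as 7.0 GB
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path):
    """Return True if the file's MD5 equals one of the listed checksums."""
    return md5_of_file(path) in LISTED_MD5


if __name__ == "__main__":
    # Hypothetical usage: python verify_md5.py <downloaded-file> [...]
    for name in sys.argv[1:]:
        status = "OK" if matches_listed_checksum(name) else "MISMATCH"
        print(f"{status}  {name}")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the file again is the simplest remedy.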
Additional details
References
- Francisco Rangel, Paolo Rosso, Manuel Montes-y-Gómez, Martin Potthast, and Benno Stein. Overview of the 6th Author Profiling Task at PAN 2018: Multimodal Gender Identification in Twitter. In Linda Cappellato, Nicola Ferro, Jian-Yun Nie, and Laure Soulier, editors, CLEF 2018 Evaluation Labs and Workshop – Working Notes Papers, 10-14 September 2018, Avignon, France. CEUR-WS.org. ISSN 1613-0073.