Dataset Restricted Access

PAN18 Author Profiling

Rangel, Francisco; Montes-y-Gómez, Manuel; Potthast, Martin; Stein, Benno

We provide you with a training data set that consists of Twitter users labeled with gender. For each author, a total of 100 tweets and 10 images are provided. Authors are grouped by the language of their tweets: English, Arabic and Spanish.

More information about the task: Link

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision to grant or deny access rests solely with the record owner.

Please request access to the data with a short statement on how you want to use it. Thanks!
We would also like to point out that you can register to be part of the PAN community.

  • Francisco Rangel, Manuel Montes-y-Gómez, Martin Potthast, and Benno Stein. Overview of the 6th Author Profiling Task at PAN 2018: Multimodal Gender Identification in Twitter. In Linda Cappellato, Nicola Ferro, Jian-Yun Nie, and Laure Soulier, editors, CLEF 2018 Evaluation Labs and Workshop – Working Notes Papers, 10-14 September 2018, Avignon, France. ISSN 1613-0073.

                   All versions   This version
Views              1,697          1,697
Downloads          555            555
Data volume        3.3 TB         3.3 TB
Unique views       1,230          1,230
Unique downloads   289            289

