Dataset Restricted Access

PAN17 Author Profiling

Rangel, Francisco; Rosso, Paolo; Potthast, Martin; Stein, Benno

We provide you with a training data set that consists of Twitter tweets in English, Spanish, Portuguese and Arabic, labeled with gender and language variety.

More information about the task: Link

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.


Please request access to the data with a short statement on how you want to use it. Thanks!
We would like to point out that you can register on pan.webis.de to be part of the PAN community.


  • Francisco Rangel, Paolo Rosso, Martin Potthast, and Benno Stein. Overview of the 5th Author Profiling Task at PAN 2017: Gender and Language Variety Identification in Twitter. In Linda Cappellato, Nicola Ferro, Lorraine Goeuriot, and Thomas Mandl, editors, CLEF 2017 Evaluation Labs and Workshop – Working Notes Papers, 11-14 September, Dublin, Ireland, September 2017. CEUR-WS.org. ISSN 1613-0073.

447
90
views
downloads
All versions This version
Views 447447
Downloads 9090
Data volume 3.9 GB3.9 GB
Unique views 291291
Unique downloads 4545

Share

Cite as