Dataset Restricted Access

PAN14 Author Profiling

Rangel, Francisco; Rosso, Paolo; Chugur, Irina; Potthast, Martin; Trenkmann, Martin; Stein, Benno; Verhoeven, Ben; Daelemans, Walter

We provide you with a training data set that consists of blog posts, Twitter tweets and social media texts written in both English and Spanish as well as hotel reviews written in English. With regard to age, we will consider the following classes: 18-24, 25-34, 35-49, 50-64, 65-xx.

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.

Please request access to the data with a short statement on how you want to use it. Thanks!
We would like to point out that you can register on to be part of the PAN community.

  • Francisco Rangel, Paolo Rosso, Irina Chugur, Martin Potthast, Martin Trenkmann, Benno Stein, Ben Verhoeven, and Walter Daelemans. Overview of the 2nd Author Profiling Task at PAN 2014. In Linda Cappellato, Nicola Ferro, Martin Halvey, and Wessel Kraaij, editors, Working Notes Papers of the CLEF 2014 Evaluation Labs, September 2014. ISSN 1613-0073.

All versions This version
Views 588588
Downloads 7272
Data volume 14.7 GB14.7 GB
Unique views 440440
Unique downloads 6060


Cite as