Dataset Restricted Access

PAN14 Author Identification: Verification

Stamatatos, Efstathios; Daelemans, Walter; Verhoeven, Ben; Potthast, Martin; Stein, Benno; Juola, Patrick; A. Sanchez-Perez, Miguel; Barrón-Cedeño, Alberto

We provide you with a training corpus that comprises a set of author verification problems in several languages/genres. Each problem consists of some (up to five) known documents by a single person and exactly one questioned document. All documents within a single problem instance will be in the same language and best efforts are applied to assure that within-problem documents are matched for genre, register, theme, and date of writing. The document lengths vary from a few hundred to a few thousand words.

More information: Link

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.

Please request access to the data with a short statement on how you want to use it. Thanks!
We would like to point out that you can register on to be part of the PAN community.

  • Efstathios Stamatatos, Walter Daelemans, Ben Verhoeven, Martin Potthast, Benno Stein, Patrick Juola, Miguel A. Sanchez-Perez, and Alberto Barrón-Cedeño. Overview of the Author Identification Task at PAN 2014. In Linda Cappellato, Nicola Ferro, Martin Halvey, and Wessel Kraaij, editors, Working Notes Papers of the CLEF 2014 Evaluation Labs, September 2014. ISSN 1613-0073.

All versions This version
Views 392392
Downloads 3030
Data volume 744.6 MB744.6 MB
Unique views 288288
Unique downloads 2727


Cite as