Dataset Restricted Access

PAN16 Author Identification: Clustering

Stamatatos, Efstathios; Tschuggnall, Michael; Verhoeven, Ben; Daelemans, Walter; Specht, Günther; Stein, Benno; Potthast, Martin

We provide a collection of (up to 100) documents to identify authorship links and groups of documents by the same author. All documents are single-authored, in the same language, and belong to the same genre. However, the topic or text-length of documents may vary. The number of distinct authors whose documents are included in the collection is not given.

More information: Link

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.

Please request access to the data with a short statement on how you want to use it. Thanks!
We would like to point out that you can register on to be part of the PAN community.

  • Efstathios Stamatatos, Michael Tschuggnall, Ben Verhoeven, Walter Daelemans, Günther Specht, Benno Stein, and Martin Potthast. Clustering by Authorship Within and Across Documents. In Working Notes Papers of the CLEF 2016 Evaluation Labs volume 1609 of CEUR Workshop Proceedings, September 2016. ISSN 1613-0073.

All versions This version
Views 208208
Downloads 1313
Data volume 69.3 MB69.3 MB
Unique views 159159
Unique downloads 1313


Cite as