Dataset Restricted Access

PAN15 Author Identification: Verification

Stamatatos, Efstathios; Daelemans Daelemans amd Ben Verhoeven, Walter; Juola, Patrick; López-López, Aurelio; Potthast, Martin; Stein, Benno


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>We provide you with a training corpus that comprises a set of author verification problems in several languages/genres. Each problem consists of some (up to five) known documents by a single person and exactly one questioned document. All documents within a single problem instance will be in the same language. However, their genre and/or topic may differ significantly. The document lengths vary from a few hundred to a few thousand words.</p>\n\n<p>The documents of each problem are located in a separate folder, the name of which (problem ID) encodes the language of the documents. The following list shows the available sub-corpora, including their language, type (cross-genre or cross-topic), code, and examples of problem IDs:</p>\n\n<p>Language; Type; Code; Problem IDs<br>\nDutch; Cross-genre; DU; DU001, DU002, DU003, etc.<br>\nEnglish; Cross-topic; EN; EN001, EN002, EN003, etc.<br>\nGreek; Cross-topic; GR; GR001, GR002, GR003, etc.<br>\nSpanish; Cross-genre; SP; SP001, SP002, SP003, etc.</p>\n\n<p>The ground truth data of the training corpus found in the file <code>truth.txt</code> include one line per problem with problem ID and the correct binary answer (Y means the known and the questioned documents are by the same author and N means the opposite). For example:</p>\n\n<pre>EN001 N\nEN002 Y\nEN003 N\n...</pre>", 
  "creator": [
    {
      "@type": "Person", 
      "name": "Stamatatos, Efstathios"
    }, 
    {
      "@type": "Person", 
      "name": "Daelemans Daelemans amd Ben Verhoeven, Walter"
    }, 
    {
      "@type": "Person", 
      "name": "Juola, Patrick"
    }, 
    {
      "@type": "Person", 
      "name": "L\u00f3pez-L\u00f3pez, Aurelio"
    }, 
    {
      "affiliation": "Universit\u00e4t Leipzig", 
      "@id": "https://orcid.org/0000-0003-2451-0665", 
      "@type": "Person", 
      "name": "Potthast, Martin"
    }, 
    {
      "affiliation": "Bauhaus-Universit\u00e4t Weimar", 
      "@id": "https://orcid.org/0000-0001-9033-2217", 
      "@type": "Person", 
      "name": "Stein, Benno"
    }
  ], 
  "url": "https://zenodo.org/record/3737563", 
  "datePublished": "2015-09-08", 
  "@type": "Dataset", 
  "keywords": [
    "authorship", 
    "verification", 
    "pan", 
    "2015"
  ], 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.5281/zenodo.3737563", 
  "@id": "https://doi.org/10.5281/zenodo.3737563", 
  "workFeatured": {
    "alternateName": "PAN at CLEF 2015", 
    "@type": "Event", 
    "name": "Conference title: PAN at Conference and Labs of the Evaluation Forum 2015"
  }, 
  "name": "PAN15 Author Identification: Verification"
}
508
39
views
downloads
All versions This version
Views 508508
Downloads 3939
Data volume 236.0 MB236.0 MB
Unique views 384384
Unique downloads 3737

Share

Cite as