Dataset Restricted Access

PAN16 Author Profiling

Rangel, Francisco; Rosso, Paolo; Verhoeven, Ben; Daelemans, Walter; Potthast, Martin; Stein, Benno


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Francisco Rangel, Paolo Rosso, Ben Verhoeven, Walter Daelemans, Martin Potthast, and Benno Stein. Overview of the 4th Author Profiling Task at PAN 2016: Cross-Genre Evaluations. In Krisztian Balog, Linda Cappellato, Nicola Ferro, and Craig Macdonald, editors, CLEF 2016 Evaluation Labs and Workshop – Working Notes Papers, 5-8 September, Évora, Portugal, September 2016. CEUR-WS.org. ISSN 1613-0073.</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">twitter</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">tweets</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">author</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">profiling</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">pan</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">2016</subfield>
  </datafield>
  <controlfield tag="005">20200409202017.0</controlfield>
  <controlfield tag="001">3745963</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="g">PAN at CLEF 2016</subfield>
    <subfield code="a">Conference title: PAN at Conference and Labs of the Evaluation Forum 2016</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Rosso, Paolo</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Verhoeven, Ben</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Daelemans, Walter</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Universität Leipzig</subfield>
    <subfield code="0">(orcid)0000-0003-2451-0665</subfield>
    <subfield code="a">Potthast, Martin</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Bauhaus-Universität Weimar</subfield>
    <subfield code="0">(orcid)0000-0001-9033-2217</subfield>
    <subfield code="a">Stein, Benno</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">restricted</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2016-09-05</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-pan</subfield>
    <subfield code="o">oai:zenodo.org:3745963</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Rangel, Francisco</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">PAN16 Author Profiling</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-pan</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;We provide&amp;nbsp;a training data set that consists of Twitter tweets in English, Spanish and Dutch.&lt;/p&gt;

&lt;p&gt;The English and Spanish datasets are labeled with age and gender, whereas the Dutch one is labeled only with gender. For age, we consider the following classes: 18-24, 25-34, 35-49, 50-64, 65-xx.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Remark.&lt;/strong&gt; Due to Twitter&amp;#39;s privacy policy, we cannot provide tweets directly, only URLs referring to them. You will have to download them yourself. For your convenience, we provide download software for this. We expect participants to extract gender and age information only from the textual part of a tweet and to discard any other meta information that may be provided by Twitter&amp;#39;s API. When we evaluate your software at our site, we do not expect it to download tweets. We will do this beforehand.&lt;/p&gt;

&lt;p&gt;More information about the task:&amp;nbsp;&lt;a href="https://pan.webis.de/clef16/pan16-web/author-profiling.html"&gt;Link&lt;/a&gt;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3745962</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3745963</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
Statistic          All versions   This version
Views              1,041          1,041
Downloads          144            144
Data volume        568.4 MB       568.4 MB
Unique views       721            721
Unique downloads   72             72
