6397037
doi
10.5281/zenodo.6397037
oai:zenodo.org:6397037
user-pan
BERTA CHULVI
UNIVERSITAT POLITÈCNICA DE VALÈNCIA
FRANCISCO RANGEL
SYMANTO RESEARCH
PAOLO ROSSO
UNIVERSITAT POLITÈCNICA DE VALÈNCIA
ELISABETTA FERSINI
UNIVERSITÉ DEGLI STUDI DI MILANO BICOCCA
PAN 22 Author Profiling: Profiling Irony and Stereotype Spreaders on Twitter (IROSTEREO)
REYNIER ORTEGA BUENO
UNIVERSITAT POLITÈCNICA DE VALÈNCIA
info:eu-repo/semantics/restrictedAccess
author profiling
irony
stereotypes
social categories
<p><strong>TASK</strong></p>
<p>With irony, language is employed in a figurative and subtle way to mean the opposite to what is literally stated. In case of sarcasm, a more aggressive type of irony, the intent is to mock or scorn a victim without excluding the possibility to hurt. Stereotypes are often used, especially in discussions about controversial issues such as immigration or sexism and misogyny. At PAN’22, we will focus on profiling ironic authors in Twitter. Special emphasis will be given to those authors that employ irony to spread stereotypes, for instance, towards women or the LGTB community. The goal will be to classify authors as ironic or not depending on their number of tweets with ironic content. Among those authors we will consider a subset that employs irony to convey stereotypes in order to investigate if state-of-the-art models are able to distinguish also these cases. Therefore, given authors of Twitter together with their tweets, the goal will be to profile those authors that can be considered as ironic.</p>
<p><strong>DATA</strong></p>
<p><strong>Input</strong></p>
<p>The uncompressed dataset consists in a folder which contains:</p>
<ul>
<li>A XML file per author (Twitter user) with 200 tweets. The name of the XML file correspond to the unique author id.</li>
<li>A truth.txt file with the list of authors and the ground truth.</li>
</ul>
<p>The format of the XML files is:</p>
<pre><code class="language-xml"> <author lang="en">
<documents>
<document>Tweet 1 textual contents</document>
<document>Tweet 2 textual contents</document>
...
</documents>
</author></code></pre>
<p>The format of the truth.txt file is as follows. The first column corresponds to the author id. The second column contains the truth label.</p>
<pre><code> 2d0d4d7064787300c111033e1d2270cc:::I
b9eccce7b46cc0b951f6983cc06ebb8:::NI
f41251b3d64d13ae244dc49d8886cf07:::I
47c980972060055d7f5495a5ba3428dc:::NI
d8ed8de45b73bbcf426cdc9209e4bfbc:::I
2746a9bf36400367b63c925886bc0683:::NI
...</code></pre>
<p><strong>Evaluation</strong></p>
<p>The performance of your system will be ranked by accuracy.</p>
<p> </p>
<p>More info on the task: <a href="https://pan.webis.de/clef22/pan22-web/author-profiling.html">https://pan.webis.de/clef22/pan22-web/author-profiling.html </a></p>
Zenodo
2022-03-29
info:eu-repo/semantics/other
6397036
user-pan
V1
1651597206.597524
public
10.5281/zenodo.6397036
isVersionOf
doi