Published March 1, 2024 | Version v1
Conference paper Open

Putting Context in Context: the Impact of Discussion Structure on Text Classification

  • 1. ROR icon Fondazione Bruno Kessler
  • 2. ROR icon University of Trento

Description

Current text classification approaches usually focus on the content to be classified. Contextual aspects (both linguistic and extra-linguistic) are usually neglected, even in tasks based on online discussions. Still in many cases the multi-party and multi-turn nature of the context from which these elements are selected can be fruitfully ex- ploited. In this work, we propose a series of experiments on a large dataset for stance de- tection in English, in which we evaluate the contribution of different types of contextual in- formation, i.e. linguistic, structural and tempo- ral, by feeding them as natural language input into a transformer-based model. We also exper- iment with different amounts of training data and analyse the topology of local discussion networks in a privacy-compliant way. Results show that structural information can be highly beneficial to text classification but only under certain circumstances (e.g. depending on the amount of training data and on discussion chain complexity). Indeed, we show that contextual information on smaller datasets from other clas- sification tasks does not yield significant im- provements. Our framework, based on local discussion networks, allows the integration of structural information, while minimising user profiling, thus preserving their privacy.

Files

2024.eacl-long.108.pdf

Files (1.4 MB)

Name Size Download all
md5:d85a5da67fad9bae8562925c4819a6a5
1.4 MB Preview Download

Additional details

Funding

European Commission
AI4TRUST – AI-based-technologies for trustworthy solutions against disinformation 101070190

Dates

Accepted
2023-03-01