MAD-TSC: A Multilingual Aligned News Dataset for Target-dependent Sentiment Classification

doi:10.5281/zenodo.7940057

Published July 9, 2023 | Version v1

Conference paper | Open access

1. Université Paris-Saclay, CEA, List
2. Université de Lorraine - CNRS - Loria

Target-dependent sentiment classification (TSC) enables a fine-grained automatic analysis of sentiments expressed in texts.
Sentiment expression varies depending on the domain, and it is necessary to create domain-specific datasets.
While socially important, TSC in the news domain remains relatively understudied.
We introduce MAD-TSC, the first multilingual aligned dataset designed for TSC in news. MAD-TSC differs substantially from existing resources.
First, it includes aligned examples in eight languages to facilitate a comparison of performance for individual languages, and a direct comparison of human and machine translation.
Second, the dataset is sampled from a diversified parallel news corpus, and is diversified in terms of news sources and geographic spread of entities.
Finally, MAD-TSC is more challenging than existing datasets because its samples are more complex.
We exemplify the use of MAD-TSC with comprehensive monolingual and multilingual experiments.
The latter show that machine translations can successfully replace manual ones, and that performance for all included languages can match that of English when test examples are automatically translated.

Files

Name	Size
mad_tsc.pdf (md5:df069125f6e7936dff094284ae0ebb20)	1.3 MB

Additional details

Funding: European Commission, grant 951911, AI4Media – A European Excellence Centre for Media, Society and Democracy

