Published January 15, 2024 | Version v1
Dataset Open

EUvsDisinfo: a Dataset for Multilingual Detection of Pro-Kremlin Disinformation in News Articles (Dataset)

Description

This is the dataset and metadata accompanying the paper submission titled "EUvsDisinfo: a Dataset for Multilingual Detection of Pro-Kremlin Disinformation in News Articles".

For copyright and liability reasons, we do not publicly distribute the complete dataset. Instead, we provide the software used to create the dataset (DOI: 10.5281/zenodo.10492913) and a list containing the URLs of all the posts in the full dataset (this repository).

To reconstruct our dataset: use the software to extract the articles corresponding URL list. Please note that some posts may no longer be available or may have been modified.

If you are researching disinformation, propaganda, or a relevant field: please contact the authors, we may be able to provide you with the original dataset.

The fields are provided in the following format:

Field name

Description Format

debunk_id

ID of the debunk on EUvsDisinfo website to which the article refers alphanumeric string

keywords

The topics covered in the article comma-separated string

article_id

Unique ID of the article alphanumeric string

article_publisher

The name of the article publisher string

article_domain

The domain where the article was published. Different domains can refer to the same publisher string

article_url

The url where the article can be collected from by the software string

article_language

The language in which the article was written string

debunk_date

The date when the debunk was published date (dd-mm-yyyy)

class

Class assigned based on whether the article is marked as disinformation or appears in the debunk to in the context of disprooving disinformation string (trustworthy or disinformation)

 

Files

euvsdisinfo_base.csv

Files (5.5 MB)

Name Size Download all
md5:40b80c97bc563a7aba6a3e62aacf97d8
5.5 MB Preview Download

Additional details

Related works

Is compiled by
Software: 10.5281/zenodo.10492913 (DOI)

Funding

vera.ai: VERification Assisted by Artificial Intelligence 10039055
UK Research and Innovation
vera.ai – vera.ai: VERification Assisted by Artificial Intelligence 101070093
European Commission