Published October 20, 2023 | Version v1
Dataset Open

Analysing state-backed propaganda websites: a new dataset and linguistic study (public dataset)

  • 1. ROR icon University of Sheffield

Description

This is the dataset accompanying the EMNLP 2023 paper "Analysing state-backed propaganda websites: a new dataset and linguistic study".

For copyright and liability reasons, we do not publicly distribute the complete dataset. Instead, we provide the software used to create the dataset (DOI: 10.5281/zenodo.10008086) and a list containing the URLs of all the posts in the full dataset (this repository).

To reconstruct our dataset: use the software to extract the sites, then filter the posts to the corresponding URL list. Please note that some posts may no longer be available or may have been modified.

If you are researching disinformation, propaganda, or a relevant field: please contact the authors, we may be able to provide you with the original dataset.

Files

rrn-20230306-posts.url-list.txt

Files (1.0 MB)

Name Size Download all
md5:2484c7686c0b636bdc1181272c45d464
611.8 kB Preview Download
md5:36218c17a954589a65b85b531e98de47
426.3 kB Preview Download

Additional details

Related works

Is compiled by
Software: 10.5281/zenodo.10008086 (DOI)
Is derived from
Dataset: 10.5281/zenodo.10008933 (DOI)
Is described by
Conference paper: 10.18653/v1/2023.emnlp-main.349 (DOI)

Funding

VIGILANT : Vital IntelliGence to Investigate ILlegAl DisiNformaTion 10039039
UK Research and Innovation