Published May 1, 2024
| Version v1
Dataset
Open
Datasets for the paper: Lost in Translation: Using Global Fact-Checks to Measure Multilingual Misinformation Prevalence, Spread, and Evolution
Authors/Creators
Description
FullData.csv.gz: Contains links to all claims in the data-set.
- publishing_date: Date on which the fact-check was published.
- claim_date: Date that claim was made.
- verdict: Rating given by the fact-checking organisation.
- language: Language of the claim.
- cluster_{threshold}: ID of the cluster that claim belongs to at all given clusters. Entry "0" means that claim is singleton and not clustered with any other claims.
Embeddings.npy: Contains a dictionary linking each claim to it's embedding calculated with LaBSE.
Files
Additional details
Software
- Repository URL
- https://github.com/dorianquelle/Lost-In-Translation
- Programming language
- Python