Published September 15, 2020
| Version 1.0
Software
Open
acred models and data
Description
Models and data needed to run and reproduce results for the acred credibility review system, as described in our paper accepted at the International Semantic Web Conference 2020.
Models:
- semantic_encoder.zip is a RoBERTa base model fine-tuned on STS-B for encoding sentences for semantic similarity
- saved_fnc1_classifier is a RoBERTa base model with sentence-pair classifier fine-tuned on FNC-1 for stance detection
- check_worthiness is a RoBERTa base model with sentence classifier fine-tuned on a variety of datasets to predict the checkworthiness of a sentence
Data:
- claims-from-ClaimReviews-45K.csv provides 45K sentences for which we have found ClaimReviews, which serve as ground credibility signals
- sentences-extractedFrom-Articles-40K.csv provides 40K sentences extracted from a variety of news websites, which serve as ground credibility signals
- claim_dev_embs_85K_20200426.tar.gz provides embeddings for the 85K sentences in the csv files
- claimReviews-pruned-45K.jsonl provides a pruned version of the original claimReview. In particular, we do not include the full text of hte claimReview, but only the url, author information and main rating.
Files
check_worthiness_acc_0.95.zip
Files
(1.6 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:e795ae79a21decd97a425322b31f4474
|
326.7 MB | Preview Download |
|
md5:df67b2c071b14508856e30013f59950d
|
599.8 MB | Download |
|
md5:222afa9b078d24c9641f819d1e5ae99d
|
24.8 MB | Download |
|
md5:47c55fe7368f5f7c12c5357b76660e07
|
11.2 MB | Preview Download |
|
md5:b430a34bf4713aea4f074a5922acde5c
|
336.0 MB | Download |
|
md5:23b50b4277d96d34b509064b88c66d28
|
324.1 MB | Preview Download |
|
md5:cb0b1a66d1490dabe4cc022597a1dacf
|
11.2 MB | Preview Download |