Dataset Open Access
Fatma Arslan;
Naeemul Hassan;
Chengkai Li;
Mark Tremayne
The ClaimBuster dataset consists of statements extracted from all U.S. general election presidential debates (1960-2016) along with human-annotated check-worthiness labels where each sentence is categorized into one of the three categories: non-factual statement, unimportant factual statement, and check-worthy factual statement.
If you use this dataset, please cite the following paper:
@inproceedings{arslan2020claimbuster,
title={{A Benchmark Dataset of Check-worthy Factual Claims}},
author={Arslan, Fatma and Hassan, Naeemul and Li, Chengkai and Tremayne, Mark },
booktitle={14th International AAAI Conference on Web and Social Media},
year={2020},
organization={AAAI}
}
Name | Size | |
---|---|---|
ClaimBuster_Datasets.zip
md5:06ca00d0705e0a7fe9fb9a23a539ca97 |
4.7 MB | Download |
All versions | This version | |
---|---|---|
Views | 2,384 | 1,368 |
Downloads | 1,470 | 143 |
Data volume | 6.1 GB | 674.3 MB |
Unique views | 2,042 | 1,233 |
Unique downloads | 915 | 136 |