Dataset Open Access
Fatma Arslan;
Naeemul Hassan;
Chengkai Li;
Mark Tremayne
{ "description": "<p>The ClaimBuster dataset consists of statements extracted from all U.S. general election presidential debates (1960-2016) along with human-annotated check-worthiness labels. It contains 23,533 sentences where each sentence is categorized into one of the three categories: non-factual statement, unimportant factual statement, and check-worthy factual statement. </p>", "license": "https://creativecommons.org/licenses/by/4.0/legalcode", "creator": [ { "affiliation": "University of Texas at Arlington", "@type": "Person", "name": "Fatma Arslan" }, { "affiliation": "University of Maryland", "@type": "Person", "name": "Naeemul Hassan" }, { "affiliation": "University of Texas at Arlington", "@id": "https://orcid.org/0000-0002-1724-8278", "@type": "Person", "name": "Chengkai Li" }, { "affiliation": "University of Texas at Arlington", "@type": "Person", "name": "Mark Tremayne" } ], "url": "https://zenodo.org/record/3609356", "datePublished": "2020-01-15", "keywords": [ "factual claim", "check-worthy claim", "check-worthiness" ], "@context": "https://schema.org/", "distribution": [ { "contentUrl": "https://zenodo.org/api/files/0eae643c-e79f-4f2d-986d-e70b3cff6223/all_sentences.csv", "encodingFormat": "csv", "@type": "DataDownload" }, { "contentUrl": "https://zenodo.org/api/files/0eae643c-e79f-4f2d-986d-e70b3cff6223/crowdsourced.csv", "encodingFormat": "csv", "@type": "DataDownload" }, { "contentUrl": "https://zenodo.org/api/files/0eae643c-e79f-4f2d-986d-e70b3cff6223/groundtruth.csv", "encodingFormat": "csv", "@type": "DataDownload" } ], "identifier": "https://doi.org/10.5281/zenodo.3609356", "@id": "https://doi.org/10.5281/zenodo.3609356", "@type": "Dataset", "name": "ClaimBuster: A Benchmark Dataset of Check-worthy Factual Claims" }
All versions | This version | |
---|---|---|
Views | 2,349 | 992 |
Downloads | 1,456 | 1,309 |
Data volume | 6.0 GB | 5.3 GB |
Unique views | 2,019 | 900 |
Unique downloads | 907 | 797 |