Published March 12, 2017 | Version v1
Dataset Open

Hansard Speeches and Sentiment V1.0.1

Creators

Description

A public dataset of speeches from Hansard, the record of speeches, votes and legislation in the UK Parliament. The dataset covers each speech of ten words or longer made in the House of Commons between 1980 and 2016, with information on the speaking MP, their party, gender and age at the time of the speech. It also includes all speeches of ten words or longer made from 1936 to 1979, without identifying information on the speaker.

The speeches have been classified for sentiment using a total of five libraries from the R packages `sentimentr`, `syuzhet` and `lexicon`.

The integrity of the public Hansard record is questionable at times, and while I have improved it, the data is presented 'as is'. More details on the dataset are available at: http://evanodell.com/datasets/hansard-data/

Notes

This is an updated version of http://doi.org/10.5281/zenodo.376665

Files

Files (6.0 GB)

Name: senti_df.csv
Size: 6.0 GB
Checksum: md5:55f1396608f4ec25259f928ea71cdd3d
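
Because senti_df.csv is a 6.0 GB download, it may be worth checking a local copy against the MD5 value listed for the file before working with it. The following is a minimal sketch in Python, not part of the dataset itself. The function names and the local path `senti_df.csv` are assumptions made for illustration. The file is read in chunks so that it never has to fit in memory.

```python
import hashlib
from pathlib import Path

# MD5 value listed for senti_df.csv on this record.
EXPECTED_MD5 = "55f1396608f4ec25259f928ea71cdd3d"


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (end of file).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, expected=EXPECTED_MD5):
    """Hypothetical helper: True if the file's MD5 equals the listed value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    local_copy = Path("senti_df.csv")  # assumed download location
    if local_copy.exists():
        ok = matches_listed_checksum(local_copy)
        print("checksum OK" if ok else "checksum mismatch")
    else:
        print(f"{local_copy} not found; download it first")
```

A mismatch usually points to an interrupted or corrupted download rather than a problem with the data, so downloading the file again is a reasonable first step.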

Additional details

Related works

Is new version of
10.5281/zenodo.376665 (DOI)