There is a newer version of this record available.

Dataset Open Access

Hansard Speeches and Sentiment V2.5.0

Odell, Evan

Full details are available at https://evanodell.com/projects/datasets/hansard-data

Summary

A public dataset of speeches in the Hansard, stored as a tibble class in RDS files, for the R programming language. The dataset provides information on every speech made in the House of Commons between the parliament returned from the 1979 general election and the end of 2017, with information on the speaking MP, their party, gender, birthdate, starting and finishing dates as an MP, and age at the time of the speech.

The `hansard_senti_post_V250` dataset contains 2,196,175 speeches and 382,484,493 words. It is distributed under a Creative Commons 4.0 BY-SA licence.

Notes

The code and matching data used to generate this dataset is available on Github.

The data used to create this dataset was taken from the parlparse project operated by They Work For You and supported by mySociety.

The dataset is licensed under a Creative Commons Attribution 4.0 International License.

The code used to create this dataset is licensed under an MIT license.

Please contact me if you find any errors in the dataset. The integrity of the public Hansard record is questionable at times, and while I have improved it, the data is presented 'as is'.

This release is an update of previously released datasets. See full documentation for details.
Files (3.1 GB)
Name Size
gender-senti-mean-V250.csv
md5:74357f6fa7958e20331b12970218e4b4
1.8 kB Download
gov-senti-mean-V250.csv
md5:fcee82a33b52c019d7a780f878783b1a
1.8 kB Download
hansard-summary-stats-V250.xlsx
md5:24cd08569adce2efaec491f748ba741c
1.1 MB Download
hansard_post_V250.rds
md5:f34c5a8c1bfd7f9cfa04284e51f4cb07
3.1 GB Download
ministry-senti-mean-V250.csv
md5:7bbec04d9452425f226376e53598fb56
8.3 kB Download
month-senti-mean-V250.csv
md5:5a4dcf2457ba3d39dcb3c14c13fe7a9e
237.7 kB Download
mp-senti-mean-V241.csv
md5:dfdde2e2ec6dca36729fad7242f8c4b1
916.2 kB Download
mp-senti-mean-V250.csv
md5:05ef55bd964ddb87ad97474a751de83f
1.3 MB Download
party-group-senti-mean-V241.csv
md5:de33263477ddef6645f61d8d91a6b4b1
2.1 kB Download
party-group-senti-mean-V250.csv
md5:2d6f1415e5320e14649d21d24ea5b9a5
3.5 kB Download
party-senti-mean-V241.csv
md5:06f04d711938a5d7d8a50f61523e92be
14.1 kB Download
party-senti-mean-V250.csv
md5:72106b14257bb921bf47471fc9aa19b8
19.8 kB Download
quarter-senti-mean-V241.csv
md5:02fb8fa5c9e164c10c6e98543b8feea1
66.6 kB Download
quarter-senti-mean-V250.csv
md5:3daaa8dd35a29c79ea8e4bd73b03b96d
91.8 kB Download
year-senti-mean-V241.csv
md5:e4fec7bae959ea9b85e4c0c430cc8a0d
17.1 kB Download
year-senti-mean-V250.csv
md5:0df937dac04a71d67ab445b958c42c3e
23.5 kB Download
141
210
views
downloads
All versions This version
Views 14134
Downloads 21020
Data volume 389.2 GB18.8 GB
Unique views 12432
Unique downloads 14314

Share

Cite as