There is a newer version of the record available.

Published August 4, 2017 | Version v4
Dataset Open

Hansard Speeches and Sentiment V2.4

Creators

  • 1. Disability Rights UK

Description

Full details are available at https://evanodell.com/projects/datasets/hansard-data

Summary

A public dataset of Hansard speeches, stored as tibbles in RDS files for the R programming language. The dataset covers every speech made in the House of Commons from the parliament returned at the 1979 general election to the start of the parliamentary summer recess on 2017-07-20, with information on the speaking MP: their party, gender, birthdate, starting and finishing dates as an MP, and age at the time of the speech. The dataset also includes all speeches made from 1936 to the dissolution of parliament for the 1979 general election. The post-1979 election dataset is labelled `hansard_senti_post_V24` and the pre-1979 election dataset is labelled `hansard_senti_pre_V24`.

The `hansard_senti_post_V24` dataset contains 2,169,348 speeches and 373,323,215 words. The `hansard_senti_pre_V24` dataset contains 2,977,461 speeches and 406,062,364 words. It is distributed under a Creative Commons 4.0 BY-SA licence.

Notes

The code and matching data used to generate this dataset are available on GitHub.

The data used to create this dataset was taken from the parlparse project operated by They Work For You and supported by mySociety.

The dataset is licensed under a Creative Commons Attribution 4.0 International License.

The code used to create this dataset is licensed under an MIT license.

Please contact me if you find any errors in the dataset. The integrity of the public Hansard record is questionable at times, and while I have improved it, the data is presented 'as is'.

Notes

This release is an update of previously released datasets. See full documentation for details.

Files

gov-senti-mean-V24.csv

Files (6.2 GB)

Checksum                              Size
md5:18077a66d63947230e655757ad33e1be  1.3 kB
md5:b8aa951c9db0547a690eeeaf5faf7544  889.7 kB
md5:d7936b12009dfede06345c290da87cf9  2.9 GB
md5:2ffac48ecb98cdf658e6937d6f703e30  3.2 GB
md5:282187f0d0dc7abb7adf68b934269dc2  6.0 kB
md5:03d1c9de11a9353b052bb1c7a06ce711  172.0 kB
md5:dfdde2e2ec6dca36729fad7242f8c4b1  916.2 kB
md5:16014c160bdd286e40cf96c6695fc23f  2.1 kB
md5:83db15581b0837e885c902dd037f684d  14.1 kB
md5:02fb8fa5c9e164c10c6e98543b8feea1  66.6 kB
md5:e4fec7bae959ea9b85e4c0c430cc8a0d  17.1 kB
md5:ea0b86c535ae8b6ce487de00c4b1033d  17.2 kB
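
Because the two largest files are around 3 GB each, it may be worth checking a download against the MD5 checksums listed above before loading it. The following is a minimal Python sketch, not part of the original record: the function names `md5_of_file` and `verify_md5` are hypothetical, and the listing above does not say which checksum belongs to which file name, so the path and expected value must be supplied by the reader.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte files.
    """
    digest = hashlib.md5()
    with Path(path).open("rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Compare a file's MD5 with an expected value such as 'md5:18077a...'.

    The optional 'md5:' prefix used in the listing is stripped first.
    """
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex


# Usage (hypothetical local path; copy the checksum from the listing):
# ok = verify_md5("path/to/downloaded_file", "md5:<checksum from the listing>")
```

A result of `False` suggests an incomplete or corrupted download, or a checksum taken from a different row of the listing.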
