Dataset Open Access
Full details are available at https://evanodell.com/projects/datasets/hansard-data
Summary
A public dataset of speeches in the Hansard, stored as a csv. The dataset provides information on every speech made in the House of Commons between the parliament returned from the 1979 general election and the end of 2018, with information on the speaking MP, their party, gender, birthdate, starting and finishing dates as an MP, age at the time of the speech, and cabinet/shadow cabinet jobs.
Notes
The code and matching data used to generate this dataset is available on Github.
The data used to create this dataset was taken from the parlparse project operated by They Work For You and supported by mySociety.
The dataset is licensed under a Creative Commons Attribution 4.0 International License.
The code used to create this dataset is licensed under an MIT license.
Please contact me if you find any errors in the dataset. The integrity of the public Hansard record is questionable at times, and while I have improved it, the data is presented 'as is'.
Name | Size | |
---|---|---|
all_speech.csv
md5:9869c0b57c01aa8f125a296d80a223ae |
2.9 GB | Download |
all_speech.rds
md5:499f9022ce807b50396233ab5eebc67e |
3.1 GB | Download |
year-senti-mean-V250.csv
md5:0df937dac04a71d67ab445b958c42c3e |
23.5 kB | Download |
All versions | This version | |
---|---|---|
Views | 2,757 | 190 |
Downloads | 3,320 | 87 |
Data volume | 5.2 TB | 240.6 GB |
Unique views | 2,140 | 170 |
Unique downloads | 2,071 | 76 |