There is a newer version of this record available.

Dataset Open Access

Hansard Speeches 1979-2018

Odell, Evan

Full details are available at https://evanodell.com/projects/datasets/hansard-data

Summary

A public dataset of speeches in the Hansard, stored as a csv. The dataset provides information on every speech made in the House of Commons between the parliament returned from the 1979 general election and the end of 2018, with information on the speaking MP, their party, gender, birthdate, starting and finishing dates as an MP, age at the time of the speech, and cabinet/shadow cabinet jobs.

Notes

The code and matching data used to generate this dataset is available on Github.

The data used to create this dataset was taken from the parlparse project operated by They Work For You and supported by mySociety.

The dataset is licensed under a Creative Commons Attribution 4.0 International License.

The code used to create this dataset is licensed under an MIT license.

Please contact me if you find any errors in the dataset. The integrity of the public Hansard record is questionable at times, and while I have improved it, the data is presented 'as is'.

This release is an update of previously released datasets. See full documentation for details.
Files (5.9 GB)
Name Size
all_speech.csv
md5:9869c0b57c01aa8f125a296d80a223ae
2.9 GB Download
all_speech.rds
md5:499f9022ce807b50396233ab5eebc67e
3.1 GB Download
year-senti-mean-V250.csv
md5:0df937dac04a71d67ab445b958c42c3e
23.5 kB Download
2,757
3,320
views
downloads
All versions This version
Views 2,757190
Downloads 3,32087
Data volume 5.2 TB240.6 GB
Unique views 2,140170
Unique downloads 2,07176

Share

Cite as