Dataset Open Access

Hansard Speeches 1979-2018

Odell, Evan

Full details are available at https://evanodell.com/projects/datasets/hansard-data

Summary

A public dataset of speeches in the Hansard, stored as a csv. The dataset provides information on every speech made in the House of Commons between the parliament returned from the 1979 general election and the end of 2018, with information on the speaking MP, their party, gender, birthdate, starting and finishing dates as an MP, age at the time of the speech, and cabinet/shadow cabinet jobs.

Notes

The code and matching data used to generate this dataset is available on Github.

The data used to create this dataset was taken from the parlparse project operated by They Work For You and supported by mySociety.

The dataset is licensed under a Creative Commons Attribution 4.0 International License.

The code used to create this dataset is licensed under an MIT license.

Please contact me if you find any errors in the dataset. The integrity of the public Hansard record is questionable at times, and while I have improved it, the data is presented 'as is'.

This release is an update of previously released datasets. See full documentation for details.
Files (5.8 GB)
Name Size
hansard-1979-2018-v261.csv
md5:7509872ee34ad6db817b3bff8b395735
2.8 GB Download
hansard-1979-2018-v261.rds
md5:95bc5f91c3752def0c0553fc9ba9d8e9
3.0 GB Download
906
1,490
views
downloads
All versions This version
Views 906277
Downloads 1,490154
Data volume 2.7 TB443.4 GB
Unique views 692245
Unique downloads 1,002120

Share

Cite as