Published June 15, 2022
| Version v1
Dataset
Open
Supplementary Data Files of the Greek Parliament Proceedings Dataset
Authors/Creators
- 1. Athens University of Economics & Business, Greece
- 2. Stockholm University, Sweden
Description
The dataset includes supplementary files of the previous upload "A Greek Parliament Proceedings Dataset for Computational Linguistics and Political Analysis". Specifically, it includes the 5,355 sitting record files from which speeches were extracted in the form of the conversation that took place in the Greek Parliament as well as the previous versions of the tell_all_cleaned.csv before preprocessing and cleaning, namely tell_all.csv and tell_all_FILLED.csv.
- original_data: A folder of the original record files downloaded from the website of the Greek Parliament (https://www.hellenicparliament.gr/Praktika/Synedriaseis-Olomeleias). The filenames are edited to follow the naming format "recordDate_id_periodNo_sessionNo_sittingNo.ext".
- _data: A folder of the record files converted to text format with filenames translated to English.
- tell_all.csv: The initial file of all extracted speeches before preprocessing and cleaning. The file includes the following columns: member_name, sitting_date, parliamentary_period, parliamentary_session, parliamentary_sitting, political_party, government, member_region, roles, member_gender, speaker_info, speech.
- tell_all_FILLED.csv: This file is an intermediate step of preprocessing of the tell_all.csv file. In this file, missing names of chairmen of various parliamentary sittings are filled. It includes the same columns as the tell_all.csv file.
-------------
Acknowledgments:
This work was supported by the European Union’s Horizon 2020 research and innovation program ``FASTEN'' under grant agreement No 825328 and the non profit data journalism organization iMEdD.org.
Files
supplementary_data.zip
Files
(3.2 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:fd8723e585433b4faf3c65b27a4a3d04
|
3.2 GB | Preview Download |