Published June 15, 2022 | Version v1
Dataset Open

Supplementary Data Files of the Greek Parliament Proceedings Dataset

  • 1. Athens University of Economics & Business, Greece
  • 2. Stockholm University, Sweden

Description

The dataset includes supplementary files of the previous upload "A Greek Parliament Proceedings Dataset for Computational Linguistics and Political Analysis". Specifically, it includes the 5,355 sitting record files from which speeches were extracted in the form of the conversation that took place in the Greek Parliament as well as the previous versions of the tell_all_cleaned.csv before preprocessing and cleaning, namely tell_all.csv and tell_all_FILLED.csv.

  • original_data: A folder of the original record files downloaded from the website of the Greek Parliament (https://www.hellenicparliament.gr/Praktika/Synedriaseis-Olomeleias). The filenames are edited to follow the naming format "recordDate_id_periodNo_sessionNo_sittingNo.ext".
  • _data: A folder of the record files converted to text format with filenames translated to English.
  • tell_all.csv: The initial file of all extracted speeches before preprocessing and cleaning. The file includes the following columns: member_name, sitting_date, parliamentary_period, parliamentary_session, parliamentary_sitting, political_party, government, member_region, roles, member_gender, speaker_info, speech.
  • tell_all_FILLED.csv: This file is an intermediate step of preprocessing of the tell_all.csv file. In this file, missing names of chairmen of various parliamentary sittings are filled. It includes the same columns as the tell_all.csv file.

-------------

Acknowledgments:

This work was supported by the European Union’s Horizon 2020 research and innovation program ``FASTEN'' under grant agreement No 825328 and the non profit data journalism organization iMEdD.org.

Files

supplementary_data.zip

Files (3.2 GB)

Name Size Download all
md5:fd8723e585433b4faf3c65b27a4a3d04
3.2 GB Preview Download

Additional details

Funding

European Commission
FASTEN - Fine-Grained Analysis of Software Ecosystems as Networks 825328