Published October 15, 2024 | Version 0.9.0
Dataset Open

DemocraSci - A Parliamentary Knowledge Graph (4 legislative periods)

Description

Overview

The current dataset encodes policy-making and parliamentary processes in a knowledge graph (KG). This structure allows for the detailed representation of relationships between various entities and tracks the evolution of bills from their initial proposal through to the final vote. It also includes comprehensive metadata on the members of parliament (MPs), such as those who proposed the bills and participated in discussions across different chambers. Additionally, the dataset contains the full text of speeches delivered by MPs during these deliberations.

Files contained

The DemocraSci dataset comprises the following files:

  • neo4j.dump : the file containing the Neo4J graph database. 
  • democrasci_rdb.db : a SQL database formed by the speeches given by MPs in the Parliament chambers. The DB contains a single table with two columns, one with the speech transcript and another with a unique identifier. This uid links each speech with a Speech node in the KG.

The previous information is extracted for four consecutive legislative periods (48 to 51), from year 2007 to 2023.

The structure of this KG is detailed in

https://democrasci.jkminder.ch/index.html

Here, we provide more information on the different nodes and relations contained in the graph, as well as the properties of each. 

Dataset creation

The dataset has been parsed from the CuriaDB database of the Swiss parliament. From the information contained within, we defined a schema that encodes the political process and all the entities involved in it. 

All the different extraction and processing tasks, from the parsing of the original database to the curation of the data and the manipulation of the KG are done using Python. Specifically, for the population of the Neo4J database we leveraged Data2Neo.

Resources

The knowledge graph has been created using Neo4J. The Community Edition is free to use and already offers many tools for KG exploration and graph data science

https://neo4j.com/deployment-center/#community

No4J can be used through the command line interface, launching a web instance where previously a Neo4J dump has been loaded. 

https://neo4j.com/docs/operations-manual/current/backup-restore/restore-dump/

Additionally, the Neo4J desktop app facilitates restoring a dump, and enables using more tools for exploration, such as Bloom

https://neo4j.com/docs/desktop-manual/current/operations/create-from-dump/

For the exploration of the SQL DB, the easiest option is DB Browser

Files

Files (1.2 GB)

Name Size Download all
md5:8d23414de01bdc7cdc65c80e565a3f27
692.5 MB Download
md5:ce7b2dee5de6d15e769afd17e7ee46f3
469.5 MB Download

Additional details

Related works

Is described by
Journal article: 10.1111/spsr.12590 (DOI)

Funding

Swiss National Science Foundation
Analyzing Co-Sponsorship Networks from 127 Years of the Swiss Federal Assembly 184963
Swiss Data Science Center
A Research Platform for Data-Driven Democracy Studies C17-13

Dates

Available
2024-10-15
Date submission to Zenodo
Collected
2024
Last year collection of CuriaDB data