Dataset Open Access
The repository contains a cleaned and pre-processed corpus of parliamentary debates from the Parliamentary Assembly of Bosnia and Herzegovina. The corpus is accompanied by the metadata on elected representatives and their political parties. It covers the period of 1998-2018 (six complete terms) and counts over 127 thousand speeches.
If you use the dataset, please cite: Mochtak, Michal, Josip Glaurdić, Christophe Lesschaeve, and Ensar Muharemović (2022): BiHCorp: Corpus of Parliamentary Debates in Bosnia and Herzegovina (v1.1.1),
v1.1.1 (latest version)
- added the concept DOI to codebooks (DOI was generated only after the repository was published)
- fixed a typo in one of the debates' date
- fixed minor inconsistencies in the tag column
- originally posted on GESIS repository (https://doi.org/10.7802/2387); migrated to ZENODO due to limitations concerning the concept DOI