Published May 19, 2024 | Version 2024-05-19
Dataset Open

Corpus of Resolutions: UN Security Council (CR-UNSC)

  • 1. Ludwig-Maximilians-Universität München
  • 2. Scuola Superiore Sant'Anna
  • 3. King's College London

Description

Overview

The Corpus of Resolutions: UN Security Council (CR-UNSC) collects and presents, for the first time in human- and machine-readable form, all resolutions, drafts, and meeting records of the UN Security Council, including detailed metadata, as published by the UN Digital Library and revised by the authors.

The United Nations Security Council (UNSC) is the most influential of the principal UN organs. It is composed of five permanent and ten non-permanent members, and its functioning is constrained by the political context in which it operates. During the Cold War, the complex political relationships between the permanent members and their veto powers significantly affected the capacity of the UNSC to address violations of international peace and security, with only 646 resolutions passed from 1946 to 1989. Since the 1990s, the activity of the UNSC has increased dramatically, bringing the total to 2721 resolutions by the end of 2023. The length, complexity and thematic breadth of the resolutions have also increased, prompting calls to redefine the UNSC as a quasi-legislative body.

Under Articles 24 and 25 of the UN Charter, member states have conferred upon the UNSC the "primary responsibility for the maintenance of international peace and security" and have agreed "to accept and carry out" its decisions. The UNSC discharges this function through the powers bestowed upon it under Chapter VI of the UN Charter, "Pacific Settlement of Disputes", Chapter VII, "Action with Respect to Threats to the Peace, Breaches of the Peace, and Acts of Aggression", Chapter VIII, "Regional Arrangements", and Chapter XII, "International Trusteeship System".

Under the peace and security mandate, its areas of activity cover disarmament, pacific settlement of disputes, enforcement, and, until 1994, strategic areas in a trusteeship agreement. Its functions also pertain to the correct working of the United Nations, covering issues of membership, the appointment of the Secretary-General, the election of judges of the International Court of Justice (ICJ), the calling of special and emergency sessions of the General Assembly, and the amendment of the Charter and of the ICJ Statute.

Please refer to the Codebook for a detailed explanation of the dataset and instructions on how to make use of it.

 

Updates

The CR-UNSC will be updated at least once per year.

In case of serious errors, an update will be provided at the earliest opportunity and a highlighted advisory issued on the Zenodo page of the current version. Minor errors will be documented in the GitHub issue tracker and fixed with the next scheduled release.

The CR-UNSC is versioned according to the day of the last run of the data pipeline, in the ISO format YYYY-MM-DD. Its initial release version is 2024-05-03.
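
Because version identifiers are ISO dates, they can be parsed and compared chronologically without any custom logic. The following is a minimal Python sketch of that idea; the helper name `parse_version` is illustrative and not part of the dataset's own tooling.

```python
from datetime import date


def parse_version(version: str) -> date:
    """Parse a CR-UNSC version string (ISO format YYYY-MM-DD) into a date.

    Hypothetical helper for illustration; raises ValueError on malformed input.
    """
    return date.fromisoformat(version)


initial = parse_version("2024-05-03")  # initial release named in the text
current = parse_version("2024-05-19")  # version of this record

# ISO dates compare chronologically, so the newest release is the largest one.
newest = max(initial, current)
days_between = (current - initial).days
```

The same ordering holds for the raw strings, so sorting a list of version identifiers alphabetically also sorts them by release date.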

Notifications regarding new and updated data sets will be published on my academic website at www.seanfobbe.com or on the Fediverse at @seanfobbe@fediscience.org

 

Changelog

  • New variant: EN_TXT_BEST containing a write-out of the English resolution texts equivalent to the CSV file text variable
  • New diagrams: bar charts of top M49 regions and sub-regions of countries mentioned in resolution texts
  • Fixed naming mix-up of BIBTEX and GRAPHML zip archives
  • Fixed whitespace character detection in citation extraction (adds ca. 10% more citations)
  • Fixed improper merging of weights in citation network
  • Fixed "cannot xtfrm data frames" warning
  • Improved REGEX detection for certain geographic entities
  • Improved Codebook (headings, citation network docs)

 

Key Metrics

Version: 2024-05-19

Scope: UNSC Resolutions from 1 (1946) up to and including 2722 (2024)

Tokens: 3,704,016 (English resolution texts)

Languages: English, French, Spanish, Arabic, Chinese, Russian

 

Features

  • 82 Variables
  • Resolution texts in all six official UN languages (English, French, Spanish, Arabic, Chinese, Russian)
  • Draft texts of resolutions in English
  • Meeting record texts in English
  • URLs to draft texts in all other languages (French, Spanish, Arabic, Chinese, Russian)
  • URLs to meeting record texts in all other languages (French, Spanish, Arabic, Chinese, Russian)
  • Citation data as GraphML (UNSC-to-UNSC resolutions and UNSC-to-UNGA resolutions)
  • Bibliographic database in BibTeX/OSCOLA format for use with, e.g., Zotero, EndNote and JabRef
  • Extensive Codebook to explain the uses of the dataset
  • Compilation Report and Quality Assurance Report explaining the construction and validation of the data set
  • Publication-quality diagrams for teaching, research and all other purposes (PDF for printing, PNG for web)
  • Open and platform-independent file formats (CSV, PDF, TXT, GraphML)
  • Software version controlled with Docker
  • Publication of full data set (Open Data)
  • Publication of full source code (Open Source)
  • Data published under Public Domain waiver (CC Zero 1.0)
  • Source Code is Free Software published under the GNU General Public License Version 3 (GNU GPL v3)
  • Secure cryptographic signatures for all files in version of record (SHA2-256 and SHA3-512)
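
The SHA2-256 and SHA3-512 values mentioned in the last feature can be used to confirm that a downloaded file matches the version of record. The sketch below shows one way to do this with Python's standard `hashlib` module. It assumes you have copied the expected hexadecimal digests from the published hash listing yourself; the function names `file_digest` and `verify_file` are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, algorithm: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a file, reading in chunks to limit memory use."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def verify_file(path: Path, expected_sha256: str, expected_sha3_512: str) -> bool:
    """Check a file against both expected digests (hex, case-insensitive).

    The expected values are assumed to come from the published hash listing.
    """
    return (
        file_digest(path, "sha256") == expected_sha256.lower()
        and file_digest(path, "sha3_512") == expected_sha3_512.lower()
    )
```

Reading in chunks matters here because some of the archives are several gigabytes in size and should not be loaded into memory at once.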

 

Recommended Variants

Traditional Scholars

  • ALL_PDF_Resolutions
  • EN_TXT_BEST
  • BIBTEX_OSCOLA

Quantitative Scholars

  • ALL_CSV_FULL
  • EN_TXT_BEST
  • CITATIONS_GRAPHML

 

Please refer to the Codebook for details on each variant. The ZIP archives include texts in all languages, unless noted in the filename.
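
For the CITATIONS_GRAPHML variant, GraphML is plain XML and can be read with Python's standard library alone. The sketch below counts how often each node appears as the target of an edge. It assumes that edges point from the citing to the cited resolution and uses made-up node identifiers; consult the Codebook for the actual node and edge attributes.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Standard GraphML XML namespace.
GRAPHML_NS = {"g": "http://graphml.graphdrawing.org/xmlns"}


def in_degrees(graphml_text: str) -> Counter:
    """Count how often each node is the target of an edge.

    Assumption: an edge runs from the citing document (source)
    to the cited document (target).
    """
    root = ET.fromstring(graphml_text)
    edges = root.iterfind(".//g:edge", GRAPHML_NS)
    return Counter(edge.get("target") for edge in edges)


# Tiny hand-written graph with hypothetical node ids, for illustration only.
sample = """<graphml xmlns="http://graphml.graphdrawing.org/xmlns">
  <graph id="G" edgedefault="directed">
    <node id="res_a"/>
    <node id="res_b"/>
    <node id="res_c"/>
    <edge source="res_b" target="res_a"/>
    <edge source="res_c" target="res_a"/>
    <edge source="res_c" target="res_b"/>
  </graph>
</graphml>"""

citation_counts = in_degrees(sample)
```

For a file on disk, `ET.parse(path).getroot()` yields the same kind of root element. Dedicated graph libraries can also read GraphML if more than simple counts are needed.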

We strongly recommend using the CSV files for quantitative analysis. If you find CSV hard to use and want to analyze only the text of resolutions, the EN_TXT_BEST variant is a mix of expert-revised OCR and born-digital texts equivalent to the "text" variable in the CSV file.
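
As a rough illustration of working with the "text" variable, the following Python sketch reads CSV rows and counts whitespace-separated tokens per resolution. Only the column name `text` comes from the description above; the `example_id` column and the sample rows are invented for the demonstration, and whitespace splitting is a simplification that will not reproduce the token count given under Key Metrics.

```python
import csv
import io

# Long resolution texts may exceed the csv module's default per-field limit.
csv.field_size_limit(10**8)


def whitespace_token_counts(csv_stream) -> list[int]:
    """Return a rough whitespace token count for each row's "text" field."""
    reader = csv.DictReader(csv_stream)
    return [len(row["text"].split()) for row in reader]


# Invented sample data standing in for the real CSV file.
sample = io.StringIO(
    "example_id,text\n"
    '1,"The Security Council, having met, decides to remain seized."\n'
    '2,"Adopted unanimously."\n'
)

counts = whitespace_token_counts(sample)
```

To run this against the real data, pass an open file handle instead of the sample, for example `open(path, newline="", encoding="utf-8")`, where the path and the UTF-8 encoding are assumptions to check against the Codebook.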

 

Compilation Report and Quality Assurance Report

With every compilation of the full data set, an extensive Compilation Report and a detailed Quality Assurance Report are created and published in PDF format.

The Compilation Report includes the source code for the pipeline architecture, comments and explanations of design decisions, relevant computational results, exact timestamps and a table of contents with clickable internal hyperlinks to each section.

The Quality Assurance Report contains a count of all hard tests and expectations, additional visualizations and documented test results for all soft tests that require further interpretation.

The Compilation Report, Quality Assurance Report and Source Code are published under the following DOI: https://zenodo.org/doi/10.5281/zenodo.7319783

 

Attribution and Copyright

This data is derived from the United Nations Digital Library at https://digitallibrary.un.org. Records were accessed and downloaded on 13 and 26 March 2024, with additional work on revisions and corrections up to and including the date given as the version number.

Pursuant to UN Administrative Instruction ST/AI/189/Add.9/Rev.2 of 17 September 1987, all official records and United Nations Documents (including resolutions, compilations of resolutions, drafts and meeting records) are in the public domain. We wish to honor the letter and spirit of this UN policy. To ensure the widest possible distribution of official UN documents and to promote the international rule of law, we waive any copyright that might have accrued by creating the dataset under a Creative Commons CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.

 

Disclaimer

This data set is an academic initiative and is not associated with or endorsed by the United Nations or any of its constituent organs and organizations.

 

Author Websites

Personal Website of Seán Fobbe

Personal Website of Lorenzo Gasbarri

Personal Website of Niccolò Ridi

 

Contact

Did you discover any errors? Do you have suggestions on how to improve the data set? You can either post these to the Issue Tracker on GitHub or contact Seán Fobbe via https://seanfobbe.com/contact/

Files

CR-UNSC_2024-05-19_Codebook.pdf

Files (5.7 GB)

Checksum                              Size
md5:0cf7e66cc9b67e5ad4033a76157107b4  100.4 MB
md5:38834a2c03c3bce65888759f2a8a881b  50.7 MB
md5:7d60a7659ede413e4e2be07730e007a5  3.3 GB
md5:4aa0717c6b5134a2b4fefde7558c9e9c  144.1 MB
md5:c51f95629c6d74b9a2131361f78f0e0c  11.1 MB
md5:a4a805bd49fc9aa388f368e5ba3bf900  1.1 MB
md5:2517016ba0366537cb15b98ac4a7521d  1.2 MB
md5:354c59574b90a156bfccc69eb29d8452  2.6 MB
md5:512bc2af373c2623a9ca4691c927f04d  6.9 kB
md5:0ecadb7f5b627a4ee795e0e9fd01486b  236.1 MB
md5:174f46643e4548056012d11a7ef1c9e4  1.9 GB
md5:e8be280bf3714ab67ff10a243b31935f  8.2 MB

Additional details

Related works

Is compiled by: Software: 10.5281/zenodo.11212057 (DOI)
Is derived from: https://digitallibrary.un.org/ (URL)

Software

Repository URL: https://github.com/SeanFobbe/cr-unsc
Programming language: R
Development Status: Active