Published June 10, 2022
| Version v1
Dataset
Open
Clean OpenLegalData - German
Creators
Description
This dataset of German court cases is obtained from OpenLegalData. The data received clearly mentioned information such as court, level of appeal, and ECLI (European Case Law Identifier). However, tenor, tatbestand, gründe, and entscheidungsgründe were available only in HTML format. Out of over 100000 extracted cases, we were able to parse 43337 HTML only due to structural problems with HTML content.
The resulting dataset is approximate ~1.1 GBs with 43337 rows and has the following 12 features:
Feature | Total | Example content |
id | 43337 | 127981 |
slug | 43337 | ag-volklingen-2002-07-10-5c-c-24102 |
ecli | 10831 | NaN |
date | 43337 | 2002-07-10 |
court | 43337 | Amtsgericht Völklingen |
jurisdiction | 43337 | Ordentliche Gerichtsbarkeit |
level_of_appeal | 43337 | Amtsgericht |
type | 43337 | Urteil |
tenor | 36282 | 1. Die Beklagten werden als Gesamtschuldner verurteilt, an die ... |
tatbestand | 24243 | Auf die Darstellung des Tatbestandes wird gemäß § 313 Abs ... |
gründe | 27144 | Die Klage ist zulässig und begründet. Die Klägerin kann von de... |
entscheidungsgründe | 24038 | Die Klage ist zulässig und begründet. Die Klägerin kann von de... |
Important: Make sure to use "|" (pipe-symbol) as CSV separator.
Example:
data = pd.read_csv('clean_OLD.csv', sep="|")
Files
clean_OLD.csv
Files
(1.1 GB)
Name | Size | Download all |
---|---|---|
md5:69afc3f41a717f25467d38840f1fa0a3
|
1.1 GB | Preview Download |
Additional details
References
- Ostendorff, Malte, Till Blume, and Saskia Ostendorff. "Towards an open platform for legal information." Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020. 2020.