EDH
Description
The dataset contains 81,476 cleaned and streamlined Latin inscriptions from the Epigraphic Database Heidelberg (EDH, https://edh-www.adw.uni-heidelberg.de, licensed under CC BY-SA 4.0, https://creativecommons.org/licenses/by-sa/4.0/). The inscriptions were aggregated on 2021/01/21 for a quantitative study of epigraphic trends by the Social Dynamics in the Ancient Mediterranean Project (SDAM, http://sdam.au.dk).
The full lifecycle of the transformation process, including programmatic access, modifications, and streamlining of the original dataset, is documented by a sequence of Python and R scripts (https://github.com/sdam-au/EDH_ETL). The dataset is stored as a JSON file, which can be read from both Python and R.
The scripts used to generate the dataset and their metadata are available via GitHub: https://github.com/sdam-au/EDH_ETL.
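Because the dataset is a single JSON file, it can be opened with the Python standard library alone. The following is a minimal sketch, not the project's own loading code: the helper names `load_inscriptions` and `summarize` are hypothetical, and the top-level layout of the file (a list of records versus a column-oriented object) is not described here, so the sketch only reports what it finds instead of assuming a shape.

```python
import json
from pathlib import Path

# File name as listed under "Files"; adjust the path to your download location.
DATASET_PATH = Path("EDH_text_cleaned_2021-01-21.json")


def load_inscriptions(path):
    """Parse the JSON file and return the resulting Python object."""
    with Path(path).open(encoding="utf-8") as fh:
        return json.load(fh)


def summarize(data):
    """Return the top-level container type and its number of entries.

    The layout of the file is an unknown here, so this only inspects it.
    """
    return type(data).__name__, len(data)


# Only runs when the file has actually been downloaded next to this script.
if DATASET_PATH.exists():
    data = load_inscriptions(DATASET_PATH)
    print(summarize(data))
```

The file is about 245.8 MB on disk, so the parsed object will need noticeably more memory than that. Inspecting the top-level type first helps decide how to convert the data into a table for analysis.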
Files
Name | Size | Checksum |
---|---|---|
EDH_text_cleaned_2021-01-21.json | 245.8 MB | md5:f19c195aad69c94d12e7548160aacc81 |
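The MD5 checksum listed for the file can be used to confirm that a download is complete and uncorrupted. The sketch below is one way to do that with Python's `hashlib`; the function names `md5_of_file` and `verify_download` are hypothetical, and the file is read in chunks so the whole 245.8 MB does not have to be held in memory at once.

```python
import hashlib

# Checksum as published in the file listing (without the "md5:" prefix).
EXPECTED_MD5 = "f19c195aad69c94d12e7548160aacc81"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_download("EDH_text_cleaned_2021-01-21.json")` should return `True` for an intact copy. MD5 is adequate for detecting transfer errors, although it is not a safeguard against deliberate tampering.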