Linguistically Annotated Achemenet Babylonian Texts
Authors/Creators
Contributors
Data collector (4):
Description
This repository contains automatically lemmatized Babylonian cuneiform texts from the Achemenet project (http://www.achemenet.com/). Achemenet provides transliterations and translations of documents written in the Achaemenid Persian Empire (550-330 BCE). The repository provides a snapshot of the Babylonian cuneiform texts available on Achemenet in December 2020. We thank our colleagues at Achemenet for the permission to lemmatize the texts and publish them online. We have converted the Achemenet transliterations into Oracc atf, and we are naturally responsible for any errors introduced into the transliterations during the conversion. We also thank Niek Veldhuis (Berkeley) and Heidi Jauhiainen (Helsinki) for their help at various stages of the project.
The texts have been automatically lemmatized at the Centre of Excellence in Ancient Near Eastern Empires (University of Helsinki), funded by the Research Council of Finland (decision numbers 298647, 330727, and 352747). Linda Leinonen, Matias Sakko, Senja Salmi, and Repekka Uotila assisted in cleaning the data and creating metadata. The texts are also available on Korp (http://urn.fi/urn:nbn:fi:lb-2023062102). Korp allows extensive searches on the texts and presents the results as a KWIC concordance list. It also offers statistical information on the search results and enables the user to download them.
The zip file Achemenet contains the annotated texts and the file Scripts contains the Python scripts used for converting files (XLSX/HTML > ATF > CONLLU > VRT).
For further information on the dataset, see Alstola, T., Sahala, A., Valk, J., & Ong, M. (2026). Semi-Automatic Annotation of Babylonian Cuneiform Texts. Journal of Open Humanities Data, 12(41). https://doi.org/10.5334/johd.494.
Files
Achemenet.zip
Files
(7.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:3d04f601bc9eb80d4da1d9c821753e93
|
2.0 MB | Preview Download |
|
md5:6d75aac21f7100fd9bdf533698ae9bb7
|
5.4 MB | Preview Download |
Additional details
Related works
- Is documented by
- Journal article: 10.5334/johd.494 (DOI)
- Is source of
- Dataset: 10.5281/zenodo.15355780 (DOI)
- Is supplement to
- Model: 10.5281/zenodo.14978872 (DOI)
- Dataset: 10.5281/zenodo.14186072 (DOI)
Funding
- Research Council of Finland
- Semantic domains in Akkadian texts 298647
- Research Council of Finland
- Empire and Village: Imperial Control Strategies and Local Responses in the Babylonian Countryside 330727
- Research Council of Finland
- Centre of Excellence in Ancient Near Eastern Empires / Consortium: ANEE 336673
Software
- Programming language
- Python