Open Context Database SQL Dump and Parquet Exports
Authors/Creators
- 1. Alexandria Archive Institute Open Context
- 2. The Alexandria Archive Institute
Description
Open Context (https://opencontext.org) publishes free and open access research data for archaeology and related disciplines. An open source (but bespoke) Django (Python) application supports these data publishing services. The software repository is here: https://github.com/ekansa/open-context-py (the "production" branch is the one used for Open Context's primary public deployment).
We also provide a Docker based approach for installing Open Context via this code repository: https://github.com/opencontext/oc-docker (the "production" branch installs the branch of code used for Open Context's primary public deployment).
The Open Context team runs ETL (extract, transform, load) workflows to import data contributed by researchers from various source relational databases and spreadsheets. Open Context uses PostgreSQL (https://www.postgresql.org) relational database to manage these imported data in a graph style schema. The Open Context Python application interacts with the PostgreSQL database via the Django Object-Relational-Model (ORM).
This database dump includes all published structured data organized used by Open Context (table names that start with 'oc_all_'). The binary media files referenced by these structured data records are stored elsewhere. Binary media files for some projects, still in preparation, are not yet archived with long term digital repositories.
These data comprehensively reflect the structured data currently published and publicly available on Open Context. Other data (such as user and group information) used to run the Website are not included. The data are provided in a plain text SQL dump (for restoration into a version 14+ PostgreSQL database) and in the non-proprietary (but binary) parquet file format.
IMPORTANT
This database dump contains data from roughly 190+ different projects. Each project dataset has its own metadata and citation expectations. If you use these data, you must cite each data contributor appropriately, not just this Zenodo archived database dump.
Files
opencontext-2025-09-03-schema-and-data.zip
Files
(5.5 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:b98a6115520253f8c9aa9fc8ced1d3f8
|
1.5 GB | Download |
|
md5:7252edf9b9ebaeb951f44ee087924426
|
275.7 MB | Download |
|
md5:f94260439c417b07247c749720f7c31d
|
460.4 MB | Download |
|
md5:064ed77b3aa7c6bb2d0d0d5c0856d43a
|
70.3 MB | Download |
|
md5:f8057b9f6a2a057acb9fb019f0f4e1ba
|
151.6 MB | Download |
|
md5:38cd926b349f2d9662bc446bb7c6e10c
|
3.0 GB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/ekansa/open-context-py
- Programming language
- Python
- Development Status
- Active