There is a newer version of the record available.

Published June 25, 2025 | Version v2
Dataset Open

Open Context Database SQL Dump and Parquet Exports

  • 1. Alexandria Archive Institute Open Context
  • 2. The Alexandria Archive Institute

Description

Open Context (https://opencontext.org) publishes free and open access research data for archaeology and related disciplines. An open source (but bespoke) Django (Python) application supports these data publishing services. The software repository is here: https://github.com/ekansa/open-context-py (the "production" branch is the one used for Open Context's primary public deployment). 

We also provide a Docker based approach for installing Open Context via this code repository: https://github.com/opencontext/oc-docker (the "production" branch installs the branch of code used for Open Context's primary public deployment).

 

The Open Context team runs ETL (extract, transform, load) workflows to import data contributed by researchers from various source relational databases and spreadsheets. Open Context uses PostgreSQL (https://www.postgresql.org) relational database to manage these imported data in a graph style schema. The Open Context Python application interacts with the PostgreSQL database via the Django Object-Relational-Model (ORM).

This database dump includes all published structured data organized used by Open Context (table names that start with 'oc_all_'). The binary media files referenced by these structured data records are stored elsewhere. Binary media files for some projects, still in preparation, are not yet archived with long term digital repositories.

These data comprehensively reflect the structured data currently published and publicly available on Open Context. Other data (such as user and group information) used to run the Website are not included. The data are provided in a plain text SQL dump (for restoration into a version 14+ PostgreSQL database) and in the non-proprietary (but binary) parquet file format.

 

IMPORTANT

This database dump contains data from roughly 190+ different projects. Each project dataset has its own metadata and citation expectations. If you use these data, you must cite each data contributor appropriately, not just this Zenodo archived database dump.

Files

opencontext-2025-06-24.zip

Files (5.3 GB)

Name Size Download all
md5:0d48dc478e2fd7a8c372d8bd34d4926d
1.6 GB Download
md5:891e81e1f7b5857372ad75f8c0a92f82
275.5 MB Download
md5:b2133c4d74a0136fcd6e2ad168db3ded
459.9 MB Download
md5:86ca939100a215f8f2d1ca7c36a0448e
70.3 MB Download
md5:f6037bca2d29708bc958dc4acc21ca1b
153.8 MB Download
md5:205bb41dfbdd50339d4c7fbe1dd24230
2.8 GB Preview Download

Additional details

Software

Repository URL
https://github.com/ekansa/open-context-py
Programming language
Python
Development Status
Active