Published September 4, 2025 | Version v3
Dataset Open

Open Context Database SQL Dump and Parquet Exports

  • 1. Alexandria Archive Institute Open Context
  • 2. The Alexandria Archive Institute

Description

Open Context (https://opencontext.org) publishes free and open access research data for archaeology and related disciplines. An open source (but bespoke) Django (Python) application supports these data publishing services. The software repository is here: https://github.com/ekansa/open-context-py (the "production" branch is the one used for Open Context's primary public deployment). 

We also provide a Docker based approach for installing Open Context via this code repository: https://github.com/opencontext/oc-docker (the "production" branch installs the branch of code used for Open Context's primary public deployment).

 

The Open Context team runs ETL (extract, transform, load) workflows to import data contributed by researchers from various source relational databases and spreadsheets. Open Context uses PostgreSQL (https://www.postgresql.org) relational database to manage these imported data in a graph style schema. The Open Context Python application interacts with the PostgreSQL database via the Django Object-Relational-Model (ORM).

This database dump includes all published structured data organized used by Open Context (table names that start with 'oc_all_'). The binary media files referenced by these structured data records are stored elsewhere. Binary media files for some projects, still in preparation, are not yet archived with long term digital repositories.

These data comprehensively reflect the structured data currently published and publicly available on Open Context. Other data (such as user and group information) used to run the Website are not included. The data are provided in a plain text SQL dump (for restoration into a version 14+ PostgreSQL database) and in the non-proprietary (but binary) parquet file format.

 

IMPORTANT

This database dump contains data from roughly 190+ different projects. Each project dataset has its own metadata and citation expectations. If you use these data, you must cite each data contributor appropriately, not just this Zenodo archived database dump.

Files

opencontext-2025-09-03-schema-and-data.zip

Files (5.5 GB)

Name Size Download all
md5:b98a6115520253f8c9aa9fc8ced1d3f8
1.5 GB Download
md5:7252edf9b9ebaeb951f44ee087924426
275.7 MB Download
md5:f94260439c417b07247c749720f7c31d
460.4 MB Download
md5:064ed77b3aa7c6bb2d0d0d5c0856d43a
70.3 MB Download
md5:f8057b9f6a2a057acb9fb019f0f4e1ba
151.6 MB Download
md5:38cd926b349f2d9662bc446bb7c6e10c
3.0 GB Preview Download

Additional details

Software

Repository URL
https://github.com/ekansa/open-context-py
Programming language
Python
Development Status
Active