Published March 3, 2026 | Version v1
Poster Open

Two Sides of the RSE Coin: Reproducible and Semantically Interoperable Workflows in Climate and Geoarchaeological Research

Description

In geoscientific and geoarchaeological research, analytical workflows often remain locked in Excel-based environments. While such approaches are convenient, they limit reproducibility, transparency, and interoperability. This paper presents how Research Software Engineering (RSE) methodology can transform these “black box” workflows into reproducible, FAIR-compliant and semantically interoperable processes by reimplementing established analytical routines in open, script-based environments using Python and R.

The work builds on data from tephra and sediment analyses in the Eifel region, focusing on EPMA and LA-ICP-MS. Traditionally, these datasets were processed and visualised in Excel, making version control, reproducibility, and methodological transparency difficult.

Building on methods from ELSAinteractive++ (PHd by B.W. Diensberg), a C++-based application for visualising sediment cores, the project replaces static, closed workflows with modular Python scripts based on pandas and matplotlib. Each processing step, from data import to statistical transformation and plotting, is now reproducible and extensible.

Further integration draws on an Eifel GDGT dataset and analytical workflow to demonstrate how existing R code for biomarker-based temperature reconstructions can be re-implemented in Python for enhanced cross-platform reproducibility. Together, these examples illustrate the practical implementation of the FAIR4RS principles, Findable, Accessible, Interoperable, and Reusable research software.

Beyond reproducible computation, the project also explores semantic data publication. Laboratory results and analytical metadata are modelled as RDF triples to ensure machine-actionable interoperability with Cultural Heritage and Earth Science data. Python scripts assist in transforming tabular outputs into RDF using common vocabularies, while the SPARQLing Unicorn Toolkit enables the conversion of these RDF datasets into human-readable HTML pages automatically published on GitHub Pages. This dual approach, semantic and computational FAIRification, bridges the gap between traditional analytical workflows and Linked Open Data infrastructures.

In contrast to spreadsheet-based workflows, the RSE-driven approach offers:

  • Versioned and transparent analysis pipelines via Git-based code;
  • Parameter reproducibility through explicit scripting;
  • Scalability and automation for high-volume datasets; and
  • Semantic interoperability through FAIR metadata and RDF publication.

By showcasing examples from the Campanian Ignimbrite and Eifel tephra research, the paper highlights how RSE methodology enhances both reproducibility and semantic interoperability across disciplinary borders. It demonstrates that adopting RSE principles in geoscientific and archaeological workflows is not just a technical improvement but a cultural shift, from isolated Excel sheets to open, machine-actionable, and verifiable research.

Files

deRSE26_Stuttgart_21313_GeoSciRSE.pdf

Files (2.7 MB)

Name Size Download all
md5:cca9ca8159adc3a0c6f2d9a9fcaa3fc2
2.7 MB Preview Download

Additional details

Software