Published October 8, 2025 | Version v1
Presentation Open

"Streamlining Data Publication: Automatic Metadata and Large Datasets in the Age of AI." Discussion Session at Open Science Conference 2025

  • 1. ROR icon FIZ Karlsruhe – Leibniz Institute for Information Infrastructure
  • 2. ROR icon Karlsruhe Institute of Technology

Description

This is a presentation of the discussion session "Streamlining Data Publication: Automatic Metadata and Large Datasets in the Age of AI" created by the DiTraRe team for the Open Science Conference in Hamburg 2025.

Research data repositories are essential infrastructure for enabling Open Science and ensuring data is Findable, Accessible, Interoperable, and Reusable (FAIR). However, researchers working on repositories face significant challenges in handling ever-increasing volumes of large datasets and the often time-consuming manual process of creating comprehensive, quality metadata. These issues can hinder data publication workflows and limit the findability and usability of valuable research output.

We will present the challenges encountered, propose solutions and first implementations for automatic metadata extraction and large data handling, and discuss how these innovations contribute to a more streamlined and scalable data publication workflow. Participants will have the opportunity to engage with developers and users of repositories, explore the practical implications for their own data management practices, and discuss the potential for adopting similar solutions in other repository contexts.

This discussion session will provide insights into the approaches, technologies, and lessons learned from the Leibniz Science Campus “Digital Transformation of Research” (DiTraRe) work on RADAR and implementation of AI methods in the topic of metadata standardisation. The session is especially relevant for everyone interested in the practical implementation of advanced research data management features that promote reproducibility, efficiency and FAIR principles.

Files

Discussion Session DiTraRe Open Science Conference 2025.pdf

Files (7.3 MB)

Additional details

Related works

Continues
Proposal: 10.5281/zenodo.11109405 (DOI)
Conference paper: 10.5281/zenodo.14872358 (DOI)
Presentation: 10.5281/zenodo.14925184 (DOI)

Funding

Leibniz Association
Leibniz Science Campus "Digital Transformation of Research" (DiTraRe) W74/2022

Dates

Available
2025-10-08