Published July 21, 2025 | Version v1
Presentation Open

Data Everywhere: Using and Sharing Scientific Data with Pelican

  • 1. ROR icon University of Wisconsin–Madison
  • 2. Morgridge Institute for Research

Description

While there are perhaps hundreds of petabytes of datasets available to researchers, instead of swimming in seas of data there is often a feel of sitting in a data desert: there’s a mismatch between what sits in carefully curated repositories around the world versus what’s accessible at the computational resources locally available. The Pelican Project (https://pelicanplatform.org/) aims to bridge the gap between repositories and compute by providing a software platform to connect the two sides. Pelican’s flagship instance, the Open Science Data Federation (OSDF), serves billions of objects and more than a hundred petabytes a year to national-scale resources. This tutorial, targeted at end-user data consumers and data providers, covers the data access model of Pelican, guides participants to access and share data through an existing data federation, and considers how data movement via Pelican and the OSDF can enable their research computing.

Files

pearc25_data-everywhere.pdf

Files (14.6 MB)

Name Size Download all
md5:b4f410fe12364c15790648c720d8446e
5.4 MB Preview Download
md5:4dffa9af8cedd49a5d8619268a114b0f
9.2 MB Preview Download

Additional details

Funding

U.S. National Science Foundation
Pelican: Advancing the Open Science Data Federation Platform 2331480