Published October 6, 2023 | Version camera ready
Conference paper Open

Integrating R in a distributed scientific workflow via a Jupyter-based Environment

  • 1. University of Salento
  • 2. University of Amsterdam
  • 3. National Research Council

Description

The Research Infrastructure Lifewatch Italy has developed a Virtual Research Environment for studies on phytoplankton ecology that includes computational services based on R, a programming language widely used for data science and ecology. Here we verify the feasibility of using a Jupyter-based research environment, the NaaVRE, which has so far been tested only with Python, to run R code in a workflow on the Cloud. The successful execution demonstrates the potential of R in Cloud-based research environments. However, further investigation is needed, in particular to overcome the lack of dependency declarations in R. The possibility of performing analyses in a workflow, combined with the computational resources of remote infrastructures, will support scientists in carrying out FAIR and innovative research in a more efficient, integrated and collaborative way.

Files

2023.conference.escience.r-containerization.camera.pdf (591.1 kB)

Additional details

Funding

  • European Commission: Blue-Cloud 2026 – A federated European FAIR and Open Research Ecosystem for oceans, seas, coastal and inland waters (101094227)
  • European Commission: CLARIFY – CLoud ARtificial Intelligence For pathologY (860627)
  • European Commission: ENVRI-FAIR – ENVironmental Research Infrastructures building Fair services Accessible for society, Innovation and Research (824068)