Published July 14, 2018 | Version v1
Conference paper Open

AiiDA and the Materials Cloud: workflow engine with automated provenance and dissemination platform for Open Science

  • 1. Theory and Simulation of Materials (THEOS), EPFL, Lausanne, Switzerland
  • 2. Theory and Simulation of Materials (THEOS), EPFL, Lausanne, Switzerland and Laboratory for Molecular Simulation (LSMO), EPFL, Sion, Switzerland
  • 3. nanotech@surfaces laboratory, Empa, Dübendorf, Switzerland
  • 4. Theory and Simulation of Materials (THEOS), EPFL, Lausanne, Switzerland and UC Berkeley, USA
  • 5. Theory and Simulation of Materials (THEOS), EPFL, Lausanne, Switzerland and Vilnius University, Lithuania
  • 6. Harvard University, USA
  • 7. Laboratory for Molecular Simulation (LSMO), EPFL, Sion, Switzerland
  • 8. Swiss National Supercomputing Centre (CSCS), Lugano, Switzerland

Description

Modern advances in computational technology have facilitated great strides in a wide variety of scientific disciplines and have led to the production of a wealth of valuable research data. However, the extraordinarily quick growth of computational capabilities has left the scientific world wanting for a simple yet effective way of managing these new workflows and the vast amount of data that they produce. We present AiiDA[1], a highly-automated and robust workflow engine written in Python, designed for high-throughput computational science. AiiDA automatically tracks data provenance (and stores it in the form of a directed graph) while managing and automating simulations running either locally or on supercomputers. All data and calculations are stored in a database and can be efficiently queried thanks to a simple but powerful query language. AiiDA enables computational science that is fully reproducible and facilitates sharing of research results. Its symbiotic counterpart, the Materials Cloud [2], is an interactive online platform powered by AiiDA. It is designed to enable seamless sharing and dissemination of resources, of raw data with their provenance as well as of curated open research data in computational science, and to provide cloud resources for simulations. The combination of AiiDA and Materials Cloud (whose architecture is shown in the figure) provides an Open Science Framework fully compliant with the FAIR (findable, accessible, interoperable and reusable) data principles [3]. The diligent and dedicated data management of AiiDA, combined with the user-friendly and ergonomic Materials Cloud, form an invaluable tool to any computational scientist.

[1] G. Pizzi et al., Comp. Mat. Sci. 111, 218 (2016) - www.aiida.net

[2] http://www.materialscloud.org

[3] M. D. Wilkinson et al, Sci. Data 3, 160018 (2016)

Notes

Preprint submitted to RO2018 workshop at IEEE eScience Conference 2018

Files

abstract-aiida-materialscloud.zip

Files (852.2 kB)

Name Size Download all
md5:eab7dd75ef014cee34b83ac65351d27c
380.3 kB Preview Download
md5:92d46e13f413e2002577f21d0dca0b72
471.9 kB Preview Download

Additional details

References