Poster Open Access

The Swiss Data Science Center on a mission to empower reproducible, traceable and reusable science

Schymanski, Stanislaus Josef; Eric Bouillet; Olivier Verscheure

JSON Export

  "files": [
    {
      "links": {
        "self": ""
      },
      "checksum": "md5:ba914ff03484f06c97c5d94389aa2cda",
      "bucket": "80a7dfa7-501d-419e-8803-8b7616acc872",
      "key": "SDSC_Poster3a.pdf",
      "type": "pdf",
      "size": 3206238
    }
  ],
  "owners": [
  "doi": "10.5281/zenodo.581298", 
  "stats": {
    "version_unique_downloads": 113.0, 
    "unique_views": 78.0, 
    "views": 80.0, 
    "version_views": 80.0, 
    "unique_downloads": 114.0, 
    "version_unique_views": 78.0, 
    "volume": 391161036.0, 
    "version_downloads": 121.0, 
    "downloads": 122.0, 
    "version_volume": 387954798.0
  },
  "links": {
    "doi": "", 
    "latest_html": "", 
    "bucket": "", 
    "badge": "", 
    "html": "", 
    "latest": ""
  },
  "created": "2017-05-19T14:35:54.687732+00:00", 
  "updated": "2020-01-20T15:49:09.643693+00:00", 
  "conceptrecid": "786161", 
  "revision": 10, 
  "id": 581298, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.581298", 
    "description": "<p>Our abilities to collect, store and analyse scientific data have sky-rocketed in the past decades, but at the same\u00a0time, a disconnect between data scientists, domain experts and data providers has begun to emerge. Data scientists\u00a0are developing more and more powerful algorithms for data mining and analysis, while data providers are making\u00a0more and more data publicly\u00a0available, and yet many, if not most, discoveries are based on specific data and/or\u00a0algorithms that \"are available from the authors upon request\".<br>\nIn the strong belief that scientific progress would be much faster if reproduction and re-use of such data\u00a0and algorithms were made easier, the Swiss Data Science Center (SDSC) has\u00a0committed to providing an open framework for the handling and tracking of scientific data and algorithms, from raw data and first-principles equations\u00a0to final data products and visualisations, modular simulation models and benchmark evaluation algorithms. Led\u00a0jointly by EPFL and ETH Zurich, the SDSC is composed of a distributed multi-disciplinary team of data scientists and experts in select domains. The center aims to federate data providers, data and computer scientists, and\u00a0subject-matter experts around a cutting-edge analytics platform offering user-friendly tooling and services to help\u00a0with the adoption of Open Science, fostering research productivity and excellence.<br>\nIn this presentation, we will discuss our vision of a highly scalable, open but secure community-based platform for sharing, accessing, exploring, and analyzing scientific data in easily reproducible workflows, augmented\u00a0by automated provenance and impact tracking, knowledge graphs, fine-grained access rights and digital rights\u00a0management, and a variety of domain-specific software tools. For maximum interoperability, transparency and\u00a0ease of use, we plan to utilize notebook interfaces wherever possible, such as Apache Zeppelin and Jupyter.<br>\nFeedback and suggestions from the audience will be gratefully considered.</p>", 
    "license": {
      "id": "CC-BY-4.0"
    },
    "title": "The Swiss Data Science Center on a mission to empower reproducible, traceable and reusable science", 
    "notes": "Poster presented at  European Geosciences Union General Assembly 2017, id: EGU2017-12179.", 
    "relations": {
      "version": [
        {
          "count": 1,
          "index": 0,
          "parent": {
            "pid_type": "recid",
            "pid_value": "786161"
          },
          "is_last": true,
          "last_child": {
            "pid_type": "recid",
            "pid_value": "581298"
          }
        }
      ]
    },
    "communities": [
      {
        "id": "osr"
      }
    ],
    "keywords": [
      "Open Science", 
      "Data Science", 
      "Science 2.0", 
      "Digital Science", 
      "Reusable Research"
    ],
    "publication_date": "2017-04-28", 
    "creators": [
      {
        "affiliation": "Swiss Data Science Center (SDSC), Swiss Federal Institute of Technology (ETH), Zurich, Switzerland",
        "name": "Schymanski, Stanislaus Josef"
      },
      {
        "affiliation": "Swiss Data Science Center (SDSC), \u00c9cole polytechnique f\u00e9d\u00e9rale de Lausanne (EPFL), Switzerland",
        "name": "Eric Bouillet"
      },
      {
        "affiliation": "Swiss Data Science Center (SDSC), \u00c9cole polytechnique f\u00e9d\u00e9rale de Lausanne (EPFL), Switzerland",
        "name": "Olivier Verscheure"
      }
    ],
    "meeting": {
      "dates": "23-28 April 2017", 
      "title": "European Geosciences Union General Assembly 2017", 
      "acronym": "EGU", 
      "url": "", 
      "session": "IE2.4/ESSI3.10", 
      "place": "Vienna, Austria", 
      "session_part": "EGU2017-12179"
    },
    "access_right": "open", 
    "resource_type": {
      "type": "poster", 
      "title": "Poster"
    }
  }
                  All versions   This version
Views             80             80
Downloads         121            122
Data volume       388.0 MB       391.2 MB
Unique views      78             78
Unique downloads  113            114


Cite as