Published May 18, 2018 | Version v1
Project deliverable Open

Report on existing metadata standards, and proposals for new vocabularies

Creators

  • 1. STFC

Description

Structural biology has a long tradition of open data, notably through the deposition of structures in the Protein Data Bank. Nevertheless, challenges remain in the transition to full compliance with the FAIR principles. As reported in M6.3:

  • Current workflows in Structural Biology may not be properly described in an unambiguous manner due to the lack of appropriate metadata standards specifying them (M6.3 report).
  • There is no agreed ontology for the primary data processing, either at the level of integrated studies combining different technologies or even at the single technique level (M6.3 report).
  • Metadata about experimental conditions are often incomplete.
  • As a result, the chain of custody from sample to publication is often broken at several points.

This report describes progress on these issues:

  • Extensions to the mmCIF data standard and its increasing use, particularly for hybrid and integrative models
  • Recording provenance
  • Linking datasets to research projects
  • Adoption of CWL for describing workflows
  • Metadata support in West-Life software

The pressing needs that we found were not for novel metadata vocabularies but for uptake of existing ones. This report therefore sets out practical steps to deliver improvements during the remaining months of West-Life, and steps to enable future culture change and implementation.

Files

D7_9.pdf (136.3 kB)
md5:69e52a732f9ed66f10471559cccaccd4

Additional details

Funding

West-Life – World-wide E-infrastructure for structural biology (grant 675858)
European Commission