Published September 13, 2023 | Version v1
Lesson Open

Dealing with FAIR data

  • 1. IGDORE


This is the material for a workshop I gave at the University of Maribor Open Science Summer School 2023.

The lecture was meant for a very diverse class of students (from Bachelor to PhD degrees), and is a broad introduction to FAIR data, with a series of hands-on exercises on the FAIR principles.

  • The Dealing with FAIR data PDF file is the backbone of the lecture
    • it starts with an introduction: a broad recap of research data, the definition of open data, the research data lifecycle, a list of terminologies useful to follow along, and a brief mention of the FAIR principles (things the students had seen the day before)
    • the second part is about the FAIR principles in action with hands-on exercises for each of the four letters: persistent identifiers, APIs, machine-readable formats, licenses, etc.
    • the third part is about the process of FAIRification of a dataset: we look at tabular data, in particular into tidy formats vs messy formats, and we use OpenRefine to tidy up some datasets. Towards the last part of the workshop, we also look at the Frictionless Data Package format, we create one with the Data Package Creator, and we finally upload our FAIR toy dataset to the sandbox environment of Zenodo, getting a DOI.
  • The other csv files are the datasets used during the workshop:
    • untidy1.csv and untidy2.csv are used for the exercises in OpenRefine
    • scientific-publications-per-million.csv is instead used for the data package creation, which produces the datapackage.json file

The HTML document of this workshop is published on the web at this link.



Files (12.8 MB)

Name Size Download all
1.9 kB Preview Download
12.7 MB Preview Download
105.8 kB Preview Download
133 Bytes Preview Download
140 Bytes Preview Download
62 Bytes Preview Download
94 Bytes Preview Download