Dealing with FAIR data

Masuzzo, Paola

doi:10.5281/zenodo.8340333

Published September 13, 2023 | Version v1

Lesson Open

Dealing with FAIR data

Masuzzo, Paola¹

1. IGDORE

This is the material for a workshop I gave at the University of Maribor Open Science Summer School 2023.

The lecture was meant for a very diverse class of students (from Bachelor to PhD degrees), and is a broad introduction to FAIR data, with a series of hands-on exercises on the FAIR principles.

The Dealing with FAIR data PDF file is the backbone of the lecture
- it starts with an introduction: a broad recap of research data, the definition of open data, the research data lifecycle, a list of terminologies useful to follow along, and a brief mention of the FAIR principles (things the students had seen the day before)
- the second part is about the FAIR principles in action with hands-on exercises for each of the four letters: persistent identifiers, APIs, machine-readable formats, licenses, etc.
- the third part is about the process of FAIRification of a dataset: we look at tabular data, in particular into tidy formats vs messy formats, and we use OpenRefine to tidy up some datasets. Towards the last part of the workshop, we also look at the Frictionless Data Package format, we create one with the Data Package Creator, and we finally upload our FAIR toy dataset to the sandbox environment of Zenodo, getting a DOI.
The other csv files are the datasets used during the workshop:
- untidy1.csv and untidy2.csv are used for the exercises in OpenRefine
- scientific-publications-per-million.csv is instead used for the data package creation, which produces the datapackage.json file

The HTML document of this workshop is published on the web at this link.

Files

datapackage.json

Files (12.8 MB)

Name	Size	Download all
datapackage.json md5:63635ec1dbf8af354e5f06e1cbf10ad8	1.9 kB	Preview Download
Dealing with FAIR data.pdf md5:fb5301830f073394dc28b3394fa074b6	12.7 MB	Preview Download
scientific-publications-per-million.csv md5:88ae98b3b816791004c7a0bd7929ab8e	105.8 kB	Preview Download
untidy1.csv md5:320a423a5fd65b66387e8c2536231f38	133 Bytes	Preview Download
untidy2.csv md5:6365d6f508501fc97024e79235d6bb0d	140 Bytes	Preview Download
untidy2_hh.csv md5:cec8d6c75604776f59f26f6a8d6cc9ab	62 Bytes	Preview Download
untidy2_hhm.csv md5:e14dd31353028f9438d2e8f7f5e37973	94 Bytes	Preview Download

	All versions	This version
Views	493	486
Downloads	520	513
Data volume	4.4 GB	4.3 GB

Dealing with FAIR data

Creators

Description

Files

datapackage.json

Files (12.8 MB)