dataset: Create interoperable and well-documented data frames

Daniel Antal

doi:10.5281/zenodo.6969683

Published August 6, 2022 | Version 0.1.6

Software Open

dataset: Create interoperable and well-documented data frames

Daniel Antal

See the package documentation website on dataset.dataobservatory.eu.
Report bugs and suggestions on Github: https://github.com/dataobservatory-eu/dataset/issues

The primary aim of dataset is create well-referenced, well-described, interoperable datasets from data.frames, tibbles or data.tables that translate well into the W3C Data Cube Vocabulary based on the statistical SDMX data cube model. Such standard R objects (data.fame, data.table, tibble, or well-structured lists like json) become highly interoperable and can be placed into relational databases, semantic web applications, archives, repositories. They follow the FAIR principles: they are findable, accessible, interoperable and reusable.

Our datasets:

Contain Dublin Core or DataCite (or both) metadata that makes the findable and easier accessible via online libraries. See vignette article Datasets With FAIR Metadata.
Their dimensions can be easily and unambigously reduced to triples for RDF applications; they can be easily serialized to, or synchronized with semantic web applications. See vignette article From dataset To RDF.
Contain processing metadata that greatly enhance the reproducibility of the results, and the reviewability of the contents of the dataset, including metadata defined by the DDI Alliance, which is particularly helpful for not yet processed data;
Follow the datacube model of the Statistical Data and Metadata eXchange, therefore allowing easy refreshing with new data from the source of the analytical work, and particularly useful for datasets containing results of statistical operations in R;
Correct exporting with FAIR metadata to the most used file formats and straighforward publication to open science repositories with correct bibliographical and use metadata. See Export And Publish a dataset.
Relatively lightweight in dependencies and easily works with data.frame, tibble or data.table R objects.

Files

Files (505.0 kB)

Name	Size	Download all
dataset_0.1.6.tar.gz md5:7262a399211238b8be9b0deba3dd1204	505.0 kB	Download

Additional details

Is documented by: Software documentation: https://zenodo.org/record/6969653#.Yu6BnXZBzIU (URL)

R (Computer program language): https://id.loc.gov/authorities/subjects/sh2002004407.html
Data sets: http://id.loc.gov/authorities/subjects/sh2018002256
Metadata: https://id.loc.gov/authorities/subjects/sh96000740.html
Semantic web: https://id.loc.gov/authorities/subjects/sh2002000569.html

	All versions	This version
Views	4,069	436
Downloads	1,393	29
Data volume	645.4 MB	14.6 MB

Files (505.0 kB)

Related works

Subjects

dataset: Create interoperable and well-documented data frames

Authors/Creators

Description

Files

Files (505.0 kB)

Additional details

Related works

Subjects