There is a newer version of the record available.

Published December 8, 2023 | Version v10
Software Open

dataset: Create Data Frames that are Easier to Exchange and Reuse

Authors/Creators

Contributors

Description

See the package documentation website on dataset.dataobservatory.eu.
Report bugs and suggestions on Github: https://github.com/dataobservatory-eu/dataset/issues

The primary aim of dataset is create well-referenced, well-described, interoperable datasets from data.frames, tibbles or data.tables that translate well into the W3C Data Cube Vocabulary based on the statistical SDMX data cube model. Such standard R objects (data.fame, data.table, tibble, or well-structured lists like json) become highly interoperable and can be placed into relational databases, semantic web applications, archives, repositories. They follow the FAIR principles: they are findable, accessible, interoperable and reusable.

Our datasets:

Contain Dublin Core or DataCite (or both) metadata that makes the findable and easier accessible via online libraries. See vignette article Datasets With FAIR Metadata.

Their dimensions can be easily and unambigously reduced to triples for RDF applications; they can be easily serialized to, or synchronized with semantic web applications. See vignette article From dataset To RDF.

Contain processing metadata that greatly enhance the reproducibility of the results, and the reviewability of the contents of the dataset, including metadata defined by the DDI Alliance, which is particularly helpful for not yet processed data;

Follow the datacube model of the Statistical Data and Metadata eXchange, therefore allowing easy refreshing with new data from the source of the analytical work, and particularly useful for datasets containing results of statistical operations in R;

Correct exporting with FAIR metadata to the most used file formats and straighforward publication to open science repositories with correct bibliographical and use metadata. See Export And Publish a dataset.

Relatively lightweight in dependencies and easily works with data.frame, tibble or data.table R objects.

Files

dataset_0.2.7.pdf

Files (726.2 kB)

Name Size Download all
md5:6c68f00bb2ec74a5864806b3bff421ad
279.9 kB Preview Download
md5:e6ec4fda24e395ea1e7af1b83b3c4998
446.3 kB Download

Additional details

Related works

Is documented by
Software documentation: https://zenodo.org/record/6969653#.Yu6BnXZBzIU (URL)
Software documentation: https://dataset.dataobservatory.eu/news/index.html (URL)
Is required by
Software: https://zenodo.org/record/7013164#.YwFmc3ZBzIU (URL)

Funding

European Commission
OpenMusE - OPEN MUSIC EUROPE (OPENMUSE) – AN OPEN, SCALABLE DATA-TO-POLICY PIPELINE FOR EUROPEAN MUSIC ECOSYSTEMS 101095295