There is a newer version of this record available.

Other Open Access

dataset: Create interoperable and well-documented data frames

Daniel Antal

See the package documentation website on dataset.dataobservatory.eu.
Report bugs and suggestions on Github: https://github.com/dataobservatory-eu/dataset/issues

The primary aim of dataset is create well-referenced, well-described, interoperable datasets from data.frames, tibbles or data.tables that translate well into the W3C Data Cube Vocabulary based on the statistical SDMX data cube model. Such standard R objects (data.fame, data.table, tibble, or well-structured lists like json) become highly interoperable and can be placed into relational databases, semantic web applications, archives, repositories. They follow the FAIR principles: they are findable, accessible, interoperable and reusable.

Our datasets:

  • Contain Dublin Core or DataCite (or both) metadata that makes the findable and easier accessible via online libraries. See vignette article Datasets With FAIR Metadata.

  • Their dimensions can be easily and unambigously reduced to triples for RDF applications; they can be easily serialized to, or synchronized with semantic web applications. See vignette article From dataset To RDF.

  • Contain processing metadata that greatly enhance the reproducibility of the results, and the reviewability of the contents of the dataset, including metadata defined by the DDI Alliance, which is particularly helpful for not yet processed data;

  • Follow the datacube model of the Statistical Data and Metadata eXchange, therefore allowing easy refreshing with new data from the source of the analytical work, and particularly useful for datasets containing results of statistical operations in R;

  • Correct exporting with FAIR metadata to the most used file formats and straighforward publication to open science repositories with correct bibliographical and use metadata. See Export And Publish a dataset.

  • Relatively lightweight in dependencies and easily works with data.frame, tibble or data.table R objects.

Files (492.4 kB)
Name Size
dataset_0.1.4.tar.gz
md5:b664fb90d3621a3872c57115e44bd3d2
492.4 kB Download
354
12
views
downloads
All versions This version
Views 354106
Downloads 123
Data volume 6.1 MB1.5 MB
Unique views 26480
Unique downloads 123

Share

Cite as