Published May 18, 2025 | Version 0.4.4

dataset: Create Data Frames that are Easier to Exchange and Reuse

Description

The dataset package is an extension to the R statistical environment. It aims to ensure that the most important R object that contains a dataset, i.e. a data.frame or an inherited tibble, tsibble, or data.table, carries the metadata needed for the reuse and validation of the dataset contents. The package aims to turn R data frames into datasets that meet strict application criteria, can participate in the Statistical Data and Metadata eXchange (SDMX), or can be sent to Wikidata, Europeana, and various open science repositories.

The current version of the dataset package is maturing. It was peer-reviewed and became part of rOpenSci at version 0.4.0. Version 0.4.1 works better with date and time classes, and allows flattening the semantic information of rich datasets so that they can be placed back into base R or tidyverse pipelines.
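The general idea described above, a data frame that carries its own metadata and can be flattened again for ordinary pipelines, can be sketched in base R alone. This is only an illustration and not the dataset package's API: the attribute names `dataset_meta`, `label`, and `unit`, the helper `flatten_columns()`, and all values are hypothetical.

```r
# Base R only; this is NOT the dataset package API, just the underlying idea.
df <- data.frame(
  geo  = c("AD", "LI", "SM"),
  year = c(2020L, 2020L, 2020L),
  gdp  = c(2355.0, 5430.0, 1265.0)   # made-up illustrative values
)

# Dataset-level metadata (hypothetical field names and values).
attr(df, "dataset_meta") <- list(
  title   = "Illustrative GDP table",
  creator = "Jane Doe",
  issued  = as.Date("2025-01-01")
)

# Variable-level metadata: a human-readable label and a unit on one column.
attr(df$gdp, "label") <- "Gross domestic product"
attr(df$gdp, "unit")  <- "million EUR"

# Flatten: strip the column-level attributes so that downstream code
# sees ordinary vectors again.
flatten_columns <- function(x) {
  x[] <- lapply(x, function(col) {
    attr(col, "label") <- NULL
    attr(col, "unit")  <- NULL
    col
  })
  x
}

flat <- flatten_columns(df)
```

Here `df` keeps its descriptive attributes while `flat` holds the same values as plain columns. The package itself provides dedicated classes and constructors for this purpose; consult its reference manual for the actual function names.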

Files

dataset_0.4.4.pdf

Files (1.2 MB)

Name Size
md5:d8d706706bf3bccba4ca6d12510c1a56
476.5 kB
md5:e01ed9bac13b9b820193864e0bc9b4b9
764.2 kB

Additional details

Funding

European Commission
OpenMusE - OPEN MUSIC EUROPE (OPENMUSE) – AN OPEN, SCALABLE DATA-TO-POLICY PIPELINE FOR EUROPEAN MUSIC ECOSYSTEMS 101095295

Dates

Updated
2024-12-16
0.3.3008
Issued
2024-12-23
0.3.4 CRAN release
Issued
2025-08-26
0.4.0 CRAN & rOpenSci release
Issued
2025-11-16
0.4.1 CRAN release
Issued
2026-05-18
0.4.4 CRAN release

Software

Repository URL
https://github.com/dataobservatory-eu/dataset/
Programming language
R
Development Status
Active