Published November 16, 2025 | Version 0.4.1
Software Open

dataset: Create Data Frames that are Easier to Exchange and Reuse

Description

The dataset package extension to the R statistical environment aims to ensure that the most important R object that contains a dataset, i.e. a data.frame or an inherited tibbletsibble or data.table contains important metadata for the reuse and validation of the dataset contents. The aim of dataset is to produce to turn R data frames into datasets that meet strict application criteria, can participate in the Statistical Data and Metadata eXchange, or send data to Wikidata, Europeana, and various open science repositories.

The current version of the dataset package is matureing. It was peer-reviewed and became part of rOpenSci.  at version 0.4.0.  The 0.4.1 version works better with data and time classes, and allows flatting the semantic informatin of the rich datasets to place them back to base R or tidyverse pipelines.

Files

dataset_0.4.1.pdf

Files (1.2 MB)

Name Size Download all
md5:6dc1a212845af0535713161e71ba29ed
478.6 kB Preview Download
md5:548a0d6cf6a3cf6abaa792ca76943f34
764.3 kB Download

Additional details

Funding

European Commission
OpenMusE - OPEN MUSIC EUROPE (OPENMUSE) – AN OPEN, SCALABLE DATA-TO-POLICY PIPELINE FOR EUROPEAN MUSIC ECOSYSTEMS 101095295

Dates

Updated
2024-12-16
0.3.3008
Issued
2024-12-23
0.3.4 CRAN release
Issued
2025-08-26
0.4.0 CRAN & rOpenSci release
Issued
2025-11-16
0.4.1 CRAN release

Software

Repository URL
https://github.com/dataobservatory-eu/dataset/
Programming language
R
Development Status
Active