Divide and Remaster (DnR)

Petermann, Darius; Wichern, Gordon; Wang, Zhong-Qiu; Le Roux, Jonathan

doi:10.5281/zenodo.5574713

Published October 17, 2021 | Version 1.0

Dataset Open

Divide and Remaster (DnR)

1. Indiana University, Department of Intelligent Systems Engineering
2. Mitsubishi Electric Research Laboratories

Introduction:

Divide and Remaster (DnR) is a source separation dataset for training and testing algorithms that separate a monaural audio signal into speech, music, and sound effects/background stems. The dataset is composed of artificial mixtures using audio from the librispeech, free music archive (FMA), and Freesound Dataset 50k (FSD50k). We introduce it as part of the Cocktail Fork Problem paper.

At a Glance:

The size of the unzipped dataset is ~174GB
Each mixture is 60 seconds long and sources are not fully overlapped
Audio is encoded as 16-bit .wav files at a sampling rate of 44.1 kHz
The data is split into training tr (3295 mixtues), validation cv (440 mixtures) and testing tt (652 mixtures) subsets
The directory for each mixture contains four .wav files, mix.wav, music.wav, speech.wav, sfx.wav, and annots.csv which contains the metadata for the original audio used to compose the mixture (transcriptions for speech, sound classes for sfx, and genre labels for music)

Other Resources:

Demo examples and additional information are available at: https://cocktail-fork.github.io/

For more details about the data generation process, the code used to generate our dataset can be found at the following: https://github.com/darius522/dnr-utils

Contact and Support:

Have an issue, concern, or question about DnR ? If so, please open an issue here.

For any other inquiries, feel free to shoot an email at: firstname.lastname@gmail.com, my name is Darius Petermann ;)

Citation:

If you use DnR please cite [our paper](https://arxiv.org/abs/2110.09958) in which we introduce the dataset as part of the Cocktail Fork Problem:

@article{Petermann2021cocktail,
    title={The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks}, 
    author={Darius Petermann and Gordon Wichern and Zhong-Qiu Wang and Jonathan {Le Roux}},
    year={2021},
    journal={arXiv preprint arXiv:2110.09958},
    archivePrefix={arXiv},
    primaryClass={eess.AS}
}

Files

Files (106.1 GB)

Name	Size	Download all
dnr.tar.gz md5:ee33e4bc4cb76b1c17e26f4fee377667	106.1 GB	Download

	All versions	This version
Views	8,804	5,442
Downloads	6,347	824
Data volume	642.1 TB	158.1 TB

Divide and Remaster (DnR)

Creators

Description

Files

Files (106.1 GB)