Planned intervention: On Thursday March 28th 07:00 UTC Zenodo will be unavailable for up to 5 minutes to perform a database upgrade.

There is a newer version of the record available.

Published October 20, 2019 | Version v1
Dataset Open

BabySlakh

  • 1. Northwestern University
  • 2. Mitsubishi Electric Research Labs

Description

Introduction

BabySlakh is a tiny version of Slakh2100 (zenodo link) that is useful for debugging. It consists of the first 20 tracks of Slakh2100 (i.e., Track00001 through Track00020). All of the audio is in the wav format and has a sample rate of 16 kHz. BabySlakh is ready to go once it's unzipped.

 

About Slakh

The Synthesized Lakh (Slakh) Dataset is a dataset of multi-track audio and aligned MIDI for music source separation and multi-instrument automatic transcription. Individual MIDI tracks are synthesized from the Lakh MIDI Dataset v0.1 using professional-grade sample-based virtual instruments, and the resulting audio is mixed together to make musical mixtures. This release of Slakh, called Slakh2100, contains 2100 automatically mixed tracks and accompanying, aligned MIDI files, synthesized from 187 patches categorized into 34 classes.

 

Citing Slakh

If you use Slakh2100 or generate data using the same method we ask that you cite it using the following bibtex entry:

@inproceedings{manilow2019cutting,
  title={Cutting Music Source Separation Some {Slakh}: A Dataset to Study the Impact of Training Data Quality and Quantity},
  author={Manilow, Ethan and Wichern, Gordon and Seetharaman, Prem and Le Roux, Jonathan},
  booktitle={Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
  year={2019},
  organization={IEEE}
}

 

Files

babyslakh_16k.zip

Files (882.9 MB)

Name Size Download all
md5:ea1797fc57689a0e33c759c17a2292f5
882.9 MB Preview Download