Dataset Open Access

BabySlakh

Manilow, Ethan; Wichern, Gordon; Seetharaman, Prem; Le Roux, Jonathan

Introduction:

BabySlakh is a small subset of Slakh2100 (also hosted on Zenodo) intended for debugging and prototyping. It consists of the first 20 tracks of Slakh2100 (Track00001 through Track00020, inclusive). All audio is in WAV format with a sample rate of 16 kHz. Once the archive is unzipped, BabySlakh is ready to use.
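As a quick sanity check after unpacking, the track range and sample rate described above can be verified with a short script. This is only a sketch: the function names are illustrative, and it assumes (this page does not confirm it) that each track is a directory directly under the extraction root and that the WAV files are PCM-encoded, which is what Python's standard `wave` module can read.

```python
import wave
from pathlib import Path

EXPECTED_SAMPLE_RATE = 16_000  # 16 kHz, as stated in the introduction


def expected_track_names():
    """Return Track00001 .. Track00020, the 20 tracks BabySlakh contains."""
    return [f"Track{i:05d}" for i in range(1, 21)]


def check_babyslakh(root):
    """Report missing track folders and WAV files with an unexpected rate.

    Assumption (hypothetical layout): every track is a directory directly
    under `root`, and its WAV files are PCM so that `wave` can open them.
    """
    root = Path(root)
    missing = [name for name in expected_track_names() if not (root / name).is_dir()]
    wrong_rate = []
    for wav_path in sorted(root.rglob("*.wav")):
        with wave.open(str(wav_path), "rb") as wav_file:
            if wav_file.getframerate() != EXPECTED_SAMPLE_RATE:
                wrong_rate.append(wav_path)
    return missing, wrong_rate
```

Calling `check_babyslakh` on the folder the archive was unpacked into should return two empty lists if everything is in place. If the files use a WAV encoding that `wave` does not support, a third-party audio reader would be needed instead.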

 

About Slakh:

The Synthesized Lakh (Slakh) Dataset is a dataset of multi-track audio and aligned MIDI for music source separation and multi-instrument automatic transcription. Individual MIDI tracks are synthesized from the Lakh MIDI Dataset v0.1 using professional-grade sample-based virtual instruments, and the resulting audio is mixed together to make musical mixtures. The full release of Slakh, called Slakh2100, contains 2100 automatically mixed tracks and accompanying, aligned MIDI files, synthesized from 187 patches categorized into 34 classes.

 

Helpful Links:

For more information, see www.slakh.com.

Support code for Slakh: Available here.

Code to render Slakh data: Available in this repo.

See the dataset at a glance, and info about metadata.yaml.

 

Citing Slakh:

If you use BabySlakh or generate data using the same method, we ask that you cite it using the following BibTeX entry:

@inproceedings{manilow2019cutting,
  title={Cutting Music Source Separation Some {Slakh}: A Dataset to Study the Impact of Training Data Quality and Quantity},
  author={Manilow, Ethan and Wichern, Gordon and Seetharaman, Prem and Le Roux, Jonathan},
  booktitle={Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
  year={2019},
  organization={IEEE}
}

 

Files (882.8 MB)
Name: babyslakh_16k.tar.gz
Size: 882.8 MB
md5: 311096dc2bde7d61c97e930edbfc7f78
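The md5 checksum listed above can be used to confirm that the download is intact before unpacking. The following is a minimal sketch using only the Python standard library; the function names and the paths passed to them are illustrative assumptions, not part of the official Slakh support code.

```python
import hashlib
import tarfile
from pathlib import Path

# Checksum published in the file listing for babyslakh_16k.tar.gz.
EXPECTED_MD5 = "311096dc2bde7d61c97e930edbfc7f78"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_and_extract(archive_path, dest_dir, expected_md5=EXPECTED_MD5):
    """Check the archive against the expected md5, then unpack it.

    Raises ValueError on a checksum mismatch so a corrupted download is
    never extracted. Only use this on archives from a trusted source.
    """
    actual = md5_of_file(archive_path)
    if actual != expected_md5:
        raise ValueError(f"md5 mismatch: expected {expected_md5}, got {actual}")
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        archive.extractall(dest_dir)
    return dest_dir
```

For example, `verify_and_extract("babyslakh_16k.tar.gz", "data")` would unpack the archive into a hypothetical `data` directory only if the checksum matches.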
