Published September 13, 2024 | Version v1
Dataset Open

SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation

Description

The SynthSOD dataset contains more than 47 hours of multitrack music obtained by synthesizing orchestra and ensemble pieces from the Symbolic Orchestral Database (SOD) using the Spitfire BBC Symphony Orchestra Professional library. To synthesize the MIDI files from the SOD, we had to convert the original files to the General MIDI standard, select a subset of files that met our requirements (e.g., containing only instruments that we could synthesize), and develop a new system to generate musically motivated random annotations for tempo, dynamics, and articulation. The code to replicate this process is available in our repository, and all the details can be found in our paper. We have also published the code to train and evaluate the baseline, as well as the pre-trained models, in a GitHub repository.

We have also published the aligned score information for most of the pieces here.

Technical info

Inside the SynthSOD-data folder there is one folder for each song in the dataset. Each song folder contains a folder called Tree, with the signals synthesized for the Decca Tree (which provides a reasonable stereo mix with the original reverberation of the synthesizer), and a folder called Close Mic, with the signals synthesized for the close mics of the instruments (the driest signals generated by the synthesizer, which can be used as source signals if you want to add custom reverberation). These folders contain the FLAC files of the instruments present in the mix, which should be at least two of the following: Violin_1.flac, Violin_2.flac, Viola.flac, Cello.flac, Bass.flac, Flute.flac, Piccolo.flac, Clarinet.flac, Oboe.flac, coranglais.flac, Bassoon.flac, Horn.flac, Trumpet.flac, Trombone.flac, Tuba.flac, Harp.flac, Timpani.flac, and untunedpercussion.flac.
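The folder layout described above can be walked with a few lines of Python. This is a minimal sketch, not part of the official code: the helper names list_stems and unexpected_stems are hypothetical, and the subfolder names ("Tree", "Close Mic") are assumed to be spelled on disk exactly as in the description.

```python
from pathlib import Path

# Stem names taken from the file list in the dataset description.
STEM_NAMES = [
    "Violin_1", "Violin_2", "Viola", "Cello", "Bass", "Flute", "Piccolo",
    "Clarinet", "Oboe", "coranglais", "Bassoon", "Horn", "Trumpet",
    "Trombone", "Tuba", "Harp", "Timpani", "untunedpercussion",
]


def list_stems(piece_dir, mic="Tree"):
    """Return {instrument name: path} for the FLAC files in piece_dir/mic.

    `mic` is assumed to be "Tree" or "Close Mic", as named in the description.
    """
    mic_dir = Path(piece_dir) / mic
    return {p.stem: p for p in sorted(mic_dir.glob("*.flac"))}


def unexpected_stems(stems):
    """Return the sorted stem names that are not in the documented list."""
    return sorted(name for name in stems if name not in STEM_NAMES)
```

For example, list_stems("SynthSOD-data/<song folder>", mic="Close Mic") would return the dry stems of one song, and unexpected_stems can flag files whose names do not match the documented list.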

The file SynthSOD_metadata_all.json lists the instruments present in the dataset and the activity time of each instrument and of each instrument combination, both for the whole dataset and for every song, as well as the ID of every song in the SOD. The files SynthSOD_metadata_train.json, SynthSOD_metadata_evaluation.json, and SynthSOD_metadata_test.json contain the same information, restricted to the songs in the official train, evaluation, and test partitions of the dataset. Note that the SynthSOD-data folder contains the songs of all partitions without any split, so the partition information is available only in the JSON files. An example PyTorch dataloader for the dataset is available in the repository of the baseline model.
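Since the internal layout of the JSON files is not spelled out here, a cautious first step is to load a partition file and print its key structure before relying on any field. The sketch below does only that; load_metadata and outline are hypothetical helper names, and the location of the JSON files (the root argument) is an assumption.

```python
import json
from pathlib import Path

# File names as given in the dataset description.
PARTITION_FILES = {
    "all": "SynthSOD_metadata_all.json",
    "train": "SynthSOD_metadata_train.json",
    "evaluation": "SynthSOD_metadata_evaluation.json",
    "test": "SynthSOD_metadata_test.json",
}


def load_metadata(root, partition="all"):
    """Load the metadata JSON of one partition from the folder `root`."""
    path = Path(root) / PARTITION_FILES[partition]
    with path.open(encoding="utf-8") as f:
        return json.load(f)


def outline(obj, max_depth=2, _depth=0):
    """Return indented 'key: type' lines for nested dicts, down to max_depth."""
    lines = []
    if isinstance(obj, dict) and _depth < max_depth:
        for key, value in obj.items():
            lines.append("  " * _depth + f"{key}: {type(value).__name__}")
            lines.extend(outline(value, max_depth, _depth + 1))
    return lines
```

Calling print("\n".join(outline(load_metadata(root, "train")))) shows the top two levels of keys, which is enough to locate the per-song entries without guessing the schema.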

The compressed file SynthSOD-sample.zip is a 10-piece subset of the full dataset that can be downloaded to inspect and listen to the data before downloading the full dataset.

Files

SynthSOD.zip

Files (47.0 GB)

Checksum Size
md5:78bd2b2a243d24d5b59b05367953c785 2.0 GB
md5:609b17fadaf6324e6979695252dcec24 45.0 GB
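Given the size of the archives, it may be worth checking a download against the MD5 checksums listed above before extracting it. The following is a minimal sketch using only the Python standard library; md5sum and verify are hypothetical helper names, and because the listing does not pair each checksum with a file name, you need to pass the checksum that corresponds to the archive you downloaded.

```python
import hashlib


def md5sum(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Return True if the file's MD5 matches `expected`.

    `expected` may carry the "md5:" prefix used in the file listing.
    """
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5sum(path) == expected_hex
```

Reading in chunks keeps memory use constant, which matters for a 45 GB file. A call such as verify("SynthSOD.zip", "md5:...") returns False if the download is corrupted or if the wrong checksum was supplied.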

Additional details

Related works

Is described by
Journal: 10.1109/OJSP.2025.3528361 (DOI)
Is supplemented by
Dataset: 10.5281/zenodo.14971533 (DOI)

Funding

European Commission
REPERTORIUM - Researching and Encouraging the Promulgation of European Repertory through Technologies Operating on Records Interrelated Utilising Machines 101095065

References

  • J. Garcia-Martinez, D. Diaz-Guerra, A. Politis, T. Virtanen, J. J. Carabias-Orti and P. Vera-Candeas, "SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation," in IEEE Open Journal of Signal Processing, vol. 6, pp. 129-137, 2025, doi: 10.1109/OJSP.2025.3528361