Dataset Open Access

Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles

Ondřej Cífka; Umut Şimşekli; Gaël Richard


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Ondřej Cífka</dc:creator>
  <dc:creator>Umut Şimşekli</dc:creator>
  <dc:creator>Gaël Richard</dc:creator>
  <dc:date>2020-08-29</dc:date>
  <dc:description>The Groove2Groove MIDI Dataset is a parallel corpus of synthetic MIDI accompaniments in almost 3000 different styles, created as described in the paper Groove2Groove: One-Shot Accompaniment Style Transfer with Supervision from Synthetic Data [pdf]. See the README.md file or the Groove2Groove website for more information.

The dataset is split into the following sections:


	train contains 5744 MIDI files in 2872 styles (exactly 2 files per style). Each file contains 252 measures following a 2 measure count-in.
	val and test each contain 1200 files in 40 styles (exactly 30 files per style, 16 bars per file after the count-in). The sets of styles are disjoint from each other and from those in train.
	itest is generated from the same chord charts as test, but in 40 styles from the training set.


Chord charts for all MIDI files are provided in the ABC format and the Band-in-a-Box (MGU) format. Each chord chart corresponds to at least 2 MIDI files in different styles.

The code used to automate Band-in-a-Box is available in the pybiab package.

If you use the data in your research, please reference the paper (not just the Zenodo record):

@article{groove2groove,
  author={Ond\v{r}ej C\'{i}fka and Umut \c{S}im\c{s}ekli and Ga\"{e}l Richard},
  title={{Groove2Groove}: One-Shot Music Style Transfer with Supervision from Synthetic Data},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  publisher={IEEE},
  year={2020},
  volume={28},
  pages={2638--2650},
  doi={10.1109/TASLP.2020.3019642},
  url={https://doi.org/10.1109/TASLP.2020.3019642}
}</dc:description>
  <dc:identifier>https://zenodo.org/record/3958000</dc:identifier>
  <dc:identifier>10.5281/zenodo.3958000</dc:identifier>
  <dc:identifier>oai:zenodo.org:3958000</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>info:eu-repo/grantAgreement/EC/H2020/765068/</dc:relation>
  <dc:relation>doi:10.1109/TASLP.2020.3019642</dc:relation>
  <dc:relation>doi:10.5281/zenodo.3957999</dc:relation>
  <dc:relation>url:https://zenodo.org/communities/ieee</dc:relation>
  <dc:relation>url:https://zenodo.org/communities/ismir</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by-nc/4.0/legalcode</dc:rights>
  <dc:subject>musical styles</dc:subject>
  <dc:subject>parallel corpus</dc:subject>
  <dc:subject>music</dc:subject>
  <dc:subject>MIDI</dc:subject>
  <dc:subject>accompaniments</dc:subject>
  <dc:subject>chord charts</dc:subject>
  <dc:title>Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
471
150
views
downloads
All versions This version
Views 471471
Downloads 150150
Data volume 35.4 GB35.4 GB
Unique views 394394
Unique downloads 138138

Share

Cite as