Dataset Open Access

Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles

Ondřej Cífka; Umut Şimşekli; Gaël Richard


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.3958000</identifier>
  <creators>
    <creator>
      <creatorName>Ondřej Cífka</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6268-6445</nameIdentifier>
      <affiliation>LTCI, Télécom Paris, Institut Polytechnique de Paris</affiliation>
    </creator>
    <creator>
      <creatorName>Umut Şimşekli</creatorName>
      <affiliation>LTCI, Télécom Paris, Institut Polytechnique de Paris</affiliation>
    </creator>
    <creator>
      <creatorName>Gaël Richard</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-4960-0010</nameIdentifier>
      <affiliation>LTCI, Télécom Paris, Institut Polytechnique de Paris</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2020</publicationYear>
  <subjects>
    <subject>musical styles</subject>
    <subject>parallel corpus</subject>
    <subject>music</subject>
    <subject>MIDI</subject>
    <subject>accompaniments</subject>
    <subject>chord charts</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2020-08-29</date>
  </dates>
  <language>en</language>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/3958000</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementTo" resourceTypeGeneral="JournalArticle">10.1109/TASLP.2020.3019642</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.3957999</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/ieee</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/ismir</relatedIdentifier>
  </relatedIdentifiers>
  <version>1.0.0</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-nc/4.0/legalcode">Creative Commons Attribution Non Commercial 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;The&amp;nbsp;&lt;em&gt;Groove2Groove MIDI Dataset&lt;/em&gt;&amp;nbsp;is a parallel corpus of synthetic MIDI accompaniments in almost 3000 different styles,&amp;nbsp;created as described in the paper&amp;nbsp;&lt;em&gt;&lt;a href="https://doi.org/10.1109/TASLP.2020.3019642"&gt;Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data&lt;/a&gt;&lt;/em&gt;&amp;nbsp;[&lt;a href="https://groove2groove.telecom-paris.fr/data/paper.pdf"&gt;pdf&lt;/a&gt;]. See the &lt;code&gt;README.md&lt;/code&gt; file or the&amp;nbsp;&lt;em&gt;&lt;a href="https://groove2groove.telecom-paris.fr/#Dataset"&gt;Groove2Groove website&lt;/a&gt;&lt;/em&gt; for more information.&lt;/p&gt;

&lt;p&gt;The dataset is split into the following sections:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;code&gt;train&lt;/code&gt;&amp;nbsp;contains 5744 MIDI files in 2872 styles (exactly 2 files per style). Each file contains 252 measures&amp;nbsp;following a 2-measure count-in.&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;val&lt;/code&gt;&amp;nbsp;and&amp;nbsp;&lt;code&gt;test&lt;/code&gt;&amp;nbsp;each contain 1200 files in 40 styles (exactly 30 files per style, 16 measures per file after the count-in). The sets of styles are disjoint from each other and from those in&amp;nbsp;&lt;code&gt;train&lt;/code&gt;.&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;itest&lt;/code&gt;&amp;nbsp;is generated from the same chord charts as&amp;nbsp;&lt;code&gt;test&lt;/code&gt;, but in 40 styles from the training set.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Chord charts for all MIDI files are provided in the ABC format&amp;nbsp;and the Band-in-a-Box (MGU) format. Each chord chart corresponds to at least 2 MIDI files in different styles.&lt;/p&gt;

&lt;p&gt;The code used to automate Band-in-a-Box is available in the &lt;a href="https://github.com/cifkao/pybiab"&gt;pybiab&lt;/a&gt; package.&lt;/p&gt;

&lt;p&gt;If you use the data in your research, please reference the paper (not just&amp;nbsp;the Zenodo record):&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;@article{groove2groove,
  author={Ond\v{r}ej C\'{i}fka and Umut \c{S}im\c{s}ekli and Ga\"{e}l Richard},
  title={{Groove2Groove}: One-Shot Music Style Transfer with Supervision from Synthetic Data},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  publisher={IEEE},
  year={2020},
  volume={28},
  pages={2638--2650},
  doi={10.1109/TASLP.2020.3019642},
  url={https://doi.org/10.1109/TASLP.2020.3019642}
}&lt;/code&gt;&lt;/pre&gt;</description>
  </descriptions>
  <fundingReferences>
    <fundingReference>
      <funderName>European Commission</funderName>
      <funderIdentifier funderIdentifierType="Crossref Funder ID">10.13039/501100000780</funderIdentifier>
      <awardNumber awardURI="info:eu-repo/grantAgreement/EC/H2020/765068/">765068</awardNumber>
      <awardTitle>New Frontiers in Music Information Processing</awardTitle>
    </fundingReference>
  </fundingReferences>
</resource>
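
The export above can be read programmatically. The following is a minimal sketch, not part of the Zenodo record, of how a few fields might be pulled out of a DataCite kernel-4 document with Python's standard library. The function name summarize_datacite and the trimmed SAMPLE string are illustrative assumptions; only the namespace URI, element names, and values are taken from the record itself.

```python
import xml.etree.ElementTree as ET

# DataCite kernel-4 namespace, as declared on the <resource> element.
NS = {"dc": "http://datacite.org/schema/kernel-4"}

# Trimmed copy of the record, kept short for illustration (assumption:
# a real script would read the full export from a file instead).
SAMPLE = """<resource xmlns="http://datacite.org/schema/kernel-4">
  <identifier identifierType="DOI">10.5281/zenodo.3958000</identifier>
  <creators>
    <creator><creatorName>Ondřej Cífka</creatorName></creator>
    <creator><creatorName>Umut Şimşekli</creatorName></creator>
    <creator><creatorName>Gaël Richard</creatorName></creator>
  </creators>
  <titles>
    <title>Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles</title>
  </titles>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementTo">10.1109/TASLP.2020.3019642</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.3957999</relatedIdentifier>
  </relatedIdentifiers>
</resource>"""


def summarize_datacite(xml_text):
    """Return the DOI, title, creator names, and related identifiers.

    Hypothetical helper: every lookup is namespace-qualified because
    DataCite elements live in the kernel-4 default namespace.
    """
    root = ET.fromstring(xml_text)
    return {
        "doi": root.findtext("dc:identifier[@identifierType='DOI']", namespaces=NS),
        "title": root.findtext("dc:titles/dc:title", namespaces=NS),
        "creators": [
            el.text for el in root.iterfind("dc:creators/dc:creator/dc:creatorName", NS)
        ],
        # A list of (relationType, value) pairs, since a relation type
        # such as IsPartOf can occur more than once in a record.
        "related": [
            (el.get("relationType"), el.text)
            for el in root.iterfind("dc:relatedIdentifiers/dc:relatedIdentifier", NS)
        ],
    }


info = summarize_datacite(SAMPLE)
print(info["doi"])
print("; ".join(info["creators"]))
```

Looking up elements without the namespace prefix would return nothing here, which is a common source of confusion when parsing DataCite exports.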