Published October 30, 2019 | Version v1
Dataset Open

JazzICat Dataset

  • 1. Athena Research Center
  • 2. National and Kapodistrian University of Athens
  • 3. University of Piraeus

Description

Description of the provided files:

1. databrick_aggr_nflat_non_augm.tar.gz
Derived from the initial dataset ( https://github.com/wayne391/lead-sheet-dataset ), this file stores all the information of the initial dataset: beat information, melody and accompaniment chord channels, and compact chord information ( 275 features ), as a numpy array of shape ( features, time steps ).
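As a minimal sketch of how the array might be read, the snippet below scans a ".tar.gz" archive for ".npy" members and loads each one in memory. The internal layout of the archive is not documented here, so the assumption that it contains plain ".npy" files is ours, and the helper name load_arrays_from_archive is hypothetical.

```python
import io
import tarfile

import numpy as np


def load_arrays_from_archive(archive_path):
    """Return {member_name: ndarray} for every .npy file inside a .tar.gz.

    Assumption: the arrays are stored as plain .npy members. If the archive
    uses another format, list its members first and adapt the loader.
    """
    arrays = {}
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar.getmembers():
            if member.isfile() and member.name.endswith(".npy"):
                # Read the member into memory instead of extracting to disk.
                buffer = io.BytesIO(tar.extractfile(member).read())
                arrays[member.name] = np.load(buffer)
    return arrays
```

For the first file, each loaded array would be expected to have shape ( features, time steps ), so array.shape[0] gives the feature count and array.shape[1] the number of time steps.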

2. samples.tar.gz
This is our final dataset. It includes all the pieces of length less than or equal to 145 time steps, first in their original pitch and then transposed to each of the rest of the 12 pitches. Each of those parts (pieces), numbered N, is stored in a separate folder ( samples/partN/partN.npz ), in ".npz" file format, with a time resolution of 1/2 of a quarter note. Each file holds beat information, flattened melody and accompaniment channels, a compact representation of chord information ( 21 features ), and an enriched accompaniment channel, as a numpy array with shape ( features, time steps ).
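A single part could be inspected along the following lines. This is only a sketch: the array names stored inside each ".npz" file are not listed in this description, so the code enumerates whatever names are present rather than assuming any. The function names, and the reading of "partN" as "part" followed by the number N, are our assumptions.

```python
from pathlib import Path

import numpy as np

MAX_TIME_STEPS = 145  # maximum piece length stated in the description


def describe_sample(samples_root, part_number):
    """Return {array_name: shape} for samples_root/partN/partN.npz."""
    name = f"part{part_number}"
    path = Path(samples_root) / name / f"{name}.npz"
    with np.load(path) as archive:
        return {key: archive[key].shape for key in archive.files}


def steps_to_quarter_notes(time_steps):
    """Convert time steps to quarter notes (one step is 1/2 of a quarter note)."""
    return time_steps / 2
```

With this resolution, the longest pieces in the dataset ( 145 time steps ) span 72.5 quarter notes.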

3. parts_harmony_chord2int_dict.pkl
The encoding dictionary of all the unique accompaniment chords in the training set, which maps the 128-vector of each chord to its class number.

4. parts_harmony_int2chord_dict.pkl
The decoding dictionary of the accompaniment channel, which decodes a chord class number to the 128-vector representation of the chord.
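The relationship between the two dictionaries can be illustrated with a toy example. The sketch below builds an encoding and a decoding dictionary from a few 128-vectors and checks that they invert each other. The key type used in the published ".pkl" files is not stated here; using a tuple of integers as the key is our assumption, and build_chord_dicts is a hypothetical helper, not the authors' code.

```python
import pickle

import numpy as np


def build_chord_dicts(chord_vectors):
    """Build (chord2int, int2chord) from an iterable of 128-element vectors.

    Assumption: each chord vector is keyed as a tuple of ints so that it is
    hashable. Class numbers are assigned in order of first appearance.
    """
    chord2int = {}
    for vector in chord_vectors:
        key = tuple(int(v) for v in vector)
        if key not in chord2int:
            chord2int[key] = len(chord2int)
    int2chord = {
        index: np.array(key, dtype=np.int8) for key, index in chord2int.items()
    }
    return chord2int, int2chord


# Toy chord: a C major triad marked at positions 60, 64 and 67 of the vector.
c_major = np.zeros(128, dtype=np.int8)
c_major[[60, 64, 67]] = 1

chord2int, int2chord = build_chord_dicts([c_major])
class_number = chord2int[tuple(int(v) for v in c_major)]
assert np.array_equal(int2chord[class_number], c_major)
```

The published dictionaries would typically be read with pickle.load on a file opened in binary mode; inspecting the type of the first key after loading shows how the 128-vectors are actually stored.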

Files

Files (21.5 MB)

Checksum | Size
md5:76ce6d6f242d776b0a5f3b1ebd212fbf | 2.2 MB
md5:25cff72fa068c08143c3cb76137c3532 | 1.7 MB
md5:a1a1f191ab3f681709ce83ed40b9869f | 3.1 MB
md5:5ea95a1bc7c45e3b56ebd3b9ebb1b79b | 14.4 MB
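
After downloading, a file can be checked against the md5 checksums listed above. The following is a minimal sketch using Python's standard hashlib module; the function names are ours, and which checksum belongs to which file name is not recoverable from this listing, so the expected value has to be taken from the record itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

A call such as matches_checksum("samples.tar.gz", "md5:...") returns True only when the local copy is byte-identical to the published file.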