JazzICat Dataset
Creators
- 1. Athena Research Center
- 2. National and Kapodistrian University of Athens
- 3. University of Piraeus
Description
Description of the provided files:
1. databrick_aggr_nflat_non_augm.tar.gz
Derived from the initial dataset ( https://github.com/wayne391/lead-sheet-dataset ), our file above, stores all the information of the initial dataset -- beat information, melody / accompaniment chord channels and compact chord information ( 275 features ), as a numpy array of shape ( features, time steps ).
2. samples.tar.gz
This is our final dataset, including all the pieces of length less or equal to 145 time steps, in their original pitch, first and then transposed each one of the rest of the 12 pitches. Each of those parts (pieces) with num N, stored in a separate folder ( samples/partN/partN.npz ), in ".npz" file format, with time resolution of 1/2 of a quarter note, beat information, flattened melody and accompaniment channels, compact representation of chord information ( 21 features ), enriched accompaniment channel, as a numpy array with shape ( features, time steps ).
3. parts_harmony_chord2int_dict.pkl
The dictionary of all the unique accompaniment chords in the training set, which encodes the 128-vector of each chord to its class number.
4. parts_harmony_int2chord_dict.pkl
The decoding dictionary of the accompaniment channel, which decodes a chord class number to the 128-vector representation of the chord.
Files
Files
(21.5 MB)
Name | Size | Download all |
---|---|---|
md5:76ce6d6f242d776b0a5f3b1ebd212fbf
|
2.2 MB | Download |
md5:25cff72fa068c08143c3cb76137c3532
|
1.7 MB | Download |
md5:a1a1f191ab3f681709ce83ed40b9869f
|
3.1 MB | Download |
md5:5ea95a1bc7c45e3b56ebd3b9ebb1b79b
|
14.4 MB | Download |