Published September 28, 2022 | Version v3
Journal article Open

music semantic understanding benchmark

Creators

  • 1. Doh

Description

# Music Semantic Dataset Preprocessor

Dataset preprocessor for music semantic understanding

- github : https://github.com/SeungHeonDoh/msu-benchmark

The selection criteria are as follows: if a dataset has 1) commercial music for retrieval, 2) publicly assessed (at least upon request) and 3) categorical single or multi-label annotations for supporting text-based retrieval scenarios. We summarize all the datasets and tasks in Table. MagnaTagATune (MTAT) consists of 25k music clips from 5,223 unique songs. Following a previous work, we use their published splits and top~50 tags. We do not compare result with previous works using different split. MTG-Jamendo (MTG) contains 55,094 full audio tracks with 183 tags about genre, instrument, and mood/theme. We use the official splits (split-0) in each category for tagging, genre, instrument, and mood/theme tasks. For single-label genre classification, we use the fault-filtered version of GTZAN (GZ) and the `small' version of Free Music Archive (FMA-Small). For the vocal attribute recognition task, we use K-pop Vocal Tag (KVT) dataset. It consists of 6,787 vocal segments from K-pop music tracks. All the segments are annotated with 42 semantic tags describing various vocal style including pitch range, timbre, playing techniques, and gender. For the categorical mood recognition task, we use Emotify dataset. It consists of 400 excerpts in 4 genres with 9 emotional categories.

### Why we made this repo

There are now too many datasets and too many data splits. Because of this, if you are using multi-datasets, creating a loader will cost you a lot of time. To solve this, we propose a preprocessor for making `KV Style (key-values) annotation` file, `track split` file, and `resampler`. This will help the re-implementation of the research.

### Reference

 

@inproceedings{toward2023doh,
  title={Toward Universal Text-to-Music Retrieval},
  author={SeungHeon Doh, Minz Won, Keunwoo Choi, Juhan Nam},
  booktitle = {},
  year={2023}
}

Files

Files (42.9 MB)

Name Size Download all
md5:25906248a2d6a38ad736db016c71569f
42.9 MB Download