OpenMIC-2018: An Open Data-set for Multiple Instrument Recognition

Eric Humphrey; Simon Durand; Brian McFee

doi:10.5281/zenodo.1492445

Published September 23, 2018 | Version v1

Conference paper Open

OpenMIC-2018: An Open Data-set for Multiple Instrument Recognition

Identification of instruments in polyphonic recordings is a challenging, but fundamental problem in music information retrieval. While there has been significant progress in developing predictive models for this and related classification tasks, we as a community lack a common data-set which is large, freely available, diverse, and representative of naturally occurring recordings. This limits our ability to measure the efficacy of computational models. This article describes the construction of a new, open data-set for multi-instrument recognition. The dataset contains 20,000 examples of Creative Commons-licensed music available on the Free Music Archive. Each example is a 10-second excerpt which has been partially labeled for the presence or absence of 20 instrument classes by annotators on a crowd-sourcing platform. We describe in detail how the instrument taxonomy was constructed, how the dataset was sampled and annotated, and compare its characteristics to similar, previous data-sets. Finally, we present experimental results and baseline model performance to motivate future work.

Files

248_Paper.pdf

Files (752.8 kB)

Name	Size	Download all
248_Paper.pdf md5:f0973cb0d321c2b2df09ab95642896f7	752.8 kB	Preview Download

Views

687

Downloads

Show more details

	All versions	This version
Views	1,358	1,356
Downloads	687	685
Data volume	627.1 MB	625.6 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 19th International Society for Music Information Retrieval Conference, 438-444. Paris, France.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2018) , Paris, France, September 23-27, 2018

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 20, 2018
Modified: August 2, 2024

OpenMIC-2018: An Open Data-set for Multiple Instrument Recognition

Authors/Creators

Description

Files

248_Paper.pdf

Files (752.8 kB)