Published 2024 | Version v2
Dataset Open

GAPS (Guitar-Aligned Performance Scores) Dataset

Description

UPDATE: A newer version of this dataset is now available at [https://huggingface.co/datasets/xavriley/GAPS](https://huggingface.co/datasets/xavriley/GAPS). It includes audio and fixes a case-sensitivity bug in the Soundslice IDs used to identify files. The Hugging Face version is recommended for future research projects; this version remains available for reference.

This release contains the aligned MIDI transcriptions, scores and downbeats that constitute the GAPS dataset. YouTube URLs for the audio and video are provided in the accompanying metadata file. The abstract of the publication follows.

Abstract:

We introduce GAPS (Guitar-Aligned Performance Scores), a new dataset of classical guitar performances, and a benchmark guitar transcription model that achieves state-of-the-art performance on GuitarSet in both supervised and zero-shot settings. GAPS is the largest dataset of real guitar audio, containing 14 hours of freely available audio-score aligned pairs, recorded in diverse conditions by over 200 performers, together with high-resolution note-level MIDI alignments and performance videos. These enable us to train a state-of-the-art model for automatic transcription of solo guitar recordings which can generalise well to real-world audio that is unseen during training.

Files

Name: gaps_v1_no_audio.zip
Size: 7.0 MB
Checksum: md5:6a12e3ea1c865ae7a0abab95c9ed1b58
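
The MD5 checksum listed for the archive can be used to confirm that a download is complete and uncorrupted. The following is a minimal sketch of one way to do that in Python using only the standard library. The function names `md5_of_file` and `verify_archive`, and the assumption that the archive was saved as `gaps_v1_no_audio.zip` in the current directory, are illustrative and not part of the dataset release.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record for gaps_v1_no_audio.zip.
EXPECTED_MD5 = "6a12e3ea1c865ae7a0abab95c9ed1b58"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local download location (hypothetical; adjust as needed).
    archive = Path("gaps_v1_no_audio.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

If the check reports a mismatch, downloading the archive again is usually the simplest remedy.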

Additional details

Funding

Engineering and Physical Sciences Research Council
UKRI Centre for Doctoral Training in Artificial Intelligence and Music EP/S022694/1