There is a newer version of the record available.

Published July 27, 2021 | Version 2.0
Dataset Open

Schubert Winterreise Dataset

  • 1. International Audio Laboratories Erlangen
  • 2. Utrecht University
  • 3. Blue Square Group e.V., Bonn

Description

The Schubert Winterreise Dataset (SWD) is a multimodal dataset comprising various representations and annotations of Franz Schubert's 24-song cycle Winterreise. The primary material (raw data) consists of textual representations of the songs' lyrics, music scores in image, symbolic, and MIDI format, as well as nine audio recordings of performances (only two of which are included, due to copyright restrictions). The secondary material (annotations) comprises information on musical measure positions in sheet music images and audio recordings as well as analyses of chords, local keys, global keys, and structural parts.

UPDATE: Version 2.0 contains additional note annotations comprising pitch, pitch class (chroma), and instrument labels for each note event (generated semi-automatically).
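
As a brief illustration of how pitch relates to pitch class (chroma), the sketch below maps a pitch to one of twelve classes. It assumes pitches are given as MIDI note numbers (60 = middle C); this record does not specify how the annotation files encode pitch, and the names `pitch_class` and `NOTE_NAMES` are hypothetical helpers, not part of the dataset.

```python
# Assumption: pitches are MIDI note numbers (60 = middle C).
# The annotation file format itself is not described on this page.
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def pitch_class(midi_pitch):
    """Return the pitch class (chroma index 0..11) of a MIDI pitch."""
    return midi_pitch % 12


def pitch_class_name(midi_pitch):
    """Return a sharp-based note name for the pitch class of a MIDI pitch."""
    return NOTE_NAMES[pitch_class(midi_pitch)]
```

Pitches an octave apart (for example 57 and 69) map to the same chroma index, which is what makes pitch class useful for harmony-related analyses.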

The SWD is organized as follows:

01_RawData

  • lyrics_txt
  • score-IMSLP_png
  • score-IMSLP_pdf-complete.pdf
  • score_sibelius
  • score_pdf
  • score_musicxml
  • score_midi
  • audio_wav

02_Annotations

  • ann_score-IMSLP_measure
  • ann_score_chord
  • ann_score_localkey-ann1
  • ann_score_localkey-ann2
  • ann_score_localkey-ann3
  • ann_score_globalkey.csv
  • ann_score_structure
  • ann_audio_measure
  • ann_audio_chord
  • ann_audio_localkey-ann1
  • ann_audio_localkey-ann2
  • ann_audio_localkey-ann3
  • ann_audio_globalkey.csv
  • ann_audio_structure
  • ann_audio_note

03_ExtraMaterial

  • original data
  • scripts for cutting
  • licenses of the published performances

Further information can be found in the README file and the accompanying journal paper.
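
After unpacking the archive, the layout listed above can be checked with a short script. This is a minimal sketch that assumes the listed names appear verbatim as files or folders directly under `01_RawData` and `02_Annotations` inside the extracted root; the function name `missing_entries` is hypothetical, and the contents of `03_ExtraMaterial` are not checked because the page describes them rather than naming them.

```python
from pathlib import Path

# Entry names copied from the listing on this page. Assumption: they appear
# verbatim as files or folders inside the extracted dataset root.
EXPECTED = {
    "01_RawData": [
        "lyrics_txt", "score-IMSLP_png", "score-IMSLP_pdf-complete.pdf",
        "score_sibelius", "score_pdf", "score_musicxml", "score_midi",
        "audio_wav",
    ],
    "02_Annotations": [
        "ann_score-IMSLP_measure", "ann_score_chord",
        "ann_score_localkey-ann1", "ann_score_localkey-ann2",
        "ann_score_localkey-ann3", "ann_score_globalkey.csv",
        "ann_score_structure", "ann_audio_measure", "ann_audio_chord",
        "ann_audio_localkey-ann1", "ann_audio_localkey-ann2",
        "ann_audio_localkey-ann3", "ann_audio_globalkey.csv",
        "ann_audio_structure", "ann_audio_note",
    ],
}


def missing_entries(root):
    """Return the relative paths of expected entries not found under root."""
    root = Path(root)
    return [
        f"{folder}/{name}"
        for folder, names in EXPECTED.items()
        for name in names
        if not (root / folder / name).exists()
    ]
```

An empty list means every listed entry was found; anything else names the entries that are absent, for instance because the archive was extracted into a different root folder.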

License:

The dataset is published under a Creative Commons Attribution 3.0 Unported license (https://creativecommons.org/licenses/by/3.0/legalcode). We acknowledge the publication of the performance by baritone Randall Scarlata with pianist Jeremy Denk (denoted as SC06), published under the same license by the Isabella Stewart Gardner Museum (Boston). Please note that the audio files of this performance must not be re-distributed in modified form. All other sources for the dataset are in the Public Domain.

Notes

For a detailed description, please see the accompanying journal paper: Christof Weiß, Frank Zalkow, Vlora Arifi-Müller, Meinard Müller, Hendrik Vincent Koops, Anja Volk, and Harald G. Grohganz. Schubert Winterreise Dataset: A Multimodal Scenario for Music Analysis. In: ACM Journal on Computing and Cultural Heritage, 2021.

Files

  • Schubert_Winterreise_Dataset_v2-0.zip (506.9 MB)
    md5:d3a48422a90bf177ef89349725042978
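
The MD5 checksum listed for Schubert_Winterreise_Dataset_v2-0.zip can be used to confirm that a download is complete and uncorrupted. The following is a minimal sketch using Python's standard `hashlib` module; the local path to the archive and the helper name `md5_of_file` are assumptions for illustration.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record page.
EXPECTED_MD5 = "d3a48422a90bf177ef89349725042978"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumption: the archive was saved to the current working directory.
    archive = Path("Schubert_Winterreise_Dataset_v2-0.zip")
    if archive.exists():
        matches = md5_of_file(archive) == EXPECTED_MD5
        print("checksum OK" if matches else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

Reading in chunks keeps memory use low for an archive of roughly 500 MB. A mismatch usually indicates an interrupted or corrupted download, so fetching the file again is the simplest remedy.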

Additional details

Related works

Is documented by
Journal article: 10.1145/3429743 (DOI)

References

  • Frans Absil. Musical Analysis – Visiting the Great Composers (6th ed.), 2017
  • Harald Grohganz. Algorithmen zur strukturellen Analyse von Musikaufnahmen. Ph.D. Dissertation. University of Bonn, Germany, 2015
  • Hendrik Vincent Koops. Computational Modelling of Variance in Musical Harmony. Ph.D. Dissertation. Utrecht University, Utrecht, The Netherlands, 2019
  • Frank Zalkow, Angel Villar Corrales, TJ Tsai, Vlora Arifi-Müller, and Meinard Müller. Tools for Semi-Automatic Bounding Box Annotation of Musical Measures in Sheet Music. In Demos and Late Breaking News of the International Society for Music Information Retrieval Conference (ISMIR). Delft, The Netherlands, 2019