Published January 19, 2021 | Version 1.0
Dataset Open

Using Corpus Studies to Find the Origins of the Madrigal: Music and Feature Values

  • 1. McGill University
  • 2. Marianopolis College

Description

This distribution includes files associated with the experiments described in the "Using Corpus Studies to Find the Origins of the Madrigal" paper presented at the 2021 Future Directions of Music Cognition conference (http://org.osu.edu/mascats/). Our group encoded all the music from the original sources ourselves, using Sibelius, and exported the Sibelius data as PDF and MIDI files. The details of the corpus are described in the "Florence 164 Metadata.xlsx" file and in the conference paper itself.

All files included in this archive are distributed under a "CC BY-SA 4.0" license" license (https://creativecommons.org/licenses/by-sa/4.0/). 

All features were extracted using jSymbolic 2.2 (http://jmir.sourceforge.net) directly from the MIDI encodings included here. Details on all the individual features extracted with the software are available in the jSymbolic manual (http://jmir.sourceforge.net/manuals/jSymbolic_manual/home.html). The features are presented as follows in the "F164_Extracted_Features" folder:

- F164_Feature_Definitions.xml: Descriptions of all extracted features, encoded in ACE XML 1.0, as output directly by jSymbolic. This file does not itself include any feature values.

- F164_Feature_Values.xml: Extracted feature values, encoded in ACE XML 1.0, as output directly by jSymbolic. The features are described in the FeatureDefinitions.xml file. Class associations are implied by the folder containing each MIDI file.

- F164_Feature_Values_BasicCSV: Extracted feature values encoded in a CSV file, as output directly by jSymbolic. Class associations are implied by the folder containing each MIDI file.

- F164_Feature_Values_HumanReadable.xlsx: Extracted features formatted into a human-readable Microsoft Excel file. Group averages and standard deviations have been added to the bottom.

- F164_Feature_Values_WekaReady.csv: Extracted feature values encoded in a CSV file in a format readable by Weka (https://www.cs.waikato.ac.nz/ml/weka/). Class values have been added in a column on the right, and file paths have been removed, as required by Weka.
 

Notes

Thanks to Ian Lorenz, Jonathan Stuchbery, Linda Pearse, Sara Sabol, Vi-An Tran, Zoey Cochran, Tristan Tenaglio, Rían Adamían, Ichiro Fujinaga, and SIMSSA for their important contributions. Financial support was provided by the granting agencies FRQSC (Fonds de recherche Société et Culture du Québec) and SSHRC (Social Sciences and Humanities Research Council of Canada).

Files

Florence164_Files_and_Features.zip

Files (3.7 MB)

Name Size Download all
md5:93d5ca5384562f1610ce0018fcdb7012
3.7 MB Preview Download