PHENICX-Anechoic: note annotations for Aalto anechoic orchestral database
This dataset includes audio and annotations useful for tasks as score-informed source separation, score following, multi-pitch estimation, transcription or instrument detection, in the context of symphonic music.
This dataset was presented and used in the evaluation of:
M. Miron, J. Carabias-Orti, J. J. Bosch, E. Gómez and J. Janer, "Score-informed source separation for multi-channel orchestral recordings", Journal of Electrical and Computer Engineering (2016))"
On this web page we do not provide the original audio files, which can be found at the web page hosted by Aalto University. However, with their permission we distribute the denoised versions for some of the anechoic orchestral recordings:
Pätynen, J., Pulkki, V., and Lokki, T., "Anechoic recording system for symphony orchestra," Acta Acustica united with Acustica, vol. 94, nr. 6, pp. 856-865, November/December 2008.
For the intellectual rights and the distribution policy of the audio recordings in this dataset contact Aalto University, Jukka Pätynen and Tapio Lokki. For more information about the original anechoic recordings we refer to the web page and the associated publication 
We provide the associated musical note onset and offset annotations, and the Roomsim configuration files used to generate the multi-microphone recordings .
The anechoic dataset in  consists of four passages of symphonic music from the Classical and Romantic periods. This work presented a set of anechoic recordings for each of the instruments, which were then synchronized between them so that they could later be combined to a mix of the orchestra. In order to keep the evaluation setup consistent between the four pieces, we selected the following instruments: violin, viola, cello, double bass, oboe, flute, clarinet, horn, trumpet and bassoon.
We created a ground truth score, by hand annotating the notes played by the instruments. The annotation process involved gathering the original scores in MIDI format, performing an initial automatic audio-to-score alignment, then manually aligning each instrument track separately with the guidance of a monophonic pitch estimation.
During the recording process detailed in , the gain of the microphone amplifiers was fixed to the same value for the whole process, which reduced the dynamic range of the recordings of the quieter instruments. This lead to problems with which we had to deal, in order to reduce the noise. In the paper we described the score-informed denoising procedure we applied to each track.
A complete description of the dataset and the creation methodology, including the generation of the multi-microphone recordings, is presented in .
Please Acknowledge PHENICX-Anechoic in Academic Research
Using this dataset
When the present dataset is used for academic research, we would highly appreciate if scientific publications of works partly based on the PHENICX-Anechoic dataset quote the publications above.
We are interested in knowing if you find our datasets useful! If you use our dataset please email us at firstname.lastname@example.org and tell us about your research.
- 10.1155/2016/8363507 (DOI)
-  M. Miron, J. Carabias-Orti, J. J. Bosch, E. Gómez and J. Janer, "Score-informed source separation for multi-channel orchestral recordings", Journal of Electrical and Computer Engineering (2016)
-  Pätynen, J., Pulkki, V., and Lokki, T., "Anechoic recording system for symphony orchestra," Acta Acustica united with Acustica, vol. 94, nr. 6, pp. 856-865, November/December 2008.
-  Campbell, D., K. Palomaki, and G. Brown. "A Matlab simulation of" shoebox" room acoustics for use in research and teaching." Computing and Information Systems 9.3 (2005): 48.