Gontier, Felix; Lagrange, Mathieu; Aumond, Pierre; Lavandier, Catherine; Petiot, Jean-François

  "abstract": "<p>This is a corpus of 100 sound scenes of 45s each representing urban sound environments, including:</p>\n\n<ul>\n\t<li>6 scenes recorded in Paris,</li>\n\t<li>19 scenes simulated using simScene to replicate recorded scenarios, including the 6 recordings in this corpus,</li>\n\t<li>75 scenes simulated using simScene with diverse new scenarios, containing traffic, human voices and bird sources.</li>\n</ul>\n\n<p>The base audio files used for simulation are obtained from Freesound and LibriSpeech.</p>\n\n<p>This corpus has been evaluated by a panel of participants in a listening experiment, with assessments on the following 0-10 Likert scales:</p>\n\n<ol>\n\t<li><strong>Pleasantness</strong>: Unpleasant - Pleasant,</li>\n\t<li><strong>Liveliness</strong>: Inert, amorphous - Lively, eventful,</li>\n\t<li><strong>Overall loudness</strong>: Quiet - Noisy,</li>\n\t<li><strong>Interest</strong>: Boring, uninteresting - Stimulating, interesting,</li>\n\t<li><strong>Calmness</strong>: Agitated, chaotic - Calm, peaceful,</li>\n\t<li><strong>Sound level of passing vehicles</strong>: Very low - Very high,</li>\n\t<li><strong>Time of presence of traffic</strong>: Never - Continuously,</li>\n\t<li><strong>Time of presence of voices</strong>: Never - Continuously,</li>\n\t<li><strong>Time of presence of birds</strong>: Never - Continuously.</li>\n</ol>\n\n<p>Assessments from 23 subjects are available for the 6 recorded and 19 replicated scenes, and from 7 to 8 subjects for the 75 simulated scenes.</p>\n\n<p>The contents of this dataset are as follows:</p>\n\n<ul>\n\t<li><em>assessments</em>: contains evaluations of the corpus on the perceptual scales by 23 subjects\n\n\t<ul>\n\t\t<li><em>sXX</em>: Folder corresponding to participant XX\n\n\t\t<ul>\n\t\t\t<li><em>Pt_.txt</em>: Contains assessments. Each line corresponds to one scene; the columns give, in order, the scene number (see audio_list.txt for correspondence), pleasantness, liveliness, overall loudness, interest, calmness, sound level of passing vehicles, and time of presence of traffic, human voices, and bird sources.</li>\n\t\t</ul>\n\t\t</li>\n\t</ul>\n\t</li>\n\t<li><em>audio</em>\n\t<ul>\n\t\t<li><em>rec</em>: contains the 6 recorded 45s scenes</li>\n\t\t<li><em>rep</em>: contains the 19 replicated 45s scenes, with separated tracks for source contributions</li>\n\t\t<li><em>sim</em>: contains the 75 simulated 45s scenes, with separated tracks for source contributions</li>\n\t</ul>\n\t</li>\n\t<li><em>audio_list.txt</em>: list of audio files in the corpus; line ordering corresponds to the scene numbers in the first column of the assessments</li>\n\t<li><em>rep_exp.mat</em>, <em>sim_exp.mat</em>: Additional information about the corpus, with playback Leq and physical estimations of the perceptual time of presence of sources for each scene.</li>\n</ul>", 
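The column layout described for the assessment files can be read with a short script. The following is a minimal sketch, not part of the dataset: it assumes each line holds ten numeric values separated by whitespace or commas (the actual delimiter is not stated in the description), and the names `COLUMNS` and `parse_assessments` are hypothetical.

```python
from typing import Dict, List

# Column order as listed in the dataset description.
# The short names are illustrative labels, not identifiers from the dataset.
COLUMNS = [
    "scene",
    "pleasantness",
    "liveliness",
    "overall_loudness",
    "interest",
    "calmness",
    "vehicle_sound_level",
    "traffic_presence",
    "voice_presence",
    "bird_presence",
]


def parse_assessments(text: str) -> List[Dict[str, float]]:
    """Parse the contents of one assessment file into one dict per scene.

    Assumption: values are separated by whitespace and/or commas.
    """
    rows = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        fields = line.replace(",", " ").split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != len(COLUMNS):
            raise ValueError(
                f"line {line_number}: expected {len(COLUMNS)} columns, "
                f"got {len(fields)}"
            )
        values = [float(field) for field in fields]
        row = dict(zip(COLUMNS, values))
        # The first column is an index into audio_list.txt, so keep it integral.
        row["scene"] = int(values[0])
        rows.append(row)
    return rows
```

To use it on a participant file, read the file contents with `open(path).read()` and pass the string to `parse_assessments`; the `scene` value of each row can then be matched against the line ordering of audio_list.txt.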
