Dataset Open Access

SimSceneTVB Perception

Gontier, Felix; Lagrange, Mathieu; Aumond, Pierre; Lavandier, Catherine; Petiot, Jean-François

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Gontier, Felix</dc:creator>
  <dc:creator>Lagrange, Mathieu</dc:creator>
  <dc:creator>Aumond, Pierre</dc:creator>
  <dc:creator>Lavandier, Catherine</dc:creator>
  <dc:creator>Petiot, Jean-François</dc:creator>
  <dc:description>This is a corpus of 100 sound scenes of 45s each representing urban sound environments, including:

	6 scenes recorded in Paris,
	19 scenes simulated using simScene ( to replicate recorded scenarios, including the 6 recordings in this corpus,
	75 scenes simulated using simScene with diverse new scenarios, containing traffic, human voices and bird sources.

The base audio files used for simulation are obtained from Freesound ( and LibriSpeech (

This corpus has been evaluated by a panel of participants in a listening experiment, with assessments on the following 0-10 Likert scales:

	Pleasantness: Unpleasant - Pleasant,
	Liveliness: Inert, amorphous - Lively, eventful,
	Overall loudness: Quiet - Noisy,
	Interest: Boring, uninteresting - Stimulating, interesting,
	Calmness: Agitated, chaotic - Calm, peaceful,
	Sound level of passing vehicles: Very low - Very high,
	Time of presence of traffic: Never - Continuously,
	Time of presence of voices: Never - Continuously,
	Time of presence of birds: Never - Continuously.

Assessments from 23 subjects are available for the 6 recorded and 19 simulated scenes, and from 7 to 8 subjects for the 75 simulated scenes.

The contents of this dataset are as follow:

	assessments: contains evaluations by 23 subjects of perceptual scales on the corpus

		sXX: Folder corresponding to participant XX

			Pt_.txt: Contains assessments. Each line corresponds to one scene, columns correspond to (resp.) scene number (see audio_list.txt for correspondance), pleasantness, liveliness, overall loudness, interest, calmness, sound level of passing vehicles, time of presence of traffic, human voices, bird sources.
		rec: contains the 6 recorded 45s scenes
		rep: contains the 19 replicated 45s scenes, with separated tracks for source contributions
		sim: contains the 75 simulated 45s scenes, with separated tracks for source contributions
	audio_list.txt: list of audio files in the corpus, line ordering corresponds to numbers in the first column of assessments
	rep_exp.mat, sim_exp.mat: Additional information about the corpus, with playback Leq and physical estimations of the perceptual time of presence of sources for each scene.
  <dc:subject>Urban sound environments</dc:subject>
  <dc:subject>Noise mitigation</dc:subject>
  <dc:title>SimSceneTVB Perception</dc:title>
All versions This version
Views 110110
Downloads 99
Data volume 7.1 GB7.1 GB
Unique views 9292
Unique downloads 77


Cite as