Published June 18, 2019 | Version v1
Dataset Open

SimSceneTVB Learning

  • 1. LS2N, UMR CNRS 6004, Ecole Centrale de Nantes
  • 3. ETIS, UMR CNRS 8051, University of Paris Seine, University of Cergy-Pontoise, ENSEA


This is a dataset of 600 simulated sound scenes of 45s each representing urban sound environments, simulated using the simScene Matlab library ( The dataset is divided in two parts with a train subset (400 scenes) and a test subset (200 scenes) for the development of learning-based models.

Each scene is composed of three main sources (traffic, human voices and birds) according to an original scenario, which is composed semi-randomly conditionally to five ambiances: park, quiet street, noisy street, very noisy street and square. Separate channels for the contribution of each source are available. The base audio files used for simulation are obtained from Freesound ( and LibriSpeech ( The sound scenes are scaled according to a playback sound level in dB, which is drawn randomly but remains plausible according to the ambiance.


Files (4.4 GB)

Name Size Download all
4.4 GB Preview Download