Published September 20, 2025 | Version v1
Dataset Open

Audio files for: Expressive range characterization of open text-to-audio models (AIIDE 2025)

  • 1. ROR icon New Jersey Institute of Technology
  • 2. EDMO icon University of Gothenburg
  • 3. ROR icon American University

Description

Audio files for the paper:

  • Jonathan Morse, Azadeh Naderi, Swen Gaudl, Mark Cartwright, Amy K. Hoover, Mark J. Nelson (2025). Expressive range characterization of open text-to-audio models. In: Proceedings of the 21st AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment.

Contents:

  • fig1_samples.zip: Generated audio for the examples in Fig. 1. Two prompts; two models; 100 samples for each.
  • thunder_samples.zip: Generated audio for the running "thunder" example.  One prompt; two models; 100 samples for each. Source for Figs. 2-4.
  • esc50_samples.zip: Generated audio for the prompt "Sound of X" for each label X in the ESC-50 environmental audio dataset. Fifty prompts; three models; 100 samples for each. Source for Figs. 5-6 and Table 1.
  • generation_scripts.zip: Python scripts used to generate audio from the three models.

Files

esc50_samples.zip

Files (4.2 GB)

Name Size Download all
md5:647f1efc6304d8ffed379272380de22f
4.1 GB Preview Download
md5:a5b8d77b9283f4c840e0bd5d4ff85a32
128.3 MB Preview Download
md5:958eeb93fe015412874e9c628f53c2ed
4.6 kB Preview Download
md5:55585c2d7fd385aa82c367f304bfe8cf
45.0 MB Preview Download