Stylewavegan: Style-Based Synthesis of Drum Sounds With Extensive Controls Using Generative Adversarial Networks

Lavault, Antoine; Roebel, Axel; Voiry, Matthieu

doi:10.5281/zenodo.6573361

There is a newer version of the record available.

Published June 7, 2022 | Version v1

Conference paper Open

1. Sorbonne Université
2. Ministère de la Culture
3. Apeira Technologies

In this paper, we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of StyleGAN, a state-of-the-art image generator. By conditioning StyleWaveGAN on both the type of drum and several audio descriptors, we are able to synthesize waveforms faster than real time on a GPU, directly in CD quality and up to a duration of 1.5 s, while retaining a considerable amount of control over the generation. We also introduce an alternative to the progressive growing of GANs and study the effect of dataset balancing for generative tasks. The experiments are carried out on an augmented subset of a publicly available dataset comprising different drums and cymbals. We evaluate against two recent drum generators, WaveGAN and NeuroDrum, demonstrating significantly improved generation quality (measured with the Fréchet Audio Distance) and interesting results with perceptual features.
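
As background on the metric named in the abstract: the Fréchet Audio Distance fits one Gaussian to embeddings of reference audio and another to embeddings of generated audio, then measures the Fréchet distance between the two. The sketch below is not taken from the paper. It assumes the embeddings are already available as NumPy arrays (the original FAD proposal obtains them from a pretrained VGGish model; this record does not state which setup the authors used), and the function name `frechet_distance` is hypothetical.

```python
import numpy as np


def frechet_distance(x, y):
    """Frechet distance between Gaussians fitted to two embedding sets.

    x, y: arrays of shape (n_samples, n_features), one embedding per row.
    Hypothetical helper for illustration; not the authors' evaluation code.
    """
    mu_x, mu_y = x.mean(axis=0), y.mean(axis=0)
    cov_x = np.atleast_2d(np.cov(x, rowvar=False))
    cov_y = np.atleast_2d(np.cov(y, rowvar=False))

    # Tr(sqrt(cov_x @ cov_y)) is the sum of the square roots of the
    # eigenvalues of the product, which are real and non-negative for
    # covariance matrices; clip guards against tiny negative round-off.
    eigvals = np.linalg.eigvals(cov_x @ cov_y).real
    tr_sqrt = np.sqrt(np.clip(eigvals, 0.0, None)).sum()

    diff = mu_x - mu_y
    return float(diff @ diff + np.trace(cov_x) + np.trace(cov_y) - 2.0 * tr_sqrt)


# Random arrays stand in for real audio embeddings (assumption).
rng = np.random.default_rng(0)
reference = rng.normal(size=(200, 4))
generated = reference + np.array([1.0, 0.0, 0.0, 0.0])  # same spread, shifted mean
print(frechet_distance(reference, generated))
```

A lower value means the generated set is statistically closer to the reference set. Here the two sets share a covariance, so the distance reduces to the squared distance between the means.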

Files

Files (601.5 kB)

Name	Size
47.pdf (md5:f4614ace6cf334f9eeb4125b5c26473c)	601.5 kB

	All versions	This version
Views	394	159
Downloads	320	161
Data volume	205.0 MB	100.4 MB

Resource type: Conference paper

Publisher: Zenodo

Languages: English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: May 23, 2022
Modified: July 16, 2024
