Published November 13, 2023 | Version v1
Conference paper Open

PolyDDSP: A Lightweight and Polyphonic Differentiable Digital Signal Processing Library

Description

This paper presents a work-in-progress DSP architecture that builds on the Differentiable Digital Signal Processing (DDSP) library by Engel et al. (2020). The architecture is designed to process polyphonic musical audio in real time, making use of classical DSP methods for greater interpretability. Using recent advances in lightweight polyphonic pitch detection models, it can process multiple input audio streams simultaneously, and a novel stochastic latent dimension lets the model generate audio outputs with more variation. Because it is lightweight, the proposed architecture is intended for live audio transformations with minimal input latency. The paper also discusses the limitations of the existing state-of-the-art model, which is deterministic and restricted to monophonic processing. Throughout, the paper explores potential applications of the proposed model. These include not only versatile timbre transfer between distinct instruments but also interpolation between timbres, creating new sounds that can expand the aural palette of musicians, sound designers, and experimental composers using live electronics. Furthermore, the model extends the library's existing tools, such as natural pitch shifting and room acoustic reverb modelling, to previously unusable polyphonic inputs.
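As a rough illustration of the ideas in the description, the sketch below sums one additive harmonic oscillator per detected pitch and linearly blends two sets of harmonic amplitudes. It is not the PolyDDSP or DDSP API: the function names, the 16 kHz sample rate, the fixed (non-time-varying) amplitudes, and the use of linear interpolation as a stand-in for timbre interpolation are all assumptions made for this example.

```python
import numpy as np


def harmonic_voice(f0_hz, harmonic_amps, n_samples, sample_rate=16000):
    """Additive synthesis of one voice: sinusoids at integer multiples of f0.

    Hypothetical helper for illustration only.
    """
    harmonic_amps = np.asarray(harmonic_amps, dtype=float)
    t = np.arange(n_samples) / sample_rate
    freqs = f0_hz * np.arange(1, len(harmonic_amps) + 1)
    # Silence harmonics at or above the Nyquist frequency to avoid aliasing.
    amps = np.where(freqs < sample_rate / 2, harmonic_amps, 0.0)
    phases = 2 * np.pi * freqs[:, None] * t[None, :]
    return (amps[:, None] * np.sin(phases)).sum(axis=0)


def polyphonic_mix(f0s_hz, amps_per_voice, n_samples, sample_rate=16000):
    """Sum one harmonic voice per pitch (assumed polyphonic extension)."""
    voices = [
        harmonic_voice(f0, amps, n_samples, sample_rate)
        for f0, amps in zip(f0s_hz, amps_per_voice)
    ]
    return np.sum(voices, axis=0)


def interpolate_timbre(amps_a, amps_b, alpha):
    """Linear blend of two harmonic amplitude vectors, with alpha in [0, 1]."""
    amps_a = np.asarray(amps_a, dtype=float)
    amps_b = np.asarray(amps_b, dtype=float)
    return (1.0 - alpha) * amps_a + alpha * amps_b


# Two simultaneous notes, each with its own harmonic amplitudes.
audio = polyphonic_mix([220.0, 330.0], [[1.0, 0.5], [0.8, 0.2]], n_samples=1600)
# A timbre halfway between the two amplitude vectors.
blended = interpolate_timbre([1.0, 0.5], [0.8, 0.2], alpha=0.5)
```

In a learned system the pitches would come from a pitch detection model and the amplitudes from a decoder network, frame by frame; here both are hard-coded so that the signal path stays visible.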

Files

Name: cmmr2023_3c-P8.pdf
Size: 909.1 kB
md5:01c50fde9155da326960db67f5008d54