Presentation Open Access

ISMIR 2019 tutorial: waveform-based music processing with deep learning

Jongpil Lee; Jordi Pons; Sander Dieleman


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Jongpil Lee</dc:creator>
  <dc:creator>Jordi Pons</dc:creator>
  <dc:creator>Sander Dieleman</dc:creator>
  <dc:date>2019-11-04</dc:date>
  <dc:description>A common practice when processing music signals with deep learning is to transform the raw waveform input into a time-frequency representation. This pre-processing step allows having less variable and more interpretable input signals. However, along that process, one can limit the model's learning capabilities since potentially useful information (like the phase or high frequencies) is discarded. In order to overcome the potential limitations associated with such pre-processing, researchers have been exploring waveform-level music processing techniques, and many advances have been made with the recent advent of deep learning.

In this tutorial, we introduce three main research areas where waveform-based music processing can have a substantial impact:

1) Classification: waveform-based music classifiers have the potential to simplify production and research pipelines.

2) Source separation: making possible waveform-based music source separation would allow overcoming some historical challenges associated with discarding the phase.

3) Generation: waveform-level music generation would enable, e.g., to directly synthesize expressive music.

Link to the original Google Slides</dc:description>
  <dc:identifier>https://zenodo.org/record/3529714</dc:identifier>
  <dc:identifier>10.5281/zenodo.3529714</dc:identifier>
  <dc:identifier>oai:zenodo.org:3529714</dc:identifier>
  <dc:relation>doi:10.5281/zenodo.3529713</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:title>ISMIR 2019 tutorial: waveform-based music processing with deep learning</dc:title>
  <dc:type>info:eu-repo/semantics/lecture</dc:type>
  <dc:type>presentation</dc:type>
</oai_dc:dc>
1,008
975
views
downloads
All versions This version
Views 1,0081,002
Downloads 975974
Data volume 13.4 GB13.4 GB
Unique views 875869
Unique downloads 821820

Share

Cite as