Presentation Open Access

ISMIR 2019 tutorial: waveform-based music processing with deep learning

Jongpil Lee; Jordi Pons; Sander Dieleman


JSON-LD (schema.org) Export

{
  "description": "<p>A common practice when processing music signals with deep learning is to transform the raw waveform input into a time-frequency representation. This pre-processing step allows having less variable and more interpretable input signals. However, along that process, one can limit the model&#39;s learning capabilities since potentially useful information (like the phase or high frequencies) is discarded. In order to overcome the potential limitations associated with such pre-processing, researchers have been exploring waveform-level music processing techniques, and many advances have been made with the recent advent of deep learning.</p>\n\n<p>In this tutorial, we introduce three main research areas where waveform-based music processing can have a substantial impact:</p>\n\n<p>1) Classification: waveform-based music classifiers have the potential to simplify production and research pipelines.</p>\n\n<p>2) Source separation: making possible waveform-based music source separation would allow overcoming some historical challenges associated with discarding the phase.</p>\n\n<p>3) Generation: waveform-level music generation would enable, e.g., to directly synthesize expressive music.</p>\n\n<p><a href=\"https://docs.google.com/presentation/d/1_ezZXDkyhp9USAYMc5oKJCkUrUhBfo-Di8H8IfypGBM/edit?usp=sharing\">Link to the original Google Slides</a></p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "KAIST", 
      "@type": "Person", 
      "name": "Jongpil Lee"
    }, 
    {
      "affiliation": "Dolby Laboratories", 
      "@type": "Person", 
      "name": "Jordi Pons"
    }, 
    {
      "affiliation": "DeepMind", 
      "@type": "Person", 
      "name": "Sander Dieleman"
    }
  ], 
  "url": "https://zenodo.org/record/3529714", 
  "datePublished": "2019-11-04", 
  "@type": "PresentationDigitalDocument", 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.5281/zenodo.3529714", 
  "@id": "https://doi.org/10.5281/zenodo.3529714", 
  "workFeatured": {
    "url": "https://ismir2019.ewi.tudelft.nl/?q=node/1", 
    "alternateName": "ISMIR", 
    "location": "Delft", 
    "@type": "Event", 
    "name": "20th annual conference of the International Society for Music Information Retrieval"
  }, 
  "name": "ISMIR 2019 tutorial: waveform-based music processing with deep learning"
}
1,950
1,979
views
downloads
All versions This version
Views 1,9501,944
Downloads 1,9791,978
Data volume 27.2 GB27.2 GB
Unique views 1,7161,710
Unique downloads 1,7321,731

Share

Cite as