Thesis Open Access

A Wavenet for Music Source Separation

Francesc Lluís Salvadó


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/e806a5e3-05a6-455b-8169-fb187d832c82/A_Wavenet_for_Music_Source_Separation.pdf"
      }, 
      "checksum": "md5:d0f3b0f5050688304e7e0b761a8ec7bd", 
      "bucket": "e806a5e3-05a6-455b-8169-fb187d832c82", 
      "key": "A_Wavenet_for_Music_Source_Separation.pdf", 
      "type": "pdf", 
      "size": 1515881
    }
  ], 
  "owners": [
    54852
  ], 
  "doi": "10.5281/zenodo.1475940", 
  "stats": {
    "version_unique_downloads": 70.0, 
    "unique_views": 99.0, 
    "views": 114.0, 
    "downloads": 81.0, 
    "unique_downloads": 70.0, 
    "version_unique_views": 99.0, 
    "volume": 122786361.0, 
    "version_downloads": 81.0, 
    "version_views": 114.0, 
    "version_volume": 122786361.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.1475940", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.1475939", 
    "bucket": "https://zenodo.org/api/files/e806a5e3-05a6-455b-8169-fb187d832c82", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1475939.svg", 
    "html": "https://zenodo.org/record/1475940", 
    "latest_html": "https://zenodo.org/record/1475940", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1475940.svg", 
    "latest": "https://zenodo.org/api/records/1475940"
  }, 
  "conceptdoi": "10.5281/zenodo.1475939", 
  "created": "2018-10-31T14:58:40.955829+00:00", 
  "updated": "2019-04-09T14:23:40.550037+00:00", 
  "conceptrecid": "1475939", 
  "revision": 5, 
  "id": 1475940, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.1475940", 
    "description": "<p>Currently, most successful source separation techniques use magnitude spectrograms as input, and are therefore by default discarding part of the signal: the phase. In order to avoid discarding potentially useful information, we propose an end-to-end learning model based on Wavenet for music source separation. As a result, the model we propose directly operates over the waveform, enabling, in that way, to consider any information available in the raw audio signal. Provided that the original Wavenet model operates sequentially (i.e., is not parallelizable and hence slow), in this work we make use of a discriminative non-causal adaptation of Wavenet capable to predict more than one sample at a time, thus permitting to overcome the undesirable time-complexity that the original Wavenet model has. Further, we investigate several data augmentation techniques and architectural changes to provide some insights on which are the most sensitive hyper-parameters for this family of Wavenet-like models. Our experimental results show that it is possible to approach the problem of music source separation in a end-to-end learning fashion, since our model performs on par with DeepConvSep, a state-of-the-art method based on processing magnitude spectrograms.</p>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "A Wavenet for Music Source Separation", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "1475939"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1475940"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "smc-master"
      }
    ], 
    "publication_date": "2018-08-31", 
    "creators": [
      {
        "name": "Francesc Llu\u00eds Salvad\u00f3"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "subtype": "thesis", 
      "type": "publication", 
      "title": "Thesis"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "relation": "isVersionOf", 
        "identifier": "10.5281/zenodo.1475939"
      }
    ]
  }
}
                  All versions   This version
Views             114            114
Downloads         81             81
Data volume       122.8 MB       122.8 MB
Unique views      99             99
Unique downloads  70             70
