There is a newer version of this record available.

Presentation Open Access

Fast open modification spectral library searching through approximate nearest neighbor indexing

Bittremieux, Wout; Meysman, Pieter; Noble, William Stafford; Laukens, Kris


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/e346d6f1-c44f-471e-998a-92fa83d9efa6/Cascadia%20Proteomics_2018_Fast%20open%20modification%20spectral%20library%20searching%20through%20approximate%20nearest%20neighbor%20indexing.pptx"
      }, 
      "checksum": "md5:e4403f2a7853896a73141e3fd4e3b063", 
      "bucket": "e346d6f1-c44f-471e-998a-92fa83d9efa6", 
      "key": "Cascadia Proteomics_2018_Fast open modification spectral library searching through approximate nearest neighbor indexing.pptx", 
      "type": "pptx", 
      "size": 14582346
    }
  ], 
  "owners": [
    22786
  ], 
  "doi": "10.5281/zenodo.1319036", 
  "stats": {
    "version_unique_downloads": 54.0, 
    "unique_views": 32.0, 
    "views": 37.0, 
    "version_views": 110.0, 
    "unique_downloads": 20.0, 
    "version_unique_views": 90.0, 
    "volume": 320811612.0, 
    "version_downloads": 65.0, 
    "downloads": 22.0, 
    "version_volume": 947851544.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.1319036", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.1319035", 
    "bucket": "https://zenodo.org/api/files/e346d6f1-c44f-471e-998a-92fa83d9efa6", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1319035.svg", 
    "html": "https://zenodo.org/record/1319036", 
    "latest_html": "https://zenodo.org/record/1319591", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1319036.svg", 
    "latest": "https://zenodo.org/api/records/1319591"
  }, 
  "conceptdoi": "10.5281/zenodo.1319035", 
  "created": "2018-07-22T17:26:44.100766+00:00", 
  "updated": "2020-01-20T15:34:29.462467+00:00", 
  "conceptrecid": "1319035", 
  "revision": 5, 
  "id": 1319036, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.1319036", 
    "description": "<p>Open modification search (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results.</p>\n\n<p>Here we present the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate, as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts.</p>\n\n<p>ANN-SoLo outperforms the state-of-the-art SpectraST spectral library search engine both in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies 40% more spectra than SpectraST while achieving a speedup of an order of magnitude.</p>\n\n<p>ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo.</p>", 
    "license": {
      "id": "CC-BY-SA-4.0"
    }, 
    "title": "Fast open modification spectral library searching through approximate nearest neighbor indexing", 
    "relations": {
      "version": [
        {
          "count": 2, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "1319035"
          }, 
          "is_last": false, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1319591"
          }
        }
      ]
    }, 
    "publication_date": "2018-07-23", 
    "creators": [
      {
        "orcid": "0000-0002-3105-1359", 
        "affiliation": "University of Washington", 
        "name": "Bittremieux, Wout"
      }, 
      {
        "orcid": "0000-0001-5903-633X", 
        "affiliation": "University of Antwerp", 
        "name": "Meysman, Pieter"
      }, 
      {
        "orcid": "0000-0001-7283-4715", 
        "affiliation": "University of Washington", 
        "name": "Noble, William Stafford"
      }, 
      {
        "orcid": "0000-0002-8217-2564", 
        "affiliation": "University of Antwerp", 
        "name": "Laukens, Kris"
      }
    ], 
    "meeting": {
      "url": "http://www.cascadiaproteomics.org/", 
      "dates": "23-24 July 2018", 
      "place": "Seattle, WA, USA", 
      "title": "Cascadia Proteomics Symposium"
    }, 
    "access_right": "open", 
    "resource_type": {
      "type": "presentation", 
      "title": "Presentation"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.1101/326173", 
        "relation": "isDocumentedBy"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.1319035", 
        "relation": "isVersionOf"
      }
    ]
  }
}
110
65
views
downloads
All versions This version
Views 11037
Downloads 6522
Data volume 947.9 MB320.8 MB
Unique views 9032
Unique downloads 5420

Share

Cite as