Presentation Open Access

Fast open modification spectral library searching through approximate nearest neighbor indexing

Bittremieux, Wout; Meysman, Pieter; Noble, William Stafford; Laukens, Kris


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1a540e6-2337-4e07-befd-ff72430a792d/Cascadia%20Proteomics_2018_Fast%20open%20modification%20spectral%20library%20searching%20through%20approximate%20nearest%20neighbor%20indexing.pptx"
      }, 
      "checksum": "md5:5c1c8da4dc6d1a814a38f4514ba281b9", 
      "bucket": "f1a540e6-2337-4e07-befd-ff72430a792d", 
      "key": "Cascadia Proteomics_2018_Fast open modification spectral library searching through approximate nearest neighbor indexing.pptx", 
      "type": "pptx", 
      "size": 14582324
    }
  ], 
  "owners": [
    22786
  ], 
  "doi": "10.5281/zenodo.1319591", 
  "stats": {
    "version_unique_downloads": 53.0, 
    "unique_views": 61.0, 
    "views": 71.0, 
    "version_views": 107.0, 
    "unique_downloads": 36.0, 
    "version_unique_views": 87.0, 
    "volume": 612457608.0, 
    "version_downloads": 64.0, 
    "downloads": 42.0, 
    "version_volume": 933269220.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.1319591", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.1319035", 
    "bucket": "https://zenodo.org/api/files/f1a540e6-2337-4e07-befd-ff72430a792d", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1319035.svg", 
    "html": "https://zenodo.org/record/1319591", 
    "latest_html": "https://zenodo.org/record/1319591", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1319591.svg", 
    "latest": "https://zenodo.org/api/records/1319591"
  }, 
  "conceptdoi": "10.5281/zenodo.1319035", 
  "created": "2018-07-23T14:01:46.297673+00:00", 
  "updated": "2020-01-20T15:34:50.826026+00:00", 
  "conceptrecid": "1319035", 
  "revision": 4, 
  "id": 1319591, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.1319591", 
    "description": "<p>Open modification search (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results.</p>\n\n<p>Here we present the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate, as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts.</p>\n\n<p>ANN-SoLo outperforms the state-of-the-art SpectraST spectral library search engine both in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies 40% more spectra than SpectraST while achieving a speedup of an order of magnitude.</p>\n\n<p>ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo.</p>", 
    "license": {
      "id": "CC-BY-SA-4.0"
    }, 
    "title": "Fast open modification spectral library searching through approximate nearest neighbor indexing", 
    "relations": {
      "version": [
        {
          "count": 2, 
          "index": 1, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "1319035"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1319591"
          }
        }
      ]
    }, 
    "publication_date": "2018-07-23", 
    "creators": [
      {
        "orcid": "0000-0002-3105-1359", 
        "affiliation": "University of Washington", 
        "name": "Bittremieux, Wout"
      }, 
      {
        "orcid": "0000-0001-5903-633X", 
        "affiliation": "University of Antwerp", 
        "name": "Meysman, Pieter"
      }, 
      {
        "orcid": "0000-0001-7283-4715", 
        "affiliation": "University of Washington", 
        "name": "Noble, William Stafford"
      }, 
      {
        "orcid": "0000-0002-8217-2564", 
        "affiliation": "University of Antwerp", 
        "name": "Laukens, Kris"
      }
    ], 
    "meeting": {
      "url": "http://www.cascadiaproteomics.org/", 
      "dates": "23-24 July 2018", 
      "place": "Seattle, WA, USA", 
      "title": "Cascadia Proteomics Symposium"
    }, 
    "access_right": "open", 
    "resource_type": {
      "type": "presentation", 
      "title": "Presentation"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.1101/326173", 
        "relation": "isDocumentedBy"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.1319035", 
        "relation": "isVersionOf"
      }
    ]
  }
}
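The description in the record above mentions a shifted dot product score for matching modified spectra to their unmodified counterparts. The following is a minimal illustrative sketch of that general idea, not ANN-SoLo's actual implementation. The function names, the greedy peak assignment, the fragment tolerance of 0.02 m/z, and the assumption of singly charged fragments are all hypothetical choices made for this example.

```python
import math


def normalize(peaks):
    """Scale intensities so the intensity vector has unit Euclidean norm."""
    norm = math.sqrt(sum(intensity * intensity for _, intensity in peaks))
    if norm == 0:
        return list(peaks)
    return [(mz, intensity / norm) for mz, intensity in peaks]


def shifted_dot(query, library, mass_shift, tol=0.02):
    """Illustrative shifted dot product between two peak lists.

    query, library: lists of (mz, intensity) tuples.
    mass_shift: precursor mass difference (query minus library), assumed
        here to apply directly to singly charged fragments.
    tol: fragment m/z tolerance (hypothetical default).
    """
    query, library = normalize(query), normalize(library)

    # Collect every peak pair that matches directly or after shifting the
    # library peak by the precursor mass difference.
    candidates = []
    for qi, (q_mz, q_int) in enumerate(query):
        for li, (l_mz, l_int) in enumerate(library):
            direct = abs(q_mz - l_mz) <= tol
            shifted = abs(q_mz - (l_mz + mass_shift)) <= tol
            if direct or shifted:
                candidates.append((q_int * l_int, qi, li))

    # Greedy assignment: take the largest intensity products first and use
    # each peak at most once.
    candidates.sort(reverse=True)
    used_query, used_library, score = set(), set(), 0.0
    for product, qi, li in candidates:
        if qi not in used_query and li not in used_library:
            used_query.add(qi)
            used_library.add(li)
            score += product
    return score


# Toy example: two of three fragments carry a +16 modification.
library_peaks = [(100.0, 1.0), (200.0, 1.0), (300.0, 1.0)]
query_peaks = [(100.0, 1.0), (216.0, 1.0), (316.0, 1.0)]
with_shift = shifted_dot(query_peaks, library_peaks, mass_shift=16.0)
without_shift = shifted_dot(query_peaks, library_peaks, mass_shift=0.0)
```

In this toy example, allowing the shift lets all three fragment pairs contribute to the score, whereas a plain dot product (a shift of zero) only matches the unmodified fragment.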
                  All versions   This version
Views             107            71
Downloads         64             42
Data volume       933.3 MB       612.5 MB
Unique views      87             61
Unique downloads  53             36
