Poster Open Access

ProPhyle: a phylogeny-based metagenomic classifier using the Burrows-Wheeler Transform

Břinda, Karel; Salikhov, Kamil; Pignotti, Simone; Kucherov, Gregory


JSON Export

{
  "conceptdoi": "10.5281/zenodo.1045426", 
  "conceptrecid": "1045426", 
  "created": "2017-11-11T19:25:05.897813+00:00", 
  "doi": "10.5281/zenodo.1045427", 
  "files": [
    {
      "bucket": "f81e8774-7cff-45b1-b5f9-9ffd4530988c", 
      "checksum": "md5:28ffd9a73b6a88248248f4443284eccb", 
      "key": "prophyle_hitseq_2017.pdf", 
      "links": {
        "self": "https://zenodo.org/api/files/f81e8774-7cff-45b1-b5f9-9ffd4530988c/prophyle_hitseq_2017.pdf"
      }, 
      "size": 632711, 
      "type": "pdf"
    }
  ], 
  "id": 1045427, 
  "links": {
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1045427.svg", 
    "bucket": "https://zenodo.org/api/files/f81e8774-7cff-45b1-b5f9-9ffd4530988c", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1045426.svg", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.1045426", 
    "doi": "https://doi.org/10.5281/zenodo.1045427", 
    "html": "https://zenodo.org/record/1045427", 
    "latest": "https://zenodo.org/api/records/1045427", 
    "latest_html": "https://zenodo.org/record/1045427"
  }, 
  "metadata": {
    "access_right": "open", 
    "access_right_category": "success", 
    "communities": [
      {
        "id": "bioinformatics"
      }
    ], 
    "creators": [
      {
        "affiliation": "CCDD Harvard TH Chan School of Public Health", 
        "name": "B\u0159inda, Karel", 
        "orcid": "0000-0003-0200-557X"
      }, 
      {
        "affiliation": "LIGM Universit\u00e9 Paris-Est", 
        "name": "Salikhov, Kamil"
      }, 
      {
        "affiliation": "LIGM Universit\u00e9 Paris-Est", 
        "name": "Pignotti, Simone"
      }, 
      {
        "affiliation": "LIGM/CNRS Universit\u00e9 Paris-Est", 
        "name": "Kucherov, Gregory"
      }
    ], 
    "description": "<p>Metagenomics is a powerful approach to study genetic content of environmental samples and it has been strongly promoted by Next-Generation Sequencing technologies. The aim of metagenomic classification is to assign each sequence of the metagenome to a corresponding taxonomic unit, or to classify it as \u201cnovel\u201d.</p>\n\n<p>To cope with increasingly large metagenomic projects, researchers resort to alignment-free methods. The most popular tool \u2013 Kraken \u2013 provides an extremely rapid read classification, but its index suffers from two major limitations: an enormous memory consumption and a lossy <em>k</em>-mer representation through their lowest common ancestors.</p>\n\n<p>We present Prophyle, a metagenomic classifier based on the Burrows-Wheeler Transform. ProPhyle uses a classification algorithm similar to Kraken but with an indexing strategy based on a bottom-up propagation of <em>k</em>-mers in the tree, assembling contigs at each node and matching using a standard full-text search. The obtained index occupies only a fraction of RAM compared to Kraken \u2013 13 GB instead of 90 GB for index construction and 14 GB instead of 72\u00a0GB for index querying. The resulting index is also more expressive, allowing users to retrieve a list of <em>all</em> genomes for every queried <em>k</em>-mer. Overall, ProPhyle provides an index for resource-frugal metagenomic classification, which is accurate even with single-species phylogenetic trees. Prophyle is available at http://github.com/karel-brinda/prophyle, released under the MIT license.</p>", 
    "doi": "10.5281/zenodo.1045427", 
    "language": "eng", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "publication_date": "2017-07-24", 
    "related_identifiers": [
      {
        "identifier": "10.5281/zenodo.1045426", 
        "relation": "isVersionOf", 
        "scheme": "doi"
      }
    ], 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1045427"
          }, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "1045426"
          }
        }
      ]
    }, 
    "resource_type": {
      "title": "Poster", 
      "type": "poster"
    }, 
    "title": "ProPhyle: a phylogeny-based metagenomic classifier using the Burrows-Wheeler Transform"
  }, 
  "owners": [
    30579
  ], 
  "revision": 3, 
  "updated": "2017-12-22T23:11:42.817061+00:00"
}

Share

Cite as