There is a newer version of the record available.

Published April 26, 2016 | Version v1.0.0-M3
Software Open

mg7: mg7 v1.0.0-M3

  • 1. Era7 Bioinformatics
  • 2. oh no sequences! - Era7 bioinformatics
  • 3. ohnosequences!, Era7 Bioinformatics
  • 4. Gitter

Description

This is the third milestone on the way to the v1.0 release. Main purpose of this release was to introduce some essential features and check consistency of the pipeline (primarily for Illumina data). Here are the main changes since the v1.0-M2 release:

  • New features:

    • #9, #11: Added general stats for the standard pipeline (with Flash) (stats and summary steps)
    • #20: Added counts frequency tables
    • #35: Added taxonomic lineage in the results
    • #47: Changed to the fair lowest common ancestor (LCA) algorithm without the most-specific node modification
    • #56: Interchanged the order of the assign and merge steps to parallelize assignment (per BLAST chunk)
    • BLAST related improvements:
      • #46: Reads with no BLAST hits are accumulated per each sample and merged in FASTA files
      • #44: BLAST command itself and its parameters are completely configurable (so you can use not only blastn)
      • #50: Added a check for the set of minimal necessary BLAST output fields
      • #36, #51, #61: Reviewed and refined default BLAST parameters (for Illumina) and output format
      • #38, #49, #57, #58: Added configurable filter for the BLAST results (with default qcovs == 100%), plus an mandatory filter to keep only the top BLAST hits
  • Other improvements:

  • Important fixes:

    • #48: Configured CSV writer to use unix line-endings
    • #62: Fixed inconsistency in the accumulated counting algorithm
    • #52, #55: Fixes of the bugs introduced during M3 development

See the full list of pull requests merged in this release in the v1.0-M3 milestone.

Files

mg7-v1.0.0-M3.zip

Files (6.7 MB)

Name Size Download all
md5:a85b4208ce77f013be8b04f81d8f4aca
6.7 MB Preview Download

Additional details

Related works