Published December 7, 2017
| Version v1.19.0
Software
Open
broadinstitute/viral-ngs: v1.19.0
Creators
- 1. Broad Institute
- 2. Broad Institute of MIT and Harvard
- 3. MIT
- 4. @broadinstitute
- 5. DNAnexus
- 6. Aalborg University Hospital
Description
This is a release with many changes, including new WDL pipelines, a distribution of viral-ngs on DNAnexus that will be updated in sync with the latest version of viral-ngs, the ability to provide multiple references for scaffolding, and several critical bug fixes. With this release, the Docker image for viral-ngs moves from Docker Hub to quay.io/broadinstitute/viral-ngs.
New:
- WDL (more info) pipelines have been added, inspired by the previous DNAnexus implementation of viral-ngs. The WDL files currently reside within the
pipes/WDL/directory of viral-ngs. The pipelines can be executed locally or in the Google cloud viacromwell(on bioconda), or via the public distribution available on DNAnexus.- WDL workflows are tested locally on Travis via Cromwell
- WDL workflows are compiled for DNAnexus via dxWDL, and tested on DNAnexus
- a simple form of reference selection via
assembly.py::order_and_orient. Scaffolding is now performed using several references (in parallel); the one that yields the most non-N bases is chosen to be used for the scaffolded genome. For the positional argument,inReference, multiple FASTA files may now be provided, each containing one reference genome. Alternatively, multiple references may be given by specifying a single filename, and giving the number of reference segments with the--nGenomeSegmentsparameter. If multiple references are given, they must all contain the same number of segments listed in the same order.- This has been included in the new WDL pipelines
- New kraken execution strategy to process multiple inputs in one run
taxon_filter.pychanges todeplete_bmtagger_bamanddeplete_blastn_bam: can now accept blast/bmtagger databases as.tar.gz,.tar.lz4,.tar.bz2bundles and also as unindexed fasta files (that will be indexed on the fly)- new internal function
util.file.extract_tarballexposed on the CLI asread_utils.py::extract_tarball. Accepts stdin piped input.
Changed:
- various and extensive changes to how the viral-ngs Docker image is prepared and distributed:
- Note: The Docker image is now available from quay.io/broadinstitute/viral-ngs, which is faster for staging than Docker Hub
- the Docker image build process no longer relies on the
easy-deploy-viral-ngs.shscript
--threadsargparse option now common and available across viral-ngs commands- optimizations in
illumina.py::illumina_demux illumina.py::common_barcodesexecution time has been reduced- in
easy-deploy-viral-ngs.sh, some messages have been moved fromstdouttostderr taxon_filter.py: clean up and optimization aroundblastn-based read depletion- various development-related changes including:
- travis cleanup re: pip package installs, conditionals, build matrix
- Docker deployment bugfixes
Fixed:
- prevent
reports.py::plot_coveragefrom removing the bam file provided as input if it is already sorted and dupe removal is being not performed. In such cases the input bam is used directly and is now preserved. diamondtests for accession taxonomy fixed: subprocess.PIPE replaced with named pipes to prevent deadlockstaxon_filter.py::bmtagger_build_dbdefault value for word_size is now18, not8- fixes the use of fasta databases for
taxon_filter.py::deplete_bmtagger_bamanddeplete_human
Added/Upgraded:
- pysam
0.12.0.1->0.13.0 - samtools
1.5->1.6 - kraken
0.10.6_fork3->1.0.0_fork3 - lz4-bin
131added as requirement - pigz
2.3.4added as requirement - lbzip2
2.5added as requirement
Files
broadinstitute/viral-ngs-v1.19.0.zip
Files
(7.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:9491f4f34cc61a5cf59d2a084e03e919
|
7.5 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/broadinstitute/viral-ngs/tree/v1.19.0 (URL)