Published August 4, 2018
| Version v1.6
Software
Open
ewels/MultiQC: MultiQC Version 1.6
Authors/Creators
- Phil Ewels1
- Vlad Saveliev2
- Markus J. Ankenbrand3
- Tim Booth4
- Chris van Run5
- Aled Jones
- Tobias Neumann6
- Måns Magnusson7
- alexanderscholz
- Remi-Andre Olsen
- Fredrik Boulund8
- Robin Andeer9
- Sacha Laurent
- Guillermo Carrasco10
- Alexander Peltzer11
- Winni Kretzschmar
- Francesco
- Xin He
- Lorena Pantano12
- Matthias De Smet13
- Brad Chapman14
- Nicolas Servant15
- Joachim Wolff16
- Devon Ryan17
- Julian Gehring18
- chuan-wang19
- Senthilkumar Panneerselvam20
- Tor Solli-Nowlan21
- Maxime Garcia22
- Heather L. Wiencko
- 1. Science for Life Laboratory
- 2. @UMCCR
- 3. University of Würzburg
- 4. Edinburgh Genomics
- 5. Princess Maxima Centre for paediatric oncology
- 6. @IMPIMBA @ZuberLab @ObenaufLab
- 7. Science for Life Labs
- 8. Karolinska Institute
- 9. @Clinical-Genomics
- 10. @iZettle
- 11. @qbicsoftware
- 12. Harvard Chan School of Public Health
- 13. @CenterForMedicalGeneticsGhent
- 14. Harvard Chan Bioinformatics Core
- 15. Institut Curie
- 16. @BackofenLab Albert-Ludwigs-University Freiburg im Breisgau
- 17. Max Planck Institute of Immunobiology and Epigenetics
- 18. Illumina
- 19. NGI, SciLifeLab
- 20. @NationalGenomicsInfrastructure @SciLifeLab
- 21. Oslo Universityssykehus
- 22. @SciLifeLab | Karolinska Institutet
Description
Some of these updates are thanks to the efforts of people who attended the NASPM 2018 MultiQC hackathon session. Thanks to everyone who attended!
New Modules:- fastp
- An ultra-fast all-in-one FASTQ preprocessor (QC, adapters, trimming, filtering, splitting...)
- Module started by @florianduclot and completed by @ewels
- hap.py
- Hap.py is a set of programs based on htslib to benchmark variant calls against gold standard truth datasets
- Module written by @tsnowlan
- Long Ranger
- Works with data from the 10X Genomics Chromium. Performs sample demultiplexing, barcode processing, alignment, quality control, variant calling, phasing, and structural variant calling.
- Module written by @remiolsen
- miRTrace
- A quality control software for small RNA sequencing data.
- Module written by @chuan-wang
- BCFtools
- New plot showing SNP statistics versus quality of call from bcftools stats (@MaxUlysse and @Rotholandus)
- BBMap
- Support added for BBDuk kmer-based adapter/contaminant filtering summary stats (@boulund
- FastQC
- New read count plot, split into unique and duplicate reads if possible.
- Help text added for all sections, mostly copied from the excellent FastQC help.
- Sequence duplication plot rescaled
- FastQ Screen
- Samples in large-sample-number plot are now sorted alphabetically (@hassanfa
- MACS2
- Output is now more tolerant of missing data (no plot if no data)
- Peddy
- Picard
- New submodule to handle
ValidateSamFilereports (@cpavanrun) - WGSMetrics now add the mean and standard-deviation coverage to the general stats table (hidden) (@cpavanrun)
- New submodule to handle
- Preseq
- New config option to plot preseq plots with unique old coverage on the y axis instead of read count
- Code refactoring by @vladsaveliev
- QUAST
- Null values (
-) in reports now handled properly. Bargraphs always shown despite varying thresholds. (@vladsaveliev)
- Null values (
- RNA-SeQC
- Don't create the report section for Gene Body Coverage if no data is given
- Samtools
- Fixed edge case bug where MultiQC could crash if a sample had zero count coverage with idxstats.
- Adds % proper pairs to general stats table
- Skewer
- Read length plot rescaled
- Tophat
- Fixed bug where some samples could be given a blank sample name (@lparsons)
- VerifyBamID
- Change column header help text for contamination to match percentage output (@chapmanb)
- New config option
remove_sectionsto skip specific report sections from modules - Add
path_filters_excludeto exclude certain files when running modules multiple times. You could previously only include certain files. - New
exclude_*keys for file search patterns- Have a subset of patterns to exclude otherwise detected files with, by filename or contents
- Command line options all now use mid-word hyphens (not a mix of hyphens and underscores)
- Old underscore terms still maintained for backwards compatibility
- Flag
--view-tagsnow works without requiring an "analysis directory". - Removed Python dependency for
enum34(@boulund) - Columns can be added to
General Statstable for custom content/module. - New
--ignore-symlinksflag which will ignore symlinked directories and files. - New
--no-megaqc-uploadflag which disables automatically uploading data to MegaQC
- Fix path_filters for top_modules/module_order configuration only selecting if all globs match. It now filters searches that match any glob.
- Empty sample names from cleaning are now no longer allowed
- Stop prepend_dirs set in the config from getting clobbered by an unpassed CLI option (@tsnowlan)
- Modules running multiple times now have multiple sets of columns in the General Statistics table again, instead of overwriting one another.
- Prevent tables from clobbering sorted row orders.
- Fix linegraph and scatter plots data conversion (sporadically the incorrect
ymaxwas used to drop data points) (@cpavanrun) - Adjusted behavior of ceiling and floor axis limits
- Adjusted multiple file search patterns to make them more specific
- Prevents the wrong module from accidentally slurping up output from a different tool. By @cpavanrun (see PR #727)
- Fixed broken report bar plots when
-p/--export-plotswas specified (see issue #801)
Files
ewels/MultiQC-v1.6.zip
Files
(2.0 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:8a4ec26dba9443366d83f3f23108e451
|
2.0 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/ewels/MultiQC/tree/v1.6 (URL)