bcbio/bcbio-nextgen: v1.2.0

Brad Chapman; Rory Kirchner; Lorena Pantano; Matthias De Smet; Tetiana Khotiainsteva; Luca Beltrame; Vlad Saveliev; Roman Valls Guimera; Sergey Naumenko; John Kern; Christian Brueffer; Guillermo Carrasco; Mario Giovacchini; Ilya Sytchev; Paul Tang; Miika Ahdesmaki; Sehrish Kanwal; James J Porter; Steffen Möller; Vang Le; Alexandru Coman; Valentine Svensson; bogdang989; Meeta Mistry; Matt Edwards; Jeff Hammerbacher; Brent Pedersen; Peter Cock; apastore; Stephen Turner

doi:10.5281/zenodo.3659234

Published February 7, 2020 | Version v1.2.0

Software Open

bcbio/bcbio-nextgen: v1.2.0

1. Ginkgo Bioworks
2. Harvard Chan School of Public Health
3. @CenterForMedicalGeneticsGhent
4. @UMCCR
5. University of Melbourne
6. Lund University Cancer Center
7. @iZettle
8. Science for Life Laboratory
9. startup
10. AstraZeneca
11. The University of Melbourne
12. Recurse Center
13. University of Rostock
14. Aalborg University
15. @tss-yonder
16. FL60 Inc
17. HSPH
18. @hammerlab
19. University of Utah
20. University of Virginia

Fix for bismark not being a supported aligner.
Run ataqv (https://github.com/ParkerLab/ataqv) to calculate additional ATAQ-seq quality control metrics.
Workaround for some bcbioRNASeq plots failing with many samples when interesting_groups is not set.
Add known_fusions parameter for passing in known fusions to arriba.
Fix for tx2gene not working properly on some GTF files.
Sort MACS2 output with UNIX sort to avoid memory issues.
Run RiP on full peak file for ATAC-seq.
Run ataqv on unfiltered BAM file with the full peak file.
Run peddy on the population variant file, not the individual sample level file if joint calling was done.
Add STAR to MultiQC metrics.
Throw an error if STAR is run on a genome with alts.
Don't run bcbioRNASeq if there is only one sample. Thanks to @kmendler for the suggestion.
Improve arriba sensitivity by setting --peOverlapNbasesMin 10 and --alignSplicedMateMapLminOverLmate 0.5 when running STAR (see https://github.com/suhrig/arriba/issues/41).
Make TPM and counts files from tximport automatically.
Use --keepDuplicates when making the Salmon index. This keeps transcripts that are identical in the index instead of randomly choosing one. This helps when comparing to other ways of quantifying the transcripts, ensuring all of the transcripts are represented.
Remove unnecessary "quant" subdirectory for Salmon runs. This allows MultiQC to properly name the samples.
Ensure STAR log file is propagated to the upload directory.
Fix issue with memory not being specified properly when running bcbio_prepare_samples.py.
Run tximport automatically and store TPM in project/date/tpm and counts in project/date/counts.
Calculate ENCODE quality flags for ATAC-seq. See https://www.encodeproject.org/data-standards/terms/#library for a description of what the metrics mean.
Fix for command line being too long while joint genotyping thousands of samples.
Fix for command line being too long when running the CWL workflow with cromwell.

Files

bcbio/bcbio-nextgen-v1.2.0.zip

Files (17.8 MB)

Name	Size	Download all
bcbio/bcbio-nextgen-v1.2.0.zip md5:dd2fff280cee1dc28a9ea5c90760eed0	17.8 MB	Preview Download

Additional details

Is supplement to: https://github.com/bcbio/bcbio-nextgen/tree/v1.2.0 (URL)

	All versions	This version
Views	7,569	201
Downloads	603	26
Data volume	11.2 GB	462.8 MB

bcbio/bcbio-nextgen: v1.2.0

Authors/Creators

Description

Files

bcbio/bcbio-nextgen-v1.2.0.zip

Files (17.8 MB)

Additional details

Related works