There is a newer version of the record available.

Published July 5, 2023 | Version v1
Dataset Open

Supporting data for "Software pipelines for RNA-Seq, ChIP-Seq and Germline Variant calling analyses in Common Workflow Language (CWL)"

  • 1. Institute of Applied Biosciences (INAB), Centre for Research and Technology Hellas (CERTH)

Description

Datasets produced during the validation of CWL-based pipelines, designed for the analysis of data from RNA-Seq, ChIP-Seq and germline variant calling experiments. Specifically, the workflows were tested using publicly available High-throughput (HTS) data from published studies on Chronic Lymphocytic Leukemia (CLL) (accession numbers: E-MTAB-6962, GSE115772) and Genome in a Bottle (GIAB) project samples (accession numbers: SRR6794144, SRR22476789, SRR22476790, SRR22476791).

The supporting data include:

  • Differential transcript and gene expression results produced during the analysis with the CWL-based RNA-Seq pipeline
  • Bigwig and narrowPeak files, differential binding results, table of consensus peaks and read counts of EZH2 and H3K27me3, produced during the analysis with the CWL-based ChIP-Seq pipeline
  • VCF files containing the detected and filtered variants, along with the respective hap.py () results regarding comparisons against the GIAB golden standard truth sets for both CWL-based germline variant calling pipelines

Files

Supporting_data_for_CWL-based_software_pipelines.zip

Files (4.9 GB)