Published October 10, 2018 | Version 3.0.0
Dataset Open

Virtual ChIP-seq predictions of binding of 36 transcription factor in Roadmap Epigenomics Project tissues

  • 1. University of Toronto

Description

This dataset contains predictions of Virtual ChIP-seq for binding of 36 transcription factors in Roadmap Epigenomics dataset tissues with matched DNase-seq and RNA-seq data.

Tarball contains subfolders for each of the 36 TFs where Virtual ChIP-seq median MCC in validation cell types was > 0.3.

Each subfolder contains gzipped BED files. Each file is named as <Tissue>_<Age>_<TF>_<Accession>_Predictions.bed.gz. Columns correspond to Chromosome, Start, End, <Tissue>_<Age>_<TF>_<Accession>, Posterior probability

You can use the posterior probabilities provided in Virchip_PosteriorCutoffs_V3.0.0.tsv. These are posterior probability cutoffs which maximized MCC in H1-hESC cell type, or are set to 0.4 if there was no ChIP-seq data of that TF in H1-hESC (0.4 is the mode of all optimal posterior probability cutoffs in H1-hESC).

Files

Files (22.3 GB)

Name Size Download all
md5:6bc7fab8382ee6f6f882df03519d1e6f
581 Bytes Download
md5:ac5a901930868d3e707ed66038a70da0
22.3 GB Download