There is a newer version of this record available.

Dataset Open Access

Virtual ChIP-seq predictions of binding of 34 transcription factor in Roadmap Epigenomics Project tissues

Mehran Karimzadeh; Michael M. Hoffman

This dataset contains predictions of Virtual ChIP-seq for binding of 34 transcription factors in Roadmap Epigenomics dataset tissues with matched DNase-seq and RNA-seq data.

Tarball contains subfolders for each of the 34 TFs where Virtual ChIP-seq median MCC in validation cell types was > 0.3.

Each subfolder contains gzipped BED files. Each file is named as <Tissue>_<Age>_<TF>_<Accession>_Predictions.bed.gz. Columns correspond to Chromosome, Start, End, <Tissue>_<Age>_<TF>_<Accession>, Posterior probability

You can use the posterior probabilities provided in Virchip_PosteriorCutoffs.tsv. These are posterior probability cutoffs which maximized MCC in H1-hESC cell type, or are set to 0.4 if there was no ChIP-seq data of that TF in H1-hESC (0.4 is the mode of all optimal posterior probability cutoffs in H1-hESC).

Files (19.3 GB)
Name Size
Virchip_PosteriorCutoffs.tsv
md5:808d7096489b31de2cac94b4a76426c4
559 Bytes Download
virchipPredictions_V1.0.0.tar.gz
md5:61833b1b53954ef32643837b582de7a7
19.3 GB Download
216
53
views
downloads
All versions This version
Views 21633
Downloads 535
Data volume 607.2 GB57.9 GB
Unique views 19433
Unique downloads 333

Share

Cite as