There is a newer version of the record available.

Published September 28, 2022 | Version v1
Dataset Open

Multi-scale footprinting

Description

Data associated with the multi-scale footprinting project.

(1) Tn5_NN_model.h5

Pre-trained CNN-based Tn5 bias model implemented with Keras. Takes local DNA sequence context as input and predicts Tn5 insertion bias. See tutorial for how to use this model.

(2) Tn5ModelTutorial.ipynb, Tn5ModelTutorial.html

Tutorial showing how to use the pre-trained Tn5 bias model to score input sequences.

(3) hg38Tn5Bias.tar.gz, mm10Tn5Bias.tar.gz, panTro6Tn5Bias.tar.gz, sacCer3Tn5Bias.tar.gz, dm6Tn5Bias.tar.gz, danRer11Tn5Bias.tar.gz, ce11Tn5Bias.tar.gz

h5 files containing the genome-wide Tn5 bias pre-computed using our convolutional neural net model.

(4) dispModel.tar.gz

Zipped folder containing Tn5 cutting dispersion models for each footprint window radius. The footprint window size in our paper refers to the diameter the footprint window, which is twice the number listed here. During footprinting, these models are loaded into the footprintingProject object and then used for footprinting.

(5) cisBP_mouse_pwms_2021.rds, cisBP_human_pwms_2021.rds

Motif PWMs used in our study.

(6) TFBS_model.h5, TFBS_model_cluster_I.h5

Pre-trained TF binding prediction models. The models takes local multi-scale footprints as input and predict whether a genomic position is bound by a TF if the corresponding motif is present.

TFBS_model.h5 is the "TF habitation model" used in our study. It was trained using data of TFs from all TF clusters.

TFBS_model_cluster_I.h5 was instead only trained on cluster 1 TFs (the TFs that leave the strongest footprints) and is in general not applicable to other TFs.

(7) clusterLabels.txt, clusterLabelsAllTFs.txt

Cluster labels of TFs. clusterLabels.txt is the clustering result directly obtained from clustering multi-scale footprints of all TFs with ChIP data. clusterLabelsAllTFs.txt includes other TFs without ChIP data. The cluster membership of these TFs were assigned based on motif homology among TFs.

Files

Tn5ModelTutorial.ipynb

Files (39.0 GB)

Name Size Download all
md5:10d8d17f94f695c06c0f66968f67b55b
366.5 MB Download
md5:cc6a20fa4096b52b337b29809ce38bfa
231.6 kB Download
md5:2440737c23bfda359aa6b9c4e4cd02c7
162.9 kB Download
md5:8d4fe94ccbde141f6edefc1f0ce36c10
6.1 GB Download
md5:4df576eea4313d727b57ce361fd5a299
681.4 kB Download
md5:7f256a41b7232bd5c3389b0e190d9788
515.1 MB Download
md5:89f205e6be682b15f87a2c2cc00e8cbd
11.2 GB Download
md5:901b928946b65e7bfba3a93e085f19f0
9.8 GB Download
md5:ba208a4cdc2e1fc09d66cac44e85e001
11.0 GB Download
md5:ed811aabe1ffa4bdb1520d4b25ee9289
44.7 MB Download
md5:ec69fb1dc3269ca3eba27e412c4fef6b
994.7 kB Download
md5:9534a674ed894f71d65f5cab43a36414
525.1 kB Download
md5:e425f021e775785ba57daf28190003e1
333.6 kB Download
md5:a21e2859fdfc3cba98770c1952e4845b
93.3 kB Preview Download