Multi-scale footprinting
Creators
- Hu, Yan1
- Horlbeck, Max1
- Zhang, Ruochi1, 2
- Ma, Sai1
- Kartha, Vinay1
- Shrestha, Rojesh1
- Duarte, Fabiana1
- Labade, Ajay1
- Hock, Conrad1
- Kletzien, Heidi1
- Savage, Rachel1
- Earl, Andrew1
- Meliki, Alia1
- Castillo, Andrew1
- Durand, Neva2
- Tay, Tristan1
- Mattei, Eugenio2
- Anderson, Lauren2
- Shoresh, Noam2
- Wagers, Amy1
- Buenrostro, Jason1
Description
Data associated with the multi-scale footprinting project.
(1) Tn5_NN_model.h5
Pre-trained CNN-based Tn5 bias model implemented with Keras. Takes local DNA sequence context as input and predicts Tn5 insertion bias. See tutorial for how to use this model.
(2) Tn5ModelTutorial.ipynb, Tn5ModelTutorial.html
Tutorial showing how to use the pre-trained Tn5 bias model to score input sequences.
(3) hg38Tn5Bias.tar.gz, mm10Tn5Bias.tar.gz, panTro6Tn5Bias.tar.gz, sacCer3Tn5Bias.tar.gz, dm6Tn5Bias.tar.gz, danRer11Tn5Bias.tar.gz, ce11Tn5Bias.tar.gz
h5 files containing the genome-wide Tn5 bias pre-computed using our convolutional neural net model.
(4) dispModel.tar.gz
Zipped folder containing Tn5 cutting dispersion models for each footprint window radius. The footprint window size in our paper refers to the diameter the footprint window, which is twice the number listed here. During footprinting, these models are loaded into the footprintingProject object and then used for footprinting.
(5) cisBP_mouse_pwms_2021.rds, cisBP_human_pwms_2021.rds
Motif PWMs used in our study.
(6) TFBS_model.h5, TFBS_model_cluster_I.h5
Pre-trained TF binding prediction models. The models takes local multi-scale footprints as input and predict whether a genomic position is bound by a TF if the corresponding motif is present.
TFBS_model.h5 is the "TF habitation model" used in our study. It was trained using data of TFs from all TF clusters.
TFBS_model_cluster_I.h5 was instead only trained on cluster 1 TFs (the TFs that leave the strongest footprints) and is in general not applicable to other TFs.
(7) clusterLabels.txt, clusterLabelsAllTFs.txt
Cluster labels of TFs. clusterLabels.txt is the clustering result directly obtained from clustering multi-scale footprints of all TFs with ChIP data. clusterLabelsAllTFs.txt includes other TFs without ChIP data. The cluster membership of these TFs were assigned based on motif homology among TFs.
Files
Tn5ModelTutorial.ipynb
Files
(39.0 GB)
Name | Size | Download all |
---|---|---|
md5:10d8d17f94f695c06c0f66968f67b55b
|
366.5 MB | Download |
md5:cc6a20fa4096b52b337b29809ce38bfa
|
231.6 kB | Download |
md5:2440737c23bfda359aa6b9c4e4cd02c7
|
162.9 kB | Download |
md5:8d4fe94ccbde141f6edefc1f0ce36c10
|
6.1 GB | Download |
md5:4df576eea4313d727b57ce361fd5a299
|
681.4 kB | Download |
md5:7f256a41b7232bd5c3389b0e190d9788
|
515.1 MB | Download |
md5:89f205e6be682b15f87a2c2cc00e8cbd
|
11.2 GB | Download |
md5:901b928946b65e7bfba3a93e085f19f0
|
9.8 GB | Download |
md5:ba208a4cdc2e1fc09d66cac44e85e001
|
11.0 GB | Download |
md5:ed811aabe1ffa4bdb1520d4b25ee9289
|
44.7 MB | Download |
md5:ec69fb1dc3269ca3eba27e412c4fef6b
|
994.7 kB | Download |
md5:9534a674ed894f71d65f5cab43a36414
|
525.1 kB | Download |
md5:e425f021e775785ba57daf28190003e1
|
333.6 kB | Download |
md5:a21e2859fdfc3cba98770c1952e4845b
|
93.3 kB | Preview Download |