Published April 22, 2025
| Version v2
Dataset
Open
HDMA TF-MoDISco motifs per cluster
Creators
-
Liu, Betty B.1
-
Jessa, Selin1
-
Kim, Samuel H.1
-
Ng, Yan Ting2
- Higashino, Soon Il1
- Marinov, Georgi K.1
- Chen, Derek C.1
- Parks, Benjamin E.1
- Li, Li1
- Nguyen, Tri C.1
- Wang, Austin T.1
- Wang, Sean K.1
- Tan, Serena Y.1
- Kosicki, Michael3
- Pennacchio, Len A.3
- Ben-David, Eyal2
- Pasca, Anca M.1
- Kundaje, Anshul1
- Farh, Kyle K.H.2
- Greenleaf, William J.1
- 1. Stanford University
- 2. Illumina AI Lab
- 3. Lawrence Berkeley National Laboratory
Description
Per-cell-type motif discovery using TF-MoDISco for HDMA (Liu*, Jessa*, Kim*, Ng* et al, bioRxiv 2025). For the 189 cell types where high-quality models were trained, we provide the TF-MoDISco output, including h5 files per cell type containing CWMs, and HTML reports. These represent motifs learned through interpretation of the models with respect to counts. The file merged_modisco_patterns_map.tsv indicates which cell type patterns were clustered and aggregated to form compendium motifs. A detailed description of the main data types deposited on Zenodo can be found here.
Files
Files
(13.7 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:b612f1019ca5add9478d62507fc41d4a
|
13.7 GB | Download |
|
md5:f277e3a31acde932971bc5165d67a358
|
672.9 kB | Download |
Additional details
Related works
- Is supplement to
- Preprint: 10.1101/2025.04.30.651381 (DOI)