Published December 20, 2023
| Version 1.0.0
Model
Open
Dissection of core promoter syntax through single nucleotide resolution modeling of transcription initiation (CLIPNET models)
Description
Model weights for CLIPNET. Chromosomes in fold 0 were held out for all models. The 9 remaining folds were used to train models such that data fold i was withheld from training for model i (for assignments, see data_fold_assignments.csv). The models assume a 2-hot encoding of DNA (A=[2, 0, 0, 0], C=[0, 2, 0, 0], ..., Y=[0, 1, 0, 1], ...).
Processed data used for model training and to reproduce figures in our CLIPNET paper (preprint here) are archived at DOI 10.5281/zenodo.10597357
Model weights are also available for download via HuggingFace.
Files
data_fold_assignments.csv
Files
(870.8 MB)
Name | Size | Download all |
---|---|---|
md5:2c861a49eaa0db904429dbcd6b2fe191
|
178 Bytes | Preview Download |
md5:ba80a5aeb3a4e8facd6abcf7f2d2a3e4
|
96.8 MB | Download |
md5:246b4df044dc66f1562068ea2bdc4a6c
|
96.8 MB | Download |
md5:b65b5a566e76afe44222d87f8c9e05de
|
96.8 MB | Download |
md5:820bc04c335b7212dd5da00a49bbed12
|
96.8 MB | Download |
md5:4a6c881a8a6a6af600e2f5d65d889095
|
96.8 MB | Download |
md5:cc9fa58200885c00ad86638af15846e2
|
96.8 MB | Download |
md5:a39e79f54490493792d79050239f1555
|
96.8 MB | Download |
md5:bbc83f6fdfa6adafc8044c88a7dbb140
|
96.8 MB | Download |
md5:c55266acec027165cb578c25a0a8cade
|
96.8 MB | Download |