Published September 29, 2025
| Version v1
Dataset
Open
Datasets used in "Harnessing DNA Foundation Models for Cross-Species TFBS Prediction in Plant Genomes"
Authors/Creators
Description
This dataset accompanies the paper "Harnessing DNA Foundation Models for Cross-Species Transcription Factor Binding Site Prediction in Plant Genomes".
It contains:
-
Genome FASTA files of Arabidopsis thaliana and Sisymbrium irio in the
fastas/folder. -
DAP-seq peak files in the
peak_files/folder, obtained from:-
Malley2016: O'Malley et al., 2016 — https://pubmed.ncbi.nlm.nih.gov/27203113/
-
Sun2020: Sun et al., 2022 — https://pubmed.ncbi.nlm.nih.gov/35501452/
-
Files
data.zip
Files
(110.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:1ecc1cecb15f4a60a0644f3db887122b
|
110.1 MB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/Maryam-Haghani/TFBS
- Programming language
- Python
- Development Status
- Active