AnnoTEP-DB (Annotation of Transposable Elements in Plants Database - Frozen Release, v1.0): A Precomputed Repository for Lineage-Level Plant Transposable Elements Annotation
Description
AnnoTEP-DB (Frozen Release, v1.0): A Precomputed Repository for Lineage-Level Plant Transposable Elements Annotation
This Zenodo record provides a frozen, versioned snapshot of the AnnoTEP-DB resource — a comprehensive database of precomputed transposable element (TE) annotations and lineage-level classifications originally spanning 75 plant and algal genomes.
The full database (≈650 GB) exceeds Zenodo's 50 GB per-record storage limit. Therefore, this deposit includes a curated subset of representative genome-level archives, selected from the full collection to maximize phylogenetic diversity and ensure reproducibility of the main AnnoTEP-DB features.
The database was generated using the EDTA-GUI pipeline integrated with REXdb and RepeatModeler2, enabling lineage-aware TE classification, visualization, and comparative exploration across major plant clades.
To ensure long-term accessibility and reproducibility, the resource was restructured into two components:
-
AnnoTEP-DB-Interface.tar.gz — the complete interactive web interface (HTML, PHP, JavaScript, DataTables) required to deploy or browse AnnoTEP-DB locally;
-
Selected genome-level archives (.7z) — each containing the TE annotation outputs for a specific species, including FASTA and GFF3 files, summary tables, repeat landscapes, and TE lineage trees.
Each .7z genome archive (e.g., A_thaliana.7z, T_grandiflorum174.7z, Z_mays.7z) represents a complete standalone dataset comprising:
-
consensus TE libraries and genomic coordinates;
-
classification statistics by order, superfamily, and lineage;
-
repeat landscapes and insertion age profiles;
-
auxiliary visualization files and EDTA-GUI summary reports.
This frozen snapshot preserves the interface and the selected genome datasets in a format that remains fully accessible even without continued hosting.
Repository structure
-
AnnoTEP-DB-Interface.tar.gz — frozen interactive interface
-
Selected genome archives in .7z format (subset of the original 75 genomes)
Total compressed size of this deposit: ≤ 50 GB (Zenodo limit)
Original full database size: ≈ 650 GB
Annotation pipeline: EDTA-GUI v2.1
File formats: .tar.gz, .7z
License: Creative Commons Attribution 4.0 (CC BY 4.0)
DOI of this release: 10.5281/zenodo.17753532
Reconstruction of the split Z_mays archive
The Z_mays dataset exceeded Zenodo’s file-size constraints and was therefore uploaded in multiple parts.
To reconstruct the original archive, download all parts into the same folder and run:
cat Z_mays.7z.part_* > Z_mays.7z7z x Z_mays.7z
After extraction, the genome files can be placed in the directory/src/assets/pages
to enable full functionality of the AnnoTEP-DB interface.
Files
Files
(49.5 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:e36d8bb21f64c0450782961720deb483
|
1.4 GB | Download |
|
md5:f05a9556619982dcf6541fd037c8f230
|
2.6 GB | Download |
|
md5:b87f914b14f1b275f9ac0b523457881d
|
380.0 MB | Download |
|
md5:15e3d62668149726c28f8cdbc516be58
|
142.7 MB | Download |
|
md5:dc7b2934bb9e8978e955a5646ecf26d6
|
281.5 MB | Download |
|
md5:a5277f48ad75ad9278da31c7359a0413
|
700.5 MB | Download |
|
md5:e64105359763ad7f20752603a18c1d66
|
1.1 GB | Download |
|
md5:5c902a19ac61d07ff994984286d386bc
|
116.7 MB | Download |
|
md5:9d2e6fbdada74361c77db17558f2c268
|
1.8 GB | Download |
|
md5:a278630dbc98043de80d5a9d846f0ae6
|
2.2 GB | Download |
|
md5:d58fa0432db9c9788b28f53fa304e1f2
|
2.2 GB | Download |
|
md5:3d09af33f164bf2a3e806390b2853b32
|
3.5 GB | Download |
|
md5:a3c1025e82ebe5a29161a42a634b2e35
|
1.9 GB | Download |
|
md5:feaef02420ea5d228058ddca4e8432e1
|
336.2 MB | Download |
|
md5:459e25d1f20d971503de476aa35b0b6e
|
3.4 GB | Download |
|
md5:77071ab01c0a1b5df96075bd02686bc7
|
1.3 GB | Download |
|
md5:17ca3dee5cda44dbe3b3da314f942be0
|
1.5 GB | Download |
|
md5:f983c384812a977414c2b34c8ca7b598
|
521.5 MB | Download |
|
md5:fde7dc157ac76ac4056c622d78b4a91c
|
1.1 GB | Download |
|
md5:e5118203472065abf6b24b455c9868ec
|
1.5 GB | Download |
|
md5:aa1d16c25975b224177f749daaf91f19
|
3.6 GB | Download |
|
md5:dc17e9e24a503e93c8956bb7b2be95fb
|
543.6 MB | Download |
|
md5:867858ed771c228abfe753370b910ae1
|
867.0 MB | Download |
|
md5:55391f07d866b536cb3868a1a3209d9c
|
156.9 MB | Download |
|
md5:1b10a784279aaf21f3de9378e8b1c9c7
|
1.1 GB | Download |
|
md5:ddda627334ea8ce982cab231b2571ce7
|
1.5 GB | Download |
|
md5:70fe6a992fb45336c40a66bf49f751ed
|
154.1 MB | Download |
|
md5:4ec78625cb4ef41f9ba72f16538b363d
|
1.5 GB | Download |
|
md5:4da3e9f7de0ac5e57b5600980b9da990
|
1.0 GB | Download |
|
md5:d2c568313d257e342dcb75c87c5a0033
|
3.9 GB | Download |
|
md5:4f5014e1a488d12a070750c864bad99c
|
3.9 GB | Download |
|
md5:50339142f1bd1968b9e6d4d6b26d4ca8
|
3.4 GB | Download |
Additional details
Dates
- Available
-
2025database
Software
- Repository URL
- https://plantgenomics.ncc.unesp.br/AnnoTEP-DB/
- Programming language
- Shell , Perl , Python
- Development Status
- Active