Published November 28, 2025 | Version v1
Dataset Open

AnnoTEP-DB (Annotation of Transposable Elements in Plants Database - Frozen Release, v1.0): A Precomputed Repository for Lineage-Level Plant Transposable Elements Annotation

Description

This Zenodo record provides a frozen, versioned snapshot of the AnnoTEP-DB resource — a comprehensive database of precomputed transposable element (TE) annotations and lineage-level classifications originally spanning 75 plant and algal genomes.

The full database (≈650 GB) exceeds Zenodo's 50 GB per-record storage limit. Therefore, this deposit includes a curated subset of representative genome-level archives, selected from the full collection to maximize phylogenetic diversity and ensure reproducibility of the main AnnoTEP-DB features.

The database was generated using the EDTA-GUI pipeline integrated with REXdb and RepeatModeler2, enabling lineage-aware TE classification, visualization, and comparative exploration across major plant clades.

To ensure long-term accessibility and reproducibility, the resource was restructured into two components:

  • AnnoTEP-DB-Interface.tar.gz — the complete interactive web interface (HTML, PHP, JavaScript, DataTables) required to deploy or browse AnnoTEP-DB locally;

  • Selected genome-level archives (.7z) — each containing the TE annotation outputs for a specific species, including FASTA and GFF3 files, summary tables, repeat landscapes, and TE lineage trees.

Each .7z genome archive (e.g., A_thaliana.7z, T_grandiflorum174.7z, Z_mays.7z) represents a complete standalone dataset comprising:

  • consensus TE libraries and genomic coordinates;

  • classification statistics by order, superfamily, and lineage;

  • repeat landscapes and insertion age profiles;

  • auxiliary visualization files and EDTA-GUI summary reports.

This frozen snapshot preserves the interface and the selected genome datasets in a format that remains fully accessible even without continued hosting.
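As a minimal sketch of how the two components might be unpacked on a local machine, the functions below wrap `tar` and `7z`. The function names and the destination folders are hypothetical and chosen only for illustration; the archive names come from this record.

```shell
# Sketch: unpack the interface tarball and a single genome archive.
# Function names and destination folders are illustrative assumptions.

unpack_interface() (
    tarball=$1              # e.g. AnnoTEP-DB-Interface.tar.gz
    dest=${2:-.}            # destination folder (created if missing)
    mkdir -p "$dest" && tar -xzf "$tarball" -C "$dest"
)

unpack_genome() (
    archive=$1              # e.g. A_thaliana.7z
    dest=${2:-.}
    # 'x' extracts with full paths; -o sets the output folder
    # (7z expects no space between -o and the path).
    7z x "$archive" -o"$dest"
)
```

For example, `unpack_interface AnnoTEP-DB-Interface.tar.gz annotep` followed by `unpack_genome A_thaliana.7z genomes` would place the interface and one genome dataset in two separate folders.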

Repository structure

  • AnnoTEP-DB-Interface.tar.gz — frozen interactive interface

  • Selected genome archives in .7z format (subset of the original 75 genomes)

Total compressed size of this deposit: 49.5 GB (within Zenodo's 50 GB per-record limit)
Original full database size: ≈ 650 GB
Annotation pipeline: EDTA-GUI v2.1
File formats: .tar.gz, .7z
License: Creative Commons Attribution 4.0 (CC BY 4.0)
DOI of this release: 10.5281/zenodo.17753532

Reconstruction of the split Z_mays archive

The Z_mays dataset exceeded Zenodo’s file-size constraints and was therefore uploaded in multiple parts.
To reconstruct the original archive, download all parts into the same folder and run:

cat Z_mays.7z.part_* > Z_mays.7z
7z x Z_mays.7z

After extraction, the genome files can be placed in the directory /src/assets/pages to enable full functionality of the AnnoTEP-DB interface.
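The reconstruction and placement steps above can be sketched as two small shell functions. This is an illustration under stated assumptions, not the maintainers' script: the function names are hypothetical, the parts are assumed to sort correctly in glob (lexical) order, and /src/assets/pages is assumed to be relative to the folder where the interface was unpacked.

```shell
# Sketch: rebuild a split archive and copy an extracted genome folder
# into the interface. Function names are illustrative assumptions.

join_parts() (
    name=$1                 # e.g. Z_mays
    dir=${2:-.}             # folder holding the downloaded parts
    # Load the matching parts into the positional parameters.
    # ASSUMPTION: lexical glob order equals the original split order.
    set -- "$dir/$name".7z.part_*
    # With no match the pattern stays literal, so check the first entry.
    if [ ! -e "$1" ]; then
        echo "no parts found for $name in $dir" >&2
        return 1
    fi
    cat "$@" > "$dir/$name.7z"
)

place_genome() (
    extracted=$1            # folder produced by '7z x' (assumed layout)
    interface_root=$2       # ASSUMPTION: where the interface was unpacked
    pages="$interface_root/src/assets/pages"
    mkdir -p "$pages" && cp -R "$extracted" "$pages/"
)
```

A typical sequence would be `join_parts Z_mays`, then `7z x Z_mays.7z`, then `place_genome Z_mays annotep`, where `annotep` stands for whichever folder holds the unpacked interface.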

Files (49.5 GB)

Checksum                                Size
md5:e36d8bb21f64c0450782961720deb483    1.4 GB
md5:f05a9556619982dcf6541fd037c8f230    2.6 GB
md5:b87f914b14f1b275f9ac0b523457881d    380.0 MB
md5:15e3d62668149726c28f8cdbc516be58    142.7 MB
md5:dc7b2934bb9e8978e955a5646ecf26d6    281.5 MB
md5:a5277f48ad75ad9278da31c7359a0413    700.5 MB
md5:e64105359763ad7f20752603a18c1d66    1.1 GB
md5:5c902a19ac61d07ff994984286d386bc    116.7 MB
md5:9d2e6fbdada74361c77db17558f2c268    1.8 GB
md5:a278630dbc98043de80d5a9d846f0ae6    2.2 GB
md5:d58fa0432db9c9788b28f53fa304e1f2    2.2 GB
md5:3d09af33f164bf2a3e806390b2853b32    3.5 GB
md5:a3c1025e82ebe5a29161a42a634b2e35    1.9 GB
md5:feaef02420ea5d228058ddca4e8432e1    336.2 MB
md5:459e25d1f20d971503de476aa35b0b6e    3.4 GB
md5:77071ab01c0a1b5df96075bd02686bc7    1.3 GB
md5:17ca3dee5cda44dbe3b3da314f942be0    1.5 GB
md5:f983c384812a977414c2b34c8ca7b598    521.5 MB
md5:fde7dc157ac76ac4056c622d78b4a91c    1.1 GB
md5:e5118203472065abf6b24b455c9868ec    1.5 GB
md5:aa1d16c25975b224177f749daaf91f19    3.6 GB
md5:dc17e9e24a503e93c8956bb7b2be95fb    543.6 MB
md5:867858ed771c228abfe753370b910ae1    867.0 MB
md5:55391f07d866b536cb3868a1a3209d9c    156.9 MB
md5:1b10a784279aaf21f3de9378e8b1c9c7    1.1 GB
md5:ddda627334ea8ce982cab231b2571ce7    1.5 GB
md5:70fe6a992fb45336c40a66bf49f751ed    154.1 MB
md5:4ec78625cb4ef41f9ba72f16538b363d    1.5 GB
md5:4da3e9f7de0ac5e57b5600980b9da990    1.0 GB
md5:d2c568313d257e342dcb75c87c5a0033    3.9 GB
md5:4f5014e1a488d12a070750c864bad99c    3.9 GB
md5:50339142f1bd1968b9e6d4d6b26d4ca8    3.4 GB
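
The MD5 checksums listed above can be used to confirm that a download completed intact. The helper below is a minimal sketch that assumes GNU `md5sum` is available (on macOS, `md5 -q` would be the nearest equivalent); the function name is hypothetical, and the listing here does not show which checksum belongs to which file, so the pairing must be taken from the record page itself.

```shell
# Sketch: compare a downloaded file against a checksum from the record.
# Assumes GNU coreutils 'md5sum'; the function name is illustrative.

verify_md5() (
    file=$1
    expected=${2#md5:}      # accept the hash with or without the "md5:" prefix
    # md5sum prints "<hash>  <file>"; keep only the hash field.
    actual=$(md5sum "$file" | cut -d ' ' -f 1)
    [ "$actual" = "$expected" ]
)
```

Usage would look like `verify_md5 A_thaliana.7z md5:<hash listed for that file> && echo OK`, with the placeholder replaced by the checksum shown next to that file on the record page.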

Additional details

Dates

Available: 2025 (database)

Software

Repository URL: https://plantgenomics.ncc.unesp.br/AnnoTEP-DB/
Programming language: Shell, Perl, Python
Development Status: Active