Published February 16, 2021 | Version 1
Dataset Open

Negative instances for detecting LTR-Retrotransposons using Machine Learning

  • 1. Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia
  • 2. Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia
  • 3. Institut de Recherche pour le Développement, CIRAD, Univ. Montpellier, France
  • 4. Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia

Description

This dataset is composed of genomic features other than LTR-Retrotransposons (LTR_RTs), such as coding sequences (CDS), different types of RNA (e.g., mRNA, tRNA, non-coding RNA, among others), and other types of transposable elements that do not belong to LTR-RTs (e.g., TEs Class II, PLEs, DIRs, LINEs, and SINEs) from the same plant species contained in InpactorDB (DOI 10.5281/zenodo.4386316). These additional transposable element sequences were available in databases such as PGSB PlantsDB, Repbase (v. 20.05, 2017), RepetDB, Ensembl Plants, and JGI (Joint Genome Institute).

Files

negative_instances_raw.zip

Files (907.1 MB)

Name Size Download all
md5:a520b2fc2c8417fbc6a6f495fe73e07b
907.1 MB Preview Download