Published August 18, 2025 | Version v3.2.2
Dataset Open

Virus Pathogen Database for EsViritu

  • 1. Baylor College of Medicine

Description

Databases for EsViritu: Read mapping pipeline for detection and measurement of virus pathogens from metagenomic data

 

File descriptions

v3.2.2/

  • virus_pathogen_database.all_metadata.tsv = sequence and taxonomy info for each sequence in database

  • virus_pathogen_database.fna = fasta formatted nucleotide database of all virus genomes and segments

  • virus_pathogen_database.mmi = minimap2 formatted index of database (for short read mapping)

 

Relates to the tool in the following repository:

EsViritu

Updates since 3.1.1:

  • addition of over 1,000 assemblies of human/animal/plant viruses that were previously not included for various reasons
  • removal of some endogenous virus genomes to avoid mislabeling human/animal DNA
  • Masking of a few virus genomes with low-complexity repeat regions

Files

Files (429.2 MB)

Name Size Download all
md5:fb6850259b82d39dab104f4b9145f04d
429.2 MB Download

Additional details

Software

Repository URL
https://github.com/cmmr/EsViritu
Programming language
Python