There is a newer version of the record available.

Published January 28, 2021 | Version 1
Dataset Open

Files for publication "Microseek: A Protein-Based Metagenomic Pipeline for Virus Diagnostic and Discovery"

Description

Context

These files correspond to the article “Microseek: A Protein-Based Metagenomic Pipeline for Virus Diagnostic and Discovery” submitted to Genes.

 

File content

  • empty_matrices: 50M-read tissue and plasma matrices, no spike;
  • matrices_spiked_known_viruses: 50M-read tissue and plasma matrices spiked with six known viruses at d1, d10 and d100;
  • matrices_spiked_neo_viruses: 50M-read tissue and plasma matrices spiked with three Neopneumoviruses at d1 and d10;
  • neo_viruses: nucleotide and protein sequences of the three Neopneumoviruses;
  • output_microseek: Microseek outputs, both raw results and results after background filtration.
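The matrices are described as 50M-read FASTQ files, which can be checked after extraction by counting records. The following is a minimal sketch, assuming uncompressed FASTQ with exactly four lines per record (no wrapped sequence lines); the function name `count_fastq_reads` is illustrative and not part of Microseek.

```python
def count_fastq_reads(path):
    """Count records in an uncompressed FASTQ file.

    Assumes the common four-line layout (header, sequence, '+', qualities)
    with no line wrapping; this is an assumption about the files, not a
    documented property of the dataset.
    """
    lines = 0
    # Read in binary mode and stream line by line so that multi-gigabyte
    # files are never loaded into memory at once.
    with open(path, "rb") as handle:
        for _ in handle:
            lines += 1
    if lines % 4 != 0:
        raise ValueError(f"{path}: {lines} lines is not a multiple of 4")
    return lines // 4
```

For example, `count_fastq_reads("plasma.fastq")` would return the number of reads in the unspiked plasma matrix once `empty_matrices.tar.xz` has been extracted.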

 

File listing 

empty_matrices.tar.xz
├── plasma.fastq 
└── tissue.fastq 

matrices_spiked_known_viruses.tar.xz
├── d1 
│   ├── spiked_plasma.fastq
│   └── spiked_tissue.fastq
├── d10 
│   ├── spiked_plasma.fastq
│   └── spiked_tissue.fastq
└── d100 
    ├── spiked_plasma.fastq
    └── spiked_tissue.fastq

matrices_spiked_neo_viruses.tar.xz
├── d1 
│   ├── plasma_spiked_with_neo1.fastq
│   ├── plasma_spiked_with_neo2.fastq
│   ├── plasma_spiked_with_neo3.fastq
│   ├── tissue_spiked_with_neo1.fastq
│   ├── tissue_spiked_with_neo2.fastq
│   └── tissue_spiked_with_neo3.fastq
└── d10 
    ├── plasma_spiked_with_neo1.fastq
    ├── plasma_spiked_with_neo2.fastq
    ├── plasma_spiked_with_neo3.fastq
    ├── tissue_spiked_with_neo1.fastq
    ├── tissue_spiked_with_neo2.fastq
    └── tissue_spiked_with_neo3.fastq

neo_viruses.tar.xz
├── genes 
│   ├── neo_1.fasta
│   ├── neo_2.fasta
│   └── neo_3.fasta
└── proteins 
    ├── neo_1.fasta
    ├── neo_2.fasta
    └── neo_3.fasta

output_microseek.tar.xz
├── empty_matrices
│   ├── matrix_plasma 
│   └── matrix_tissue 
├── matrices_spiked_known_viruses
│   ├── filtered
│   │   ├── d100_plasma 
│   │   ├── d100_tissue 
│   │   ├── d10_plasma 
│   │   ├── d10_tissue 
│   │   ├── d1_plasma 
│   │   └── d1_tissue 
│   └── non_filtered
│       ├── d100_plasma 
│       ├── d100_tissue 
│       ├── d10_plasma 
│       ├── d10_tissue 
│       ├── d1_plasma 
│       └── d1_tissue 
└── matrices_spiked_neo_viruses
    ├── filtered
    │   ├── plasma_spiked_with_neo1_at_d1 
    │   ├── plasma_spiked_with_neo1_at_d10 
    │   ├── plasma_spiked_with_neo2_at_d1 
    │   ├── plasma_spiked_with_neo2_at_d10 
    │   ├── plasma_spiked_with_neo3_at_d1 
    │   ├── plasma_spiked_with_neo3_at_d10 
    │   ├── tissue_spiked_with_neo1_at_d1 
    │   ├── tissue_spiked_with_neo1_at_d10 
    │   ├── tissue_spiked_with_neo2_at_d1 
    │   ├── tissue_spiked_with_neo2_at_d10 
    │   ├── tissue_spiked_with_neo3_at_d1 
    │   └── tissue_spiked_with_neo3_at_d10 
    └── non-filtered
        ├── plasma_spiked_with_neo1_at_d1 
        ├── plasma_spiked_with_neo1_at_d10 
        ├── plasma_spiked_with_neo2_at_d1 
        ├── plasma_spiked_with_neo2_at_d10 
        ├── plasma_spiked_with_neo3_at_d1 
        ├── plasma_spiked_with_neo3_at_d10 
        ├── tissue_spiked_with_neo1_at_d1 
        ├── tissue_spiked_with_neo1_at_d10 
        ├── tissue_spiked_with_neo2_at_d1 
        ├── tissue_spiked_with_neo2_at_d10 
        ├── tissue_spiked_with_neo3_at_d1 
        └── tissue_spiked_with_neo3_at_d10 
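Because the archives are large, it can be useful to compare an archive's contents with the listing above before extracting anything. The sketch below uses Python's standard `tarfile` module; the helper name `list_members` and its `suffix` parameter are illustrative assumptions. Listing an `.xz` archive still decompresses the whole stream, so it is not instantaneous on the larger files.

```python
import tarfile


def list_members(archive_path, suffix=".fastq"):
    """Return sorted names of regular files in a .tar.xz archive.

    Only members whose name ends with `suffix` are returned; pass
    suffix="" to list every regular file.
    """
    # "r:xz" opens the archive for reading with LZMA/xz decompression.
    with tarfile.open(archive_path, mode="r:xz") as archive:
        return sorted(
            member.name
            for member in archive  # iterating a TarFile yields TarInfo objects
            if member.isfile() and member.name.endswith(suffix)
        )
```

Calling `list_members("neo_viruses.tar.xz", suffix=".fasta")` should, if the listing above is accurate, return the six FASTA paths under `genes` and `proteins`.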

 

Files (47.4 GB)

Checksum                                 Size
md5:7c63012387f304e8c48cf2482c946a65     4.7 GB
md5:6cfbb4f40dab663e4460c2fc35d4f6f2     14.2 GB
md5:ba86c3d45835191b8d5a48d392d7eb51     28.4 GB
md5:e7e5105484efe8f97566beaa5723741a     16.8 kB
md5:5b35d0d1d1fb465f0d2cb6c4ee58acfe     13.7 MB
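The MD5 values listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib`; the function names are illustrative, and the file is read in chunks so that the multi-gigabyte archives do not need to fit in memory.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps calling read() until it returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A downloaded archive can then be checked with `matches_listed_checksum(path_to_archive, "md5:<value from the listing>")`, using the checksum shown for that archive on the record's download page.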