There is a newer version of the record available.

Published July 26, 2022 | Version 2
Dataset Open

Files for publication "Microseek: A Protein-Based Metagenomic Pipeline for Virus Diagnostic and Discovery"

Description

Context

These files correspond to the article “Microseek: A Protein-Based Metagenomic Pipeline for Virus Diagnostic and Discovery” submitted to Genes.

 

File content

  • input_data-empty_matrices: 50M-read tissue and plasma matrices, no spike;
  • input_data-matrices_spiked_known_viruses: 50M-read tissue and plasma matrices spiked with six known viruses at d1, d10, and d100;
  • input_data-matrices_spiked_neo_viruses: 50M-read tissue and plasma matrices spiked with three Neopneumoviruses at d1 and d10;
  • input_data-neo_viruses: Nucleotide and protein sequences of the three Neopneumoviruses;
  • input_data-tick_sample: 
  • input_data-negative_control: 
  • output_microseek: Microseek outputs, both raw results and results after background filtering.

 

File listing 

empty_matrices.tar.xz
├── plasma.fastq 
└── tissue.fastq 

matrices_spiked_known_viruses
├── d1 
│   ├── spiked_plasma.fastq
│   └── spiked_tissue.fastq
├── d10 
│   ├── spiked_plasma.fastq
│   └── spiked_tissue.fastq
└── d100 
    ├── spiked_plasma.fastq
    └── spiked_tissue.fastq

matrices_spiked_neo_viruses.tar.xz
├── d1 
│   ├── plasma_spiked_with_neo1.fastq
│   ├── plasma_spiked_with_neo2.fastq
│   ├── plasma_spiked_with_neo3.fastq
│   ├── tissue_spiked_with_neo1.fastq
│   ├── tissue_spiked_with_neo2.fastq
│   └── tissue_spiked_with_neo3.fastq
└── d10 
    ├── plasma_spiked_with_neo1.fastq
    ├── plasma_spiked_with_neo2.fastq
    ├── plasma_spiked_with_neo3.fastq
    ├── tissue_spiked_with_neo1.fastq
    ├── tissue_spiked_with_neo2.fastq
    └── tissue_spiked_with_neo3.fastq

neo_viruses.tar.xz
├── genes 
│   ├── neo_1.fasta
│   ├── neo_2.fasta
│   └── neo_3.fasta
└── proteins 
    ├── neo_1.fasta
    ├── neo_2.fasta
    └── neo_3.fasta

output_microseek.tar.xz
├── empty_matrices
│   ├── matrix_plasma 
│   └── matrix_tissue 
├── matrices_spiked_known_viruses
│   ├── filtered
│   │   ├── d100_plasma 
│   │   ├── d100_tissue 
│   │   ├── d10_plasma 
│   │   ├── d10_tissue 
│   │   ├── d1_plasma 
│   │   └── d1_tissue 
│   └── non_filtered
│       ├── d100_plasma 
│       ├── d100_tissue 
│       ├── d10_plasma 
│       ├── d10_tissue 
│       ├── d1_plasma 
│       └── d1_tissue 
└── matrices_spiked_neo_viruses
    ├── filtered
    │   ├── plasma_spiked_with_neo1_at_d1 
    │   ├── plasma_spiked_with_neo1_at_d10 
    │   ├── plasma_spiked_with_neo2_at_d1 
    │   ├── plasma_spiked_with_neo2_at_d10 
    │   ├── plasma_spiked_with_neo3_at_d1 
    │   ├── plasma_spiked_with_neo3_at_d10 
    │   ├── tissue_spiked_with_neo1_at_d1 
    │   ├── tissue_spiked_with_neo1_at_d10 
    │   ├── tissue_spiked_with_neo2_at_d1 
    │   ├── tissue_spiked_with_neo2_at_d10 
    │   ├── tissue_spiked_with_neo3_at_d1 
    │   └── tissue_spiked_with_neo3_at_d10 
    └── non-filtered
        ├── plasma_spiked_with_neo1_at_d1 
        ├── plasma_spiked_with_neo1_at_d10 
        ├── plasma_spiked_with_neo2_at_d1 
        ├── plasma_spiked_with_neo2_at_d10 
        ├── plasma_spiked_with_neo3_at_d1 
        ├── plasma_spiked_with_neo3_at_d10 
        ├── tissue_spiked_with_neo1_at_d1 
        ├── tissue_spiked_with_neo1_at_d10 
        ├── tissue_spiked_with_neo2_at_d1 
        ├── tissue_spiked_with_neo2_at_d10 
        ├── tissue_spiked_with_neo3_at_d1 
        └── tissue_spiked_with_neo3_at_d10 
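The archives listed above can be inspected without unpacking them to disk. The following is a minimal Python sketch, not part of the record itself. It assumes that member names inside each archive follow the tree shown above (for example `d1/spiked_plasma.fastq`) and that the FASTQ files use the common four-lines-per-read layout. The function names `list_fastq_members` and `count_reads` are hypothetical helpers introduced for illustration.

```python
import tarfile


def list_fastq_members(archive_path):
    """Return the sorted names of all .fastq files stored in a .tar.xz archive."""
    with tarfile.open(archive_path, mode="r:xz") as archive:
        return sorted(
            member.name
            for member in archive.getmembers()
            if member.isfile() and member.name.endswith(".fastq")
        )


def count_reads(archive_path, member_name):
    """Count reads in one FASTQ member by streaming it from the archive.

    Assumption: each read occupies exactly four lines (header, sequence,
    '+' separator, qualities), so the read count is the line count // 4.
    """
    with tarfile.open(archive_path, mode="r:xz") as archive:
        handle = archive.extractfile(member_name)
        if handle is None:
            raise ValueError(f"{member_name} is not a regular file in the archive")
        line_count = sum(1 for _ in handle)
    return line_count // 4


# Example usage (paths are placeholders for locally downloaded files):
# for name in list_fastq_members("empty_matrices.tar.xz"):
#     print(name, count_reads("empty_matrices.tar.xz", name))
```

Both functions decompress the archive as a stream, so listing or counting on the larger archives may take a while, but neither needs additional disk space.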

 

Files (47.2 GB)

Checksum                                Size
md5:7fcecf237816bcdf95775a4c6cfa25a8    4.6 GB
md5:81511627def4d19b7170e0b35a44013b    13.9 GB
md5:0681d2c0ec478cb49c7dc133919f2061    27.7 GB
md5:5a7d8183764401a80c77e1e692474d29    1.0 GB
md5:2df0ea563b4cbd6196422e6288767b37    19.2 kB
md5:86b48c001c58864fa156d65e0e27b004    8.9 MB
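Because several of these files are many gigabytes in size, it can be worth checking a download against the MD5 checksums listed above before using it. The sketch below is one way to do that with the Python standard library. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and the file path in the usage comment is a placeholder for whichever file the checksum belongs to.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Reading in chunks keeps memory use constant for multi-gigabyte files.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 digest with an expected value.

    The expected value may carry the 'md5:' prefix used in the listing.
    """
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex


# Example usage (the path is a placeholder for the matching download):
# ok = matches_checksum("downloaded_file", "md5:7fcecf237816bcdf95775a4c6cfa25a8")
# print("checksum OK" if ok else "checksum mismatch, download again")
```

A mismatch usually indicates an incomplete or corrupted transfer rather than a problem with the archive contents.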