Published July 12, 2023 | Version 1
Dataset Open

Phables v 1.1.0 benchmarking data and results

  • 1. Vijini

Description

This record contains all the benchmarking datasets and results for the Phables manuscript. The datasets used are,

  1. A simulated dataset containing four phages Enterobacteria phage P22, Enterobacteria phage T7, Staphylococcus phage SAP13 TA-2022 and Staphylococcus phage SAP2 TA-2022, referred to as simPhage
  2. Water samples from Nansi Lake and Dongping Lake in n Shandong Province, China (NCBI BioProject number PRJNA756429), referred to as Lake Water
  3. Soil samples from flooded paddy fields from Hunan Province, China (NCBI BioProject number PRJNA866269), referred to as Paddy Soil
  4. Wastewater virome (NCBI BioProject number PRJNA434744), referred to as Wastewater
  5. Stool samples from patients with IBD and their healthy household controls (NCBI BioProject number PRJEB7772), referred to as IBD

The simulated dataset was assembled using metaSPAdes and the real datasets were preprocessed using Hecatomb. The Lake Water dataset was also assembled using MEGAHIT (referred to as Lake Water - MEGAHIT) and metaSPAdes (referred to as Lake Water - metaSPAdes). The simulated dataset was run using Phables and evaluated using metaQUAST. All the real datasets were run using PHAMB and Phables and the genomes were evaluated using CheckV.

Files

IBD.zip

Files (1.5 GB)

Name Size Download all
md5:b488e6c5276db5c08fd79e30eecdc7cd
88.9 MB Preview Download
md5:a0ca02af1acdded4a943051051112bfa
203.7 MB Preview Download
md5:a185139c8055a5d36127a85964369e4f
442.1 MB Preview Download
md5:83987de4491592f8a87fba4b88225998
216.4 MB Preview Download
md5:bd0a3368227f0acbe631bcceb03c0f01
28.5 MB Preview Download
md5:352b74980c6408267e52086eba082ea9
460.4 MB Preview Download
md5:e42bd5f58a2b79371a64f6a174be628c
38.8 MB Preview Download