Published January 18, 2022 | Version v2
Dataset Open

AMR Benchmarking dataset - Metagenomics

Authors/Creators

  • 1. Dalhousie University

Description

Metagenomic benchmarking dataset for AMR detection pipelines for assemblies focusing on ESKAPE pathogens in addition to Salmonella. This dataset consists of closed genomes from NCBI where paired-end Illumina data was available. These genomes were then randomly assigned a relative abundance, had additional AMR genes randomly inserted (to cover all AMR genes in CARD v3.1.4) and metagenomic Illumina reads simulated from them.

Metagenomic simulation was performed using: https://github.com/fmaguire/AMR_Metagenome_Simulator and the entire process can be repeated using the metagenome_benchmark.sh script included above.
 

Files
`amr_benchmarking_metagenome.csv` contains the metadata the input genome accessions, paths, and simulated copy number used for creation of the AMR metagenome.

`AMR_metagenome_labels.tsv` a two column csv containing names of all reads that are derived from an AMR gene and an identifier for the corresponding AMR gene. AMR genes are identified using CARD Antibiotic Resistance Ontology (ARO), with a suffix listing any SNVs for nmutation related resistance genes.
 

`simulated_metagenome.fna.gz` contains the full "assembled" true metagenomic contigs (derived directly from the input genome assemblies amplified to the correct copy number).

`metagenome_unsorted.bed` contains the location of AMR genes in the full "assembled" true metagenomic contigs

`simulated_metagenome_{1,2}.fq.gz` contain the simulated paired end metagenomics reads

`simulated_metagenome_error_free.bam` contains the error-free mapping location from which simulated reads were derived

 
 

Files

amr_benchmarking_metagenome.csv

Files (17.2 GB)

Name Size Download all
md5:a34673e3de7c9aafc1b677917ff7de6b
27.9 kB Preview Download
md5:66a7b26238e0f4d37b3afd1f73828cbc
2.7 MB Download
md5:cc481ed7249636b8b1addacd2ea3962a
882 Bytes Download
md5:b3bb17a85564c1ea7542481f101a72c8
565 Bytes Download
md5:ad737c6e7129a29b2552c4e14053ece4
2.4 MB Download
md5:75cba1615ec8b2d505f5c29cb36ea346
1.7 kB Preview Download
md5:cce82a9b0ac35ee3ddecf6868f0cf4c2
6.8 GB Download
md5:d4d084404a9236eda4ef9ce52eca35a1
3.6 GB Download
md5:a096ac175bcd159b6180f489d47a3003
4.8 GB Download
md5:5e5224e9d9e642dac30ebded5f25dc07
2.0 GB Download