Published November 1, 2022 | Version 1.0
Dataset Open

Benchmarking bioinformatic tools for amplicon-based sequencing of norovirus

  • 1. Teagasc Food Research Centre
  • 2. Marine Institute
  • 3. 0000-0002-8726-0328
  • 4. 0000-0002-5465-9068

Description

This repository contains associated datasets and accession numbers for a study entitled 'Benchmarking bioinformatic tools for amplicon-based sequencing of norovirus'. The scripts for this project can be found on the GitHub project page

Expected composition tsv files are the OTU tables for each simulation performed (001-010). OTU IDs in this case are the expected taxonomy with the associated accession numbers. Samples are numbered 1-40, including the simulation number. Expected sequences fasta files contain the sequences used as input for each simulation, without primers or Illumina adapter sequences.

Amplicons were generated using the following primers:

GI Primers 
GISKF: CTG CCC GAA TTY GTA AAT GA 4
GISKR: CCA ACC CAR CCA TTR TAC A 5

GII Primers 
G2SKF: CNT GGG AGG GCG ATC GCAA 8
G2SKR: CCR CCN GCA TRH CCR TTR TAC AT

In this study, three databases and multiple classifiers were compared. Here we include the taxonomy and fasta files for each database; noronet =NoroNet RIVM, calicinet= HuCat CDC and custom, randomly generated database. Fasta files for the classifiers include the GI/GII primers listed above in a 5-3 orientation. 

The tags.txt file contains the Illumina adapters used for the simulation component of the study.

Files

tags.txt

Files (1.3 MB)

Name Size Download all
md5:5356cd19b417c2e7a1904d598c83f7c1
31.2 kB Download
md5:4903d37c1ef09bbd452c3cc67ca8c56b
8.1 kB Download
md5:00a018137af858a683c801da48087b14
64.5 kB Download
md5:8e57db904af691079296366eefb81f6e
54.5 kB Download
md5:495762baee04ac47b6b9b7c1f26d5fd2
57.5 kB Download
md5:84ade9eff409221187aa0bf81daeaa21
54.2 kB Download
md5:0a8b8eea05b03d09a5ff7ddf482a330c
57.3 kB Download
md5:705f4309de7f08b2721c8ab7bca111d0
57.4 kB Download
md5:135233bc5113681343183fdec6dd4208
65.3 kB Download
md5:df998c04fa89f85122ca109a0d3f2c3c
63.5 kB Download
md5:578141fcec6100ed8f2ed6fbd404cfd9
48.2 kB Download
md5:dc30b04dc3ea62e17e7d00604ef7c0b1
64.5 kB Download
md5:e3e5e4b2edc25776bd49902cd594b3f3
55.8 kB Download
md5:843730bfdc73c69daa0d66ca9e9c62e7
58.6 kB Download
md5:a357f59f265d2b1ffedc8efeddcf1e24
47.4 kB Download
md5:5bfe69b1b420162e21370afe19d7ec3c
44.5 kB Download
md5:2cfb57775c683a52b49ce29110e8f6e5
47.0 kB Download
md5:9d1e450e35bdcbc158dd723f3dbe2fad
47.1 kB Download
md5:9543919a1585b47c502ad211bd7007ad
53.1 kB Download
md5:9687e735220a3240031ecd0a09109b62
52.2 kB Download
md5:c7be6bb2ca828826955667e68e6ba469
39.3 kB Download
md5:8d479c8c8dd28b758893f86684419ff5
53.2 kB Download
md5:277efef1894457420f3871dc0b7ec60c
45.7 kB Download
md5:55d4dcb68f393ee82ab1216427fd7d18
47.9 kB Download
md5:5f894e413bfbfa7aa76aaf9b9236468c
29.7 kB Download
md5:4131598c39b1ad47528e0c77e2ef6bc0
10.9 kB Download
md5:55b89e99ecb72969f347da6855d4626c
1.5 kB Preview Download

Additional details

References

  • lant in Tokyo, Japan. Water Sci Technol. 2006;54(11–12):301–8. 37. Kitajima M, Haramoto E, Phanuwan C, Katayama H, Ohgaki S. Detection of genogroup IV norovirus in wastewater and river water in Japan. Lett Appl Microbiol. 2009;49(5):655–8.
  • Gourlé H, Karlsson-Lindsjö O, Hayer J, Bongcam-Rudloff E. Simulating Illumina metagenomic data with InSilicoSeq. Bioinformatics. 2019 Feb 1;35(3):521–2.
  • Kroneman A, Vennema H, Deforche K, Avoort H v. d., Peñaranda S, Oberste MS, et al. An automated genotyping tool for enteroviruses and noroviruses. J Clin Virol. 2011;51(2):121–5.
  • Tatusov RL, Chhabra P, Diez-Valcarce M, Barclay L, Cannon JL, Vinjé J. Human Calicivirus Typing tool: A web-based tool for genotyping human norovirus and sapovirus sequences. J Clin Virol. 2020 Dec 13;104718.