Benchmarking bioinformatic tools for amplicon-based sequencing of norovirus
Creators
- 1. Teagasc Food Research Centre
- 2. Marine Institute
- 3. 0000-0002-8726-0328
- 4. 0000-0002-5465-9068
Description
This repository contains associated datasets and accession numbers for a study entitled 'Benchmarking bioinformatic tools for amplicon-based sequencing of norovirus'. The scripts for this project can be found on the GitHub project page.
The expected-composition TSV files are the OTU tables for each of the ten simulations performed (001-010). Each OTU ID is the expected taxonomy together with the associated accession numbers. Samples are numbered 1-40, and each sample name includes the simulation number. The expected-sequences FASTA files contain the sequences used as input for each simulation, without primers or Illumina adapter sequences.
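As a rough illustration of how an expected-composition table could be read, the sketch below parses a tab-separated OTU table and sums the counts per sample. The column layout (OTU ID in the first column, one column per sample), the function names, and the tiny in-memory table are all assumptions made for the example; check them against the header of the real files before relying on this.

```python
import csv
import io


def read_otu_table(handle):
    """Parse a tab-separated OTU table into {otu_id: {sample: value}}.

    Assumption: the first column holds the OTU ID (expected taxonomy plus
    accession) and every remaining column is one sample.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    samples = header[1:]
    table = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        otu_id, values = row[0], row[1:]
        table[otu_id] = {s: float(v) for s, v in zip(samples, values)}
    return table


def sample_totals(table):
    """Sum the values of all OTUs for each sample."""
    totals = {}
    for counts in table.values():
        for sample, value in counts.items():
            totals[sample] = totals.get(sample, 0.0) + value
    return totals


# Made-up table standing in for one expected-composition file;
# the OTU IDs and sample names here are hypothetical.
example = io.StringIO(
    "OTU_ID\tS1\tS2\n"
    "GI.1_hypothetical_accession_A\t10\t0\n"
    "GII.4_hypothetical_accession_B\t30\t5\n"
)
otu_table = read_otu_table(example)
totals = sample_totals(otu_table)
```

To use it on a downloaded file, pass an open file handle (opened with `newline=""`) to `read_otu_table` instead of the in-memory example.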
Amplicons were generated using the following primers:
GI Primers
GISKF: CTG CCC GAA TTY GTA AAT GA
GISKR: CCA ACC CAR CCA TTR TAC A
GII Primers
G2SKF: CNT GGG AGG GCG ATC GCAA
G2SKR: CCR CCN GCA TRH CCR TTR TAC AT
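The primers above contain IUPAC degenerate bases (Y, R, N, H), so matching them against sequences needs an expansion step. The following is a minimal sketch, not the study's own code: it turns each primer into a regular expression, counts how many concrete sequences a primer represents, and computes a reverse complement. The helper names are hypothetical, and any digit trailing a primer in the listing is assumed to be a citation marker rather than part of the sequence.

```python
import re

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}
# Complement of each code, including the ambiguous ones.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")

# Primer sequences as listed in the description, written 5' to 3'.
PRIMERS = {
    "GISKF": "CTG CCC GAA TTY GTA AAT GA",
    "GISKR": "CCA ACC CAR CCA TTR TAC A",
    "G2SKF": "CNT GGG AGG GCG ATC GCAA",
    "G2SKR": "CCR CCN GCA TRH CCR TTR TAC AT",
}


def clean(primer):
    """Remove the triplet spacing and normalise to upper case."""
    return primer.replace(" ", "").upper()


def reverse_complement(seq):
    """Reverse complement a sequence that may contain IUPAC codes."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def primer_regex(primer):
    """Compile a regex matching every sequence the degenerate primer covers."""
    parts = []
    for base in clean(primer):
        options = IUPAC[base]
        parts.append(options if len(options) == 1 else "[" + options + "]")
    return re.compile("".join(parts))


def degeneracy(primer):
    """Number of distinct non-degenerate sequences the primer represents."""
    count = 1
    for base in clean(primer):
        count *= len(IUPAC[base])
    return count


degeneracies = {name: degeneracy(seq) for name, seq in PRIMERS.items()}
```

A reverse primer is usually searched for as its reverse complement on the forward strand, which is why `reverse_complement` handles the ambiguity codes as well as the four plain bases.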
In this study, three databases and multiple classifiers were compared. Here we include the taxonomy and FASTA files for each database: noronet = NoroNet (RIVM), calicinet = HuCat (CDC), and custom = a randomly generated database. The FASTA files for the classifiers include the GI/GII primers listed above in 5′-3′ orientation.
The tags.txt file contains the Illumina adapters used for the simulation component of the study.
Files (1.3 MB)
MD5 checksum | Size |
---|---|
md5:5356cd19b417c2e7a1904d598c83f7c1 | 31.2 kB |
md5:4903d37c1ef09bbd452c3cc67ca8c56b | 8.1 kB |
md5:00a018137af858a683c801da48087b14 | 64.5 kB |
md5:8e57db904af691079296366eefb81f6e | 54.5 kB |
md5:495762baee04ac47b6b9b7c1f26d5fd2 | 57.5 kB |
md5:84ade9eff409221187aa0bf81daeaa21 | 54.2 kB |
md5:0a8b8eea05b03d09a5ff7ddf482a330c | 57.3 kB |
md5:705f4309de7f08b2721c8ab7bca111d0 | 57.4 kB |
md5:135233bc5113681343183fdec6dd4208 | 65.3 kB |
md5:df998c04fa89f85122ca109a0d3f2c3c | 63.5 kB |
md5:578141fcec6100ed8f2ed6fbd404cfd9 | 48.2 kB |
md5:dc30b04dc3ea62e17e7d00604ef7c0b1 | 64.5 kB |
md5:e3e5e4b2edc25776bd49902cd594b3f3 | 55.8 kB |
md5:843730bfdc73c69daa0d66ca9e9c62e7 | 58.6 kB |
md5:a357f59f265d2b1ffedc8efeddcf1e24 | 47.4 kB |
md5:5bfe69b1b420162e21370afe19d7ec3c | 44.5 kB |
md5:2cfb57775c683a52b49ce29110e8f6e5 | 47.0 kB |
md5:9d1e450e35bdcbc158dd723f3dbe2fad | 47.1 kB |
md5:9543919a1585b47c502ad211bd7007ad | 53.1 kB |
md5:9687e735220a3240031ecd0a09109b62 | 52.2 kB |
md5:c7be6bb2ca828826955667e68e6ba469 | 39.3 kB |
md5:8d479c8c8dd28b758893f86684419ff5 | 53.2 kB |
md5:277efef1894457420f3871dc0b7ec60c | 45.7 kB |
md5:55d4dcb68f393ee82ab1216427fd7d18 | 47.9 kB |
md5:5f894e413bfbfa7aa76aaf9b9236468c | 29.7 kB |
md5:4131598c39b1ad47528e0c77e2ef6bc0 | 10.9 kB |
md5:55b89e99ecb72969f347da6855d4626c | 1.5 kB |
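The MD5 values in the file listing can be used to confirm that a download is intact. The sketch below is one way to do that with Python's standard library; the function names are hypothetical, and because the listing does not pair checksums with file names, matching a downloaded file to its row is left to the reader. The demonstration uses a throwaway temporary file rather than a real file from this record.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed value such as 'md5:5356cd19...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[4:]
    return md5_of_file(path) == expected


# Demonstration on a temporary file; its contents are placeholders.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"placeholder contents\n")
    demo_path = tmp.name

demo_checksum = "md5:" + md5_of_file(demo_path)
demo_ok = matches_listing(demo_path, demo_checksum)
# A checksum belonging to a different file should not match.
demo_bad = matches_listing(demo_path, "md5:55b89e99ecb72969f347da6855d4626c")
os.remove(demo_path)
```

For a real check, call `matches_listing` with the path of the downloaded file and the corresponding `md5:` value from the listing.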
Additional details
References
- Kitajima M, Haramoto E, Phanuwan C, Katayama H, Ohgaki S. Detection of genogroup IV norovirus in wastewater and river water in Japan. Lett Appl Microbiol. 2009;49(5):655–8.
- Gourlé H, Karlsson-Lindsjö O, Hayer J, Bongcam-Rudloff E. Simulating Illumina metagenomic data with InSilicoSeq. Bioinformatics. 2019 Feb 1;35(3):521–2.
- Kroneman A, Vennema H, Deforche K, Avoort H v. d., Peñaranda S, Oberste MS, et al. An automated genotyping tool for enteroviruses and noroviruses. J Clin Virol. 2011;51(2):121–5.
- Tatusov RL, Chhabra P, Diez-Valcarce M, Barclay L, Cannon JL, Vinjé J. Human Calicivirus Typing tool: A web-based tool for genotyping human norovirus and sapovirus sequences. J Clin Virol. 2020 Dec 13;104718.