Published April 26, 2022 | Version v2
Journal article Open

Tutorial for annotating microbial genomes

  • 1. NIOZ, Royal Netherlands Institute for Sea Research, Department of Marine Microbiology and Biogeochemistry, and Utrecht University, Netherlands

Description

This repository is part of a tutorial that teaches biology students how to:

  • download two genomes from NCBI and
  • annotate them using several databases such as the COGs, arCOGS, KOs, PFAM, …
  • combine the results into one comprehensive table.

The actual tutorial can be found on [Github](https://ndombrowski.github.io/Annotation_workflow/).

The goal of this repository is to provide any custom scripts or databases to run the majority of the tutorial. Specifically, you download the following:

- Databases.tar.gz contains any databases required to run this workflow (the paths from where the data was originally obtained can be found in the tutorial notebook

- Databases_for_trimmed_analyses.tar.gz contains any databases required to run the trimmed workflow (the paths from where the data was originally obtained can be found in the tutorial notebook

- scripts.tar.gz contains any custom scripts needed to run this workflow

- Example_data.tar.gz contains examples for the individual output files. This includes the whole analysis as well as a "trimmed" analysis were less database searches were included for a speed-up versions suited for shorter lessons.

taxonomy5.txt.gz contains a mapping file that links the ncbi taxonomy ID to a more informative taxonomy string

 

 

Files

Files (4.4 GB)

Name Size Download all
md5:7c9568381db32815ad123af3d541e5ff
2.8 GB Download
md5:5102d8ff21974951fd0344e6155d2756
1.4 GB Download
md5:8bb0cbd987658027c4a364fdd3f02e2a
170.6 MB Download
md5:626c8b999be1cff3668cada50ae198d0
6.0 kB Download
md5:893f60e36e86ca1c17a98f5cc492b30c
25.5 MB Download