Published August 23, 2023 | Version v2
Dataset Open

Pharokka Database v1.4.0

  • 1. University of Adelaide

Description

This holds the databases for Pharokka v1.4.0 (https://github.com/gbouras13/pharokka). 

It includes the PHROGs database made with MMSeqs2 (v13.45111) and Pyhmmer (v0.9.0 - .h3m file of all 38880 PHROGs), along with MMseqs2 compatible versions of the CARD v3.2.7 and VFDB (as of August 18 2023) databases.

The VFDB database has been clustered using MMSeqs2 at 50% sequence identity to reduce its size and speed up Pharokka's runtime.

`mmseqs easy-cluster VFDB_setB_pro_form.fas VFDBclusterRes tmp --min-seq-id 0.5 -c 0.8 --cov-mode 1`

It also includes the INPHARED mash sketch and annotation file current as of 1 August 2023 (https://github.com/RyanCook94/inphared). 

Files

Files (571.7 MB)

Name Size Download all
md5:c21144209b993c06fae2dac906d73b96
571.7 MB Download