PKZILLA domain phylogenies with outgroup representatives from Uniprot
Description
Source data files, executable code, and results for PKZILLA domain phylogenetic analysis, with inclusion of representative PKS domain outgroups from Uniprot.
Difference from previous version:
Difference with v1.0 of this Zenodo item series, is the text in the final plots has highlighting to allow the reader to quickly appreciate it without having to look closely at each taxonomic identifier, and KR11f0, KR20f0 were renamed in the plots to K11*, KR20*, respectively, based on new interpretations.
Domain phylogenetics method:
Representative PKS domains from bacteria, fungi, dinoflagellates, haptophytes, and human FAS were downloaded from the InterPro API using high level queries for taxonomically restricted polypeptides with the presence of PKS domains, and their Uniprot reviewed (Swiss-Prot) or unreviewed (TrEMBL) status, and then further filtered based on the presence of well known polyketide names in the metadata of the matching Uniprot entry (see impactful_polyketides.tsv):
erythromycin
rapamycin
sirolimus
doxorubicin
amphotericin
tacrolimus
fk506
mupirocin
nystatin
ivermectin
salinomycin
monensin
tetracycline
doxorubicin
plicamycin
daunorubicin
epothilone
discodermolide
brefeldin
narasin
pikromycin
actinorhodin
aflatoxin
lovastatin
saxitoxin
quinolidomicin
curacin
The PKS domains of the PKZILLAs and those PKS domains from the representative Uniprot PKS polypeptides were then multiple sequence aligned with kalign2 v2.0.4 and unrooted phylograms calculated with RAxML-NG v. 1.2.0 with parameters "--model LG+G4m" without bootstrapping.
Plots were generated from the resulting newick files using ete3 v3.1.3 (Huerta-Cepas et al. 2016). Text within PDF plots was highlighted with color using PyMuPDF v1.23.19.
References:
Files
PKZILLA_domain_phylogenies_with_uniprot_representatives.zip
Files
(23.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:fc6dc2240f8e62b17c1bb4e84f5b6565
|
23.7 MB | Preview Download |
Additional details
Related works
- Continues
- Dataset: 10.5281/zenodo.10152639 (DOI)
- Dataset: 10.5281/zenodo.10023460 (DOI)