There is a newer version of the record available.

Published January 28, 2024 | Version v1.1
Dataset Open

PKZILLA domain phylogenies with outgroup representatives from Uniprot

  • 1. University of California, San Diego

Description

Source data files, executable code, and results for PKZILLA domain phylogenetic analysis, with inclusion of representative PKS domain outgroups from Uniprot.

Difference from previous version:

Compared with v1.0 of this Zenodo item series, the text in the final plots is now highlighted so that the reader can quickly interpret the plots without having to look closely at each taxonomic identifier. In addition, KR11f0 and KR20f0 were renamed in the plots to K11* and KR20*, respectively, based on new interpretations.

Domain phylogenetics method:

Representative PKS domains from bacteria, fungi, dinoflagellates, haptophytes, and human FAS were downloaded from the InterPro API using high-level queries that selected taxonomically restricted polypeptides by the presence of PKS domains and by their Uniprot reviewed (Swiss-Prot) or unreviewed (TrEMBL) status. The results were then further filtered for the presence of well-known polyketide names in the metadata of the matching Uniprot entry (see impactful_polyketides.tsv):

erythromycin
rapamycin
sirolimus
doxorubicin
amphotericin
tacrolimus
fk506
mupirocin
nystatin
ivermectin
salinomycin
monensin
tetracycline
plicamycin
daunorubicin
epothilone
discodermolide
brefeldin
narasin
pikromycin
actinorhodin
aflatoxin
lovastatin
saxitoxin
quinolidomicin
curacin
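
The keyword filter described above can be sketched as follows. This is a minimal illustration, not the code shipped in the archive: it assumes that impactful_polyketides.tsv holds one polyketide name per row in its first column, that matching is a case-insensitive substring search, and that each Uniprot entry's metadata is available as a single text string. The function names and the example accessions are hypothetical.

```python
import csv
import io


def load_keywords(tsv_text):
    """Return lowercased names from the first TSV column, repeats collapsed, order kept."""
    keywords = []
    for row in csv.reader(io.StringIO(tsv_text), delimiter="\t"):
        if not row or not row[0].strip():
            continue  # skip blank rows
        name = row[0].strip().lower()
        if name not in keywords:
            keywords.append(name)
    return keywords


def matching_keywords(metadata_text, keywords):
    """List the keywords that occur anywhere in the metadata (case-insensitive)."""
    text = metadata_text.lower()
    return [k for k in keywords if k in text]


def filter_entries(entries, keywords):
    """Keep only entries whose metadata mentions at least one keyword.

    entries maps an accession (hypothetical here) to its metadata text.
    """
    kept = {}
    for accession, metadata in entries.items():
        hits = matching_keywords(metadata, keywords)
        if hits:
            kept[accession] = hits
    return kept


if __name__ == "__main__":
    # Made-up accessions and metadata, for illustration only.
    keywords = load_keywords("erythromycin\nrapamycin\nfk506\n")
    entries = {
        "ACC_A": "Erythromycin polyketide synthase",
        "ACC_B": "Fatty acid synthase",
    }
    print(filter_entries(entries, keywords))
```

Substring matching is deliberately permissive here; synonyms such as rapamycin/sirolimus and tacrolimus/fk506 are kept as separate search terms because a given entry may mention only one of them.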

The PKS domains of the PKZILLAs and those from the representative Uniprot PKS polypeptides were then aligned with kalign2 v2.0.4, and unrooted phylograms were calculated from the multiple sequence alignments with RAxML-NG v1.2.0 using the parameter "--model LG+G4m", without bootstrapping.
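
As a rough sketch of how the alignment and tree-inference steps could be chained, the snippet below only builds the two command lines and does not run them. Only "--model LG+G4m" comes from the description above. The executable names (`kalign`, `raxml-ng`), the `-i`/`-o` and `--msa`/`--prefix` options, and the file names are assumptions that should be checked against the installed tool versions and the scripts in the archive.

```python
import shlex
from pathlib import Path


def build_commands(fasta, workdir="."):
    """Build (kalign, raxml-ng) command lists for one domain FASTA file.

    File naming is hypothetical; nothing is executed here.
    """
    fasta = Path(fasta)
    aligned = Path(workdir) / (fasta.stem + ".aln.fasta")
    prefix = Path(workdir) / fasta.stem

    # Assumed kalign invocation: read sequences with -i, write the alignment with -o.
    kalign_cmd = ["kalign", "-i", str(fasta), "-o", str(aligned)]

    # No --bootstrap or --all command is given, in line with the description
    # ("without bootstrapping"); only the substitution model is taken from the text.
    raxml_cmd = [
        "raxml-ng",
        "--msa", str(aligned),
        "--model", "LG+G4m",
        "--prefix", str(prefix),
    ]
    return kalign_cmd, raxml_cmd


if __name__ == "__main__":
    # "KS_domains.fasta" is a placeholder file name.
    for cmd in build_commands("KS_domains.fasta"):
        print(shlex.join(cmd))
```

Each command list can be passed to `subprocess.run(cmd, check=True)` once the options have been confirmed for the local installation.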

Plots were generated from the resulting Newick files using ete3 v3.1.3 (Huerta-Cepas et al. 2016). Text within the PDF plots was highlighted with color using PyMuPDF v1.23.19.

References:

J. Huerta-Cepas, F. Serra, and P. Bork, “ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data,” Molecular Biology and Evolution, vol. 33, no. 6, pp. 1635–1638, Jun. 2016, doi: 10.1093/molbev/msw046.

Files

PKZILLA_domain_phylogenies_with_uniprot_representatives.zip

Files (23.7 MB)

Additional details

Related works

Continues
Dataset: 10.5281/zenodo.10152639 (DOI)
Dataset: 10.5281/zenodo.10023460 (DOI)

Funding

National Institutes of Health
ELUCIDATING THE BIOSYNTHESIS OF A MODEL LADDER-FRAME POLYETHER TOXIN (F32ES032276)