There is a newer version of the record available.

Published December 2, 2023 | Version v1.0
Dataset Open

PKZILLA domain phylogenies with outgroup representatives from Uniprot

  • 1. ROR icon University of California, San Diego

Description

Source data files, executable code, and results for PKZILLA domain phylogenetic analysis, with inclusion of representative PKS domain outgroups from Uniprot.

Domain phylogenetics method:

Representative PKS domains from bacteria, fungi, dinoflagellates, haptophytes, and human FAS were downloaded from the InterPro API using high level queries for taxonomically restricted polypeptides with the presence of PKS domains, and their Uniprot reviewed (Swiss-Prot) or unreviewed (TrEMBL) status, and then further filtered based on the presence of well known polyketide names in the metadata of the matching Uniprot entry.

The PKS domains of the PKZILLAs and those PKS domains from the representative Uniprot PKS polypeptides were then multiple sequence aligned with kalign2 v2.0.4 and unrooted phylograms calculated with RAxML-NG v. 1.2.0 with parameters "--model LG+G4m" without bootstrapping.

Plots were generated from the resulting newick files using ete3. 

Files

PKZILLA_domain_phylogenies_with_uniprot_representatives.zip

Files (23.2 MB)

Additional details

Related works

Continues
Dataset: 10.5281/zenodo.10152639 (DOI)
Dataset: 10.5281/zenodo.10023460 (DOI)

Funding

National Institutes of Health
ELUCIDATING THE BIOSYNTHESIS OF A MODEL LADDER-FRAME POLYETHER TOXIN F32ES032276