Published November 24, 2024 | Version v1
Publication Open

Diversity of FAAL enzymes and prediction of their substrate specificity using FAALPred

  • 1. ICBAS – School of Medicine and Biomedical Sciences Abel Salazar, University of Porto, Porto
  • 2. Interdisciplinary Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of Porto
  • 3. ROR icon Centro Interdisciplinar de Investigação Marinha e Ambiental

Description

FAALs (Fatty Acyl-AMP Ligases) recruit and incorporate fatty acids during the biosynthesis of secondary metabolites. Their diversity, distribution and substrate specificity remain poorly understood, which limits functional predictions from sequence data. In this study, we explored the prevalence and diversity of FAAL enzymes across the tree of life and show that FAALs are widely distributed among bacteria, with distinct clades associated with specific taxonomic groups and/or biosynthetic pathways. Specifically, bacterial FAALs were found to be predominantly associated with type I PKS, NRPS or NPRS-PKS hybrid biosynthetic pathways. The phylogenetic placement of FAALs was not correlated to the chain length of the fatty acids that they activate and load. Therefore, we developed a deep learning-based prediction algorithm (FAALPred) to forecast the chain length of the fatty acid substrate of a given FAAL sequence. The robustness and accuracy of the predictions generated by FAALPred were validated using independent in vitro and in silico data. We anticipate that FAALPred will not only accelerate secondary metabolite structural predictions and subsequent discovery from FAAL-associated pathways, but also facilitate metabolic engineering of lipoylation.

Files

scripts_material_methods.zip

Files (449.2 MB)

Name Size Download all
md5:8c22738dffdfdf23fe849d257066137c
121.5 kB Download
md5:d0fd9260b15d908752435faa0db62b42
16.6 kB Preview Download
md5:8d065bd1a9fb7bad20d71b2d41bd115d
12.1 MB Download
md5:69e964739ab2984f8e4465ef599a67d9
53.7 kB Download
md5:36275d81c6fa73e45eedd28d680d1c62
36.7 MB Download
md5:b455c6c2137bf5175f08a653ec360ee6
382.9 MB Download
md5:565a362bab8a6fe93bafc932d9e15d91
10.9 MB Download
md5:a12faf0e591a78f1ab6b9df2b1a77db5
6.2 MB Download
md5:0409f5b5443b59d1d03d3a92ec15586c
90.8 kB Download
md5:4b42faf9acd3dec580b7d25a79c1bb66
18.1 kB Download
md5:09072236fd58519867f4b0f31c7b3af8
11.5 kB Download

Additional details

Funding

European Union
952374 952374
European Research Council
ERC-StG FattyCyanos No. 759840
Fundação para a Ciência e Tecnologia
UIDB/04423/2020
Fundação para a Ciência e Tecnologia
UIDP/04423/2020
Fundação para a Ciência e Tecnologia
2020.08183.BD
Fundação para a Ciência e Tecnologia
Inteligência Artificial em Cloud (2ª edição) CPCA - IAC/AV/591374/2023

Dates

Withdrawn
2025-07-04

Software

Programming language
Python