Diversity of FAAL enzymes and prediction of their substrate specificity using FAALPred
Authors/Creators
Description
FAALs (Fatty Acyl-AMP Ligases) recruit and incorporate fatty acids during the biosynthesis of secondary metabolites. Their diversity, distribution and substrate specificity remain poorly understood, which limits functional predictions from sequence data. In this study, we explored the prevalence and diversity of FAAL enzymes across the tree of life and show that FAALs are widely distributed among bacteria, with distinct clades associated with specific taxonomic groups and/or biosynthetic pathways. Specifically, bacterial FAALs were found to be predominantly associated with type I PKS, NRPS or NPRS-PKS hybrid biosynthetic pathways. The phylogenetic placement of FAALs was not correlated to the chain length of the fatty acids that they activate and load. Therefore, we developed a deep learning-based prediction algorithm (FAALPred) to forecast the chain length of the fatty acid substrate of a given FAAL sequence. The robustness and accuracy of the predictions generated by FAALPred were validated using independent in vitro and in silico data. We anticipate that FAALPred will not only accelerate secondary metabolite structural predictions and subsequent discovery from FAAL-associated pathways, but also facilitate metabolic engineering of lipoylation.
Files
scripts_material_methods.zip
Files
(449.2 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:8c22738dffdfdf23fe849d257066137c
|
121.5 kB | Download |
|
md5:d0fd9260b15d908752435faa0db62b42
|
16.6 kB | Preview Download |
|
md5:8d065bd1a9fb7bad20d71b2d41bd115d
|
12.1 MB | Download |
|
md5:69e964739ab2984f8e4465ef599a67d9
|
53.7 kB | Download |
|
md5:36275d81c6fa73e45eedd28d680d1c62
|
36.7 MB | Download |
|
md5:b455c6c2137bf5175f08a653ec360ee6
|
382.9 MB | Download |
|
md5:565a362bab8a6fe93bafc932d9e15d91
|
10.9 MB | Download |
|
md5:a12faf0e591a78f1ab6b9df2b1a77db5
|
6.2 MB | Download |
|
md5:0409f5b5443b59d1d03d3a92ec15586c
|
90.8 kB | Download |
|
md5:4b42faf9acd3dec580b7d25a79c1bb66
|
18.1 kB | Download |
|
md5:09072236fd58519867f4b0f31c7b3af8
|
11.5 kB | Download |
Additional details
Funding
- European Union
- 952374 952374
- European Research Council
- ERC-StG FattyCyanos No. 759840
- Fundação para a Ciência e Tecnologia
- UIDB/04423/2020
- Fundação para a Ciência e Tecnologia
- UIDP/04423/2020
- Fundação para a Ciência e Tecnologia
- 2020.08183.BD
- Fundação para a Ciência e Tecnologia
- Inteligência Artificial em Cloud (2ª edição) CPCA - IAC/AV/591374/2023
Dates
- Withdrawn
-
2025-07-04
Software
- Programming language
- Python