Published November 20, 2023 | Version v1
Dataset Open

Inverton dataset

Creators

Description

Invertible promoters (invertons) are crucial regulatory elements in bacteria, facilitating gene expression changes under stress. Despite their importance, their prevalence and the range of regulated gene functions are largely unknown. We introduced DeepInverton, a deep learning model that identifies invertons across a broad phylogenetic spectrum without using sequencing reads. By analyzing 68,733 bacterial genomes and 9,382 metagenomes, we have uncovered over 200,000 nonredundant invertons, and have also highlighted their abundance in pathogens. Additionally, we identified a post-Cambrian Explosion increase of invertons, paralleling species diversification. Furthermore, we revealed that invertons regulate diverse functions, including antimicrobial resistance and biofilm formation, underscoring their role in environmental adaptation. Notably, the majority of inverton identifications by DeepInverton have been confirmed by the in vitro experiments. The comprehensive inverton profiles have deepened our understanding of invertons at pan-genome and pan-metagenome scales, enabling a broad spectrum of applications in microbial ecology and synthetic biology.

Files

core_inverton_dataset.txt

Files (166.3 MB)

Name Size Download all
md5:4beae9c602189baa028c4f7b501504c2
13.6 MB Preview Download
md5:3914c189fab1098ac74538870c7748af
134.3 MB Preview Download
md5:2615ee4e5a2b8fb24069c960a7060d60
16.5 MB Preview Download
md5:6d156dafe027a4a2575686dd41e9156f
2.0 MB Preview Download