Published November 9, 2021 | Version v1
Dataset Open

Supplementary dataset for Correlational networking guides the discovery of cryptic natural product biosynthetic enzymes

  • 1. University of South Carolina
  • 2. The University of Hong Kong
  • 3. Lanzhou University
  • 4. Shenzhen Bay Laboratory

Description

Supplementary files for the paper: Correlational networking guides the discovery of cryptic natural product biosynthetic enzymes

23777967_protease.fasta.xz:
    A fasta file (compressed by xz) containing 23777967 protease sequences obtained from 161954 bacterial genomes
23777967_protease_cluster.csv.xz:
    A csv file (compressed by xz) containing MMseqs2 cluster information of 23777967 proteases
    This csv file has 5 columns: rep (representative sequence name), mem (member sequence name), number of members in cluster, cluster No.
Fig1C_cytoscape.zip:
    Cytoscape file corresponding to Fig.1C, as well as its node and edge tables
Fig2C_cytoscape.zip:
    Cytoscape file corresponding to Fig.2C, as well as its node and edge tables
FigS2_cytoscape.zip:
    Cytoscape file corresponding to Supplementary Fig.2, as well as its node and edge tables. Size was differently scaled for very large nodes containing more than 1000 precursors/proteases

Files

Fig1C_cytoscape.zip

Files (1.3 GB)

Name Size Download all
md5:d0e51285b63bd8e4f9d9d80f52910afe
850.0 MB Download
md5:82904ae6c0d0668c3b54e24a049728f6
391.0 MB Download
md5:7199389f54e21fb0250aff3d93fe5681
125.7 kB Preview Download
md5:183992d64a6067d4a8b1b941d23e73b1
121.5 kB Preview Download
md5:dab2b0474f3f2a751f201654f1d0ed3e
201.4 kB Preview Download
md5:675964b110cbc59f4b1ce19c05db532b
19.9 MB Download

Additional details

Related works

Is published in
Journal article: 10.1101/2021.07.26.453782 (DOI)