Published November 19, 2024 | Version v1.3.2
Dataset Open

TMVisDB Minimal Database

Description

TMVisDB - Database Release

This is the database release accompanying the paper "TMVisDB: A Resource for Transmembrane Protein Annotations and 3D Visualizations" (2025).

Description

TMVisDB is a comprehensive SQLite database containing annotations for over 46 million predicted transmembrane proteins. The database combines AlphaFold DB structure predictions with transmembrane topology predictions from the protein language model (pLM) based method TMbed.

This release contains the core database structure and annotations, with sequence information and database indices removed due to size constraints. The full database with sequences and indices is available through the web interface at https://tmvisdb.rostlab.org.

Database Contents:

 

TMbed prediction

 

Taxonomy (Domain)

       

Transmembrane type

Signal Peptide

All

Bacteria

Eukaryota

Archaea

Other

Alpha-helix (TMH)

no

39,391,449

28,020,596

9,944,469

991,477

434,907

 

yes

4,475,707

2,977,947

1,306,796

152,801

38,163

Beta-stranded barrels (TMB)

no

323,017

269,126

43,215

1,401

9,275

 

yes

1,814,523

1,786,881

7,959

1,608

18,075

Both

no

20,918

16,937

3,584

90

307

 

yes

22,836

21,731

771

52

282

Total

46,048,450

33,093,218

11,306,794

1,147,429

501,009

 

Schema

The database consists of the following main tables:

  • organism: Contains taxonomic information
  • sequence: Links proteins to their organisms and UniProt identifiers
  • annotation: Stores transmembrane topology predictions
  • tminfo: Contains transmembrane statistics and classifications

A diagram of the database scheme can be found here

Note on Removed Data

To make the database distribution feasible, we removed:

  1. All sequence information from the sequence table
  2. All database indices (which can be recreated using the schema)

Users requiring sequence information should:

  1. Reference the UniProt accessions provided in the database
  2. Access the full database through our web interface
  3. Or rebuild indices locally if needed using the schema provided in our GitHub repository

Associated Resources

Files

Files (34.9 GB)

Name Size Download all
md5:4aaba06e05cade1f87c159a7bce1bc5b
34.9 GB Download

Additional details

Related works

Describes
Preprint: 10.1101/2024.11.22.624323 (DOI)
Is compiled by
Software: https://tmvisdb.rostlab.org (URL)
Is required by
Software: https://github.com/t03i/TMVisDB (URL)
Is source of
Publication: 10.1186/s12859-022-04873-x (DOI)

Software

Repository URL
https://github.com/t03i/TMVisDB
Programming language
Python , TypeScript , Svelte , SQL
Development Status
Active