Published August 12, 2023 | Version v1
Dataset Open

MS-BioGraphs MSA50

Description

https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs/

Name MS-BioGraphs – MSA50
URL https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA50
Graph Explanation Vertices represent proteins and each edge represents the sequence similarity between its two endpoints
Edge Weighted Yes
Directed Yes
Number of Vertices 1,757,323,526
Number of Edges 125,312,536,732
Maximum In-Degree 543,117
Maximum Out-Degree 297,981
Minimum Weight 98
Maximum Weight 634,925
Number of Zero In-Degree Vertices 6,437,984
Number of Zero Out-Degree Vertices 8,542,018
Average In-Degree 71.6
Average Out-Degree 71.7
Size of The Largest Weakly Connected Component 117,980,151,055
Number of Weakly Connected Components 363,090,851
Creation Details MS-BioGraphs: Sequency Similarity Graph Datasets
Format WebGraph
License CC BY-NC-SA
QUB IDF 2223-052
DOI 10.5281/zenodo.7820821
Citation
Koohi Esfahani, Mohsen, Boldi, Paolo, 
Vandierendonck, Hans, Kilpatrick, Peter, 
Vigna,Sebastiano. (2023). 
MS-BioGraphs - MSA50. 
https://doi.org/10.5281/zenodo.7820821. 
http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA50.
Bibtex
@misc{MS-BioGraphs-MSA50,
year = {2023},
author = {Mohsen Koohi Esfahani and Paolo Boldi and 
Hans Vandierendonck and Peter Kilpatrick and 
Sebastiano Vigna},
title = {{MS-BioGraphs - MSA50}},
doi = {10.5281/zenodo.7820821},
url = {http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA50},
howpublished= {\url{http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA50}}}

Files

Files (892 Bytes)

Name Size Download all
md5:ad2cba59365bd813630688e1ca5ec57f
892 Bytes Download