Published August 12, 2023 | Version v1
Dataset Open

MS-BioGraphs MSA500

Description

https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs/

Name MS-BioGraphs – MSA500
URL https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA500
Graph Explanation Vertices represent proteins and each edge represents the sequence similarity between its two endpoints
Edge Weighted Yes
Directed Yes
Number of Vertices 1,757,323,526
Number of Edges 1,244,904,754,157
Maximum In-Degree 229,442
Maximum Out-Degree 814,461
Minimum Weight 98
Maximum Weight 634,925
Number of Zero In-Degree Vertices 6,437,984
Number of Zero Out-Degree Vertices 16,843,087
Average In-Degree 711.0
Average Out-Degree 715.3
Size of The Largest Weakly Connected Component 1,244,203,865,823
Number of Weakly Connected Components 148,861,367
Creation Details MS-BioGraphs: Sequence Similarity Graph Datasets
Format WebGraph
License CC BY-NC-SA
QUB IDF 2223-052
DOI 10.5281/zenodo.7820810
Citation
Koohi Esfahani, Mohsen, Boldi, Paolo, Vandierendonck, Hans, Kilpatrick, Peter, Vigna, Sebastiano. (2023).
MS-BioGraphs - MSA500.
https://doi.org/10.5281/zenodo.7820810.
http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA500.
Bibtex
@misc{MS-BioGraphs-MSA500,
  year = {2023},
  author = {Mohsen Koohi Esfahani and Paolo Boldi and Hans Vandierendonck and Peter Kilpatrick and Sebastiano Vigna},
  title = {{MS-BioGraphs - MSA500}},
  doi = {10.5281/zenodo.7820810},
  url = {http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA500},
  howpublished = {\url{http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA500}}
}
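
Note on the average degrees: the record does not say how the two averages are computed. The sketch below is an inference from the listed figures, not a documented definition. It assumes each average divides the edge count by the number of vertices with non-zero in-degree (or out-degree), which reproduces the listed values. The function name `average_degree` is made up for illustration.

```python
# Figures copied from the record above.
NUM_VERTICES = 1_757_323_526
NUM_EDGES = 1_244_904_754_157
ZERO_IN_DEGREE = 6_437_984
ZERO_OUT_DEGREE = 16_843_087


def average_degree(num_edges, num_vertices, num_zero_degree=0):
    """Mean degree over the vertices that are not counted as zero-degree.

    Assumption: zero-degree vertices are left out of the denominator.
    The record itself does not state this.
    """
    return num_edges / (num_vertices - num_zero_degree)


avg_all = average_degree(NUM_EDGES, NUM_VERTICES)                    # all vertices
avg_in = average_degree(NUM_EDGES, NUM_VERTICES, ZERO_IN_DEGREE)     # non-zero in-degree only
avg_out = average_degree(NUM_EDGES, NUM_VERTICES, ZERO_OUT_DEGREE)   # non-zero out-degree only

print(f"{avg_all:.1f} {avg_in:.1f} {avg_out:.1f}")  # prints "708.4 711.0 715.3"
```

Dividing by all vertices gives about 708.4 for both directions, so the listed 711.0 and 715.3 are consistent only with the reduced denominators.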

Files

Files (902 Bytes)

md5:9f68584b383f6b53446abe96e35234e2
902 Bytes
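
The MD5 checksum listed for the file can be used to check a download. The following is a minimal sketch using Python's standard `hashlib`. The helper names `md5_of_file` and `matches_record` are made up for illustration, and the path passed to them is whatever local name the downloaded file was saved under, since the record does not show the file name.

```python
import hashlib

# Checksum copied from the file listing above.
EXPECTED_MD5 = "9f68584b383f6b53446abe96e35234e2"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """True if the file at `path` hashes to the expected MD5 value."""
    return md5_of_file(path) == expected.lower()
```

A mismatch from `matches_record` would suggest an incomplete or altered download. MD5 is suitable here only as an integrity check, not as a security guarantee.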