Published August 12, 2023 | Version v1
Dataset Open

MS-BioGraphs MSA200

Description

https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs/

Name: MS-BioGraphs – MSA200
URL: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA200
Graph Explanation: Vertices represent proteins, and each edge represents the sequence similarity between its two endpoints
Edge Weighted: Yes
Directed: Yes
Number of Vertices: 1,757,323,526
Number of Edges: 500,444,322,597
Maximum In-Degree: 658,879
Maximum Out-Degree: 709,176
Minimum Weight: 98
Maximum Weight: 634,925
Number of Zero In-Degree Vertices: 6,437,984
Number of Zero Out-Degree Vertices: 7,471,315
Average In-Degree: 285.8
Average Out-Degree: 286.0
Size of the Largest Weakly Connected Component: 496,880,685,957
Number of Weakly Connected Components: 221,467,156
Creation Details: MS-BioGraphs: Sequence Similarity Graph Datasets
Format: WebGraph
License: CC BY-NC-SA
QUB IDF: 2223-052
DOI: 10.5281/zenodo.7820815
Citation
Koohi Esfahani, Mohsen, Boldi, Paolo, 
Vandierendonck, Hans, Kilpatrick, Peter, 
Vigna, Sebastiano. (2023). 
MS-BioGraphs - MSA200. 
https://doi.org/10.5281/zenodo.7820815. 
http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA200.
Bibtex
@misc{MS-BioGraphs-MSA200,
  year = {2023},
  author = {Mohsen Koohi Esfahani and Paolo Boldi and Hans Vandierendonck and Peter Kilpatrick and Sebastiano Vigna},
  title = {{MS-BioGraphs - MSA200}},
  doi = {10.5281/zenodo.7820815},
  url = {http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA200},
  howpublished = {\url{http://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA200}}
}
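
The average in-degree and out-degree listed in the statistics above differ, although every edge contributes exactly one in-degree and one out-degree. A quick check suggests that the averages may have been taken over vertices with non-zero degree only. This reading is an inference from the published numbers, not something the record states; the sketch below simply redoes the arithmetic.

```python
# Figures copied from the statistics listed in this record.
NUM_VERTICES = 1_757_323_526
NUM_EDGES = 500_444_322_597
ZERO_IN_DEGREE = 6_437_984
ZERO_OUT_DEGREE = 7_471_315


def average_degree(edges: int, vertices: int) -> float:
    """Mean number of edges per vertex for the given vertex count."""
    return edges / vertices


# Average over all vertices (identical for in- and out-degree).
avg_all = average_degree(NUM_EDGES, NUM_VERTICES)

# ASSUMPTION: averages restricted to vertices with non-zero degree.
avg_in_nonzero = average_degree(NUM_EDGES, NUM_VERTICES - ZERO_IN_DEGREE)
avg_out_nonzero = average_degree(NUM_EDGES, NUM_VERTICES - ZERO_OUT_DEGREE)

print(f"all vertices:          {avg_all:.1f}")          # 284.8
print(f"non-zero in-degree:    {avg_in_nonzero:.1f}")   # 285.8
print(f"non-zero out-degree:   {avg_out_nonzero:.1f}")  # 286.0
```

Under that assumption the computed values agree with the listed 285.8 and 286.0, whereas the average over all vertices would be about 284.8.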

Files

Files (897 Bytes)

md5:fbafb335283871ff8fee87f6c870609e, 897 Bytes
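
A downloaded copy of the file can be checked against the MD5 checksum listed above. The following is a minimal sketch using only the Python standard library. The file name is not shown in this record, so the path passed to the functions is a placeholder to be supplied by the reader, and the helper names `md5_of_file` and `matches_record` are hypothetical.

```python
import hashlib
from pathlib import Path

# Checksum as listed in the Files section of this record.
EXPECTED_MD5 = "fbafb335283871ff8fee87f6c870609e"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with Path(path).open("rb") as fh:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_record("path/to/downloaded-file")` with the real location of the download returns `True` only when the file is byte-for-byte identical to the one described here.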