There is a newer version of the record available.

Published December 9, 2024 | Version Version 0.0.2
Dataset Open

Sample Graph Datasets in CSV Format

  • 1. Scientific Software Center (Universität Heidelberg)

Description

Sample Graph Datasets in CSV Format

Note: none of the data sets published here contain actual data, they are for testing purposes only. 

Description

This data repository contains graph datasets, where each graph is represented by two CSV files: one for node information and another for edge details. To link the files to the same graph, their names include a common identifier based on the number of nodes. For example:

  • dataset_30_nodes_interactions.csv:contains 30 rows (nodes).
  • dataset_30_edges_interactions.csv: contains 47 rows (edges).
  • the common identifier dataset_30 refers to the same graph.

CSV nodes

Each dataset contains the following columns:

Name of the Column Type Description
UniProt ID string protein identification
label string protein label (type of node)
properties string a dictionary containing properties related to the protein.

CSV edges

Each dataset contains the following columns:

Name of the Column Type Description
Relationship ID string relationship identification
Source ID string identification of the source protein in the relationship
Target ID string identification of the target protein in the relationship
label string relationship label (type of relationship)
properties string a dictionary containing properties related to the relationship.

Metadata

Graph Number of Nodes Number of Edges Sparse graph

dataset_30*

30

47

Y

dataset_60*

60

181

Y

dataset_120*

120

689

Y

dataset_240*

240

2819

Y

dataset_300*

300

4658

Y

dataset_600*

600

18004

Y

dataset_1200*

1200

71785

Y

dataset_2400*

2400

288600

Y

dataset_3000*

3000

449727

Y

dataset_6000*

6000

1799413

Y

dataset_12000*

12000

7199863

Y

dataset_24000*

24000

28792361

Y

 

This repository include two (2) additional tiny graph datasets to experiment before dealing with larger datasets.

CSV nodes (tiny graphs)

Each dataset contains the following columns:

Name of the Column Type Description
ID string node identification
label string node label (type of node)
properties string a dictionary containing properties related to the node.

CSV edges (tiny graphs)

Each dataset contains the following columns:

Name of the Column Type Description
ID string relationship identification
source string identification of the source node in the relationship
target string identification of the target node in the relationship
label string relationship label (type of relationship)
properties string a dictionary containing properties related to the relationship.

Metadata (tiny graphs)

Graph Number of Nodes Number of Edges Sparse graph
dataset_dummy* 3 6 N
dataset_dummy2* 3 6 N

Files

dataset_12000_edges_interactions.csv

Files (2.5 GB)

Name Size Download all
md5:706ffcd42ffe7f2e92f8075ade2f1091
456.2 MB Preview Download
md5:a4893164eb2a1933e2b85eec811b62a9
1.6 MB Preview Download
md5:8eb744c4a97ef08a30244c4cab675a19
4.5 MB Preview Download
md5:e50ed1280a4aa8c765fa573c15186b20
160.8 kB Preview Download
md5:0ad8c31de37a58127ec9964e869e6096
43.3 kB Preview Download
md5:257c9ff15ca74c531eda044c46cb189d
15.9 kB Preview Download
md5:717e5e0df7c13dafc5a5f22d7c1c4d1b
1.8 GB Preview Download
md5:78f13907df1769f714a04bda204c715c
3.2 MB Preview Download
md5:e66db84272a189b8c45c28e4747e8dc9
18.3 MB Preview Download
md5:670f88764ce6563ffecb874adc8e04e1
321.8 kB Preview Download
md5:7743eab46ec905f2937baae71f280365
178.7 kB Preview Download
md5:029fc65525f5ce6fb1f4cabf5660a1a4
32.4 kB Preview Download
md5:309cc520344ff73afb645fb5f0afc272
28.5 MB Preview Download
md5:3758d7ee4ffd9d4113d0a6d668f30eb3
402.3 kB Preview Download
md5:0d10383dc432b5bc4802b861c7256066
295.7 kB Preview Download
md5:c0ba0a15d6dbafb6788f6fcf3a832b21
40.3 kB Preview Download
md5:f74aac116d34ed40f70b648427d95f51
3.1 kB Preview Download
md5:82115cbeba416de30997043eb6624b03
4.1 kB Preview Download
md5:de64176d2a90d5cf5be1d7382ae9f94b
114.0 MB Preview Download
md5:8c69308d88db93898fcfd2299e3f803d
804.5 kB Preview Download
md5:1c9c40d72fc38299f6d797a12a551bd3
1.1 MB Preview Download
md5:4592389da0845a7e52a78a43b0c271ac
80.1 kB Preview Download
md5:8c6a317afd474eaba6daad19da9f5e99
12.2 kB Preview Download
md5:226a21a3464bc61b9aabc1b1638b54fb
8.0 kB Preview Download
md5:cd84d459d8baf04ceb5b91fad424cce0
387 Bytes Preview Download
md5:4172116787aed5e444a697e21b28183c
323 Bytes Preview Download
md5:ebb439ea907e01f8cf3d7311deb0dc31
362 Bytes Preview Download
md5:4bfe250abe4d28602475596e67788b1a
117 Bytes Preview Download