Sample Graph Datasets in CSV Format
Authors/Creators
- Scientific Software Center (Universität Heidelberg)
Description
Note: None of the datasets published here contain real data; they are for testing purposes only.
This data repository contains graph datasets, where each graph is represented by two CSV files: one for node information and another for edge details. The two files of a graph share a common identifier in their names, based on the number of nodes. For example:
- dataset_30_nodes_interactions.csv: contains 30 rows (nodes).
- dataset_30_edges_interactions.csv: contains 47 rows (edges).
- The common identifier dataset_30 indicates that both files belong to the same graph.
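The naming convention above can be used to pair node and edge files automatically. The following is a minimal sketch, assuming every file follows the pattern of the two example names (dataset_<number>_nodes_interactions.csv and dataset_<number>_edges_interactions.csv); the function name pair_graph_files is hypothetical and not part of the repository.

```python
import re
from collections import defaultdict
from pathlib import Path

# Assumed naming pattern, inferred from the two example file names only.
FILE_PATTERN = re.compile(r"^(dataset_\d+)_(nodes|edges)_interactions\.csv$")


def pair_graph_files(filenames):
    """Group file names by their shared graph identifier (e.g. 'dataset_30')."""
    graphs = defaultdict(dict)
    for name in filenames:
        match = FILE_PATTERN.match(Path(name).name)
        if match is None:
            continue  # skip files that do not follow the assumed pattern
        identifier, kind = match.groups()
        graphs[identifier][kind] = name
    # Keep only graphs for which both the node file and the edge file exist.
    return {
        gid: files
        for gid, files in graphs.items()
        if files.keys() >= {"nodes", "edges"}
    }
```

Calling pair_graph_files on a directory listing returns one entry per complete graph, each holding the names of its node file and edge file.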
CSV nodes
Each node file contains the following columns:
| Name of the Column | Type | Description |
| UniProt ID | string | protein identification |
| label | string | protein label (type of node) |
| properties | string | a dictionary containing properties related to the protein. |
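As an illustration of how a node file might be read, here is a small sketch using Python's standard csv module. It assumes the files are comma-separated, that the header row uses exactly the column names listed above, and that the properties string is either JSON or a Python dict literal; none of these details are confirmed by this description. The sample rows are made up for the example.

```python
import ast
import csv
import io
import json


def parse_properties(raw):
    """Turn the 'properties' string into a dict.

    Assumption: the string is either JSON or a Python dict literal.
    """
    try:
        value = json.loads(raw)
    except json.JSONDecodeError:
        value = ast.literal_eval(raw)
    if not isinstance(value, dict):
        raise ValueError(f"expected a dictionary, got {type(value).__name__}")
    return value


def read_nodes(handle):
    """Read a node CSV and return {UniProt ID: (label, properties dict)}."""
    nodes = {}
    for row in csv.DictReader(handle):
        nodes[row["UniProt ID"]] = (row["label"], parse_properties(row["properties"]))
    return nodes


# Made-up rows for illustration only; they are not taken from the repository.
sample = io.StringIO(
    "UniProt ID,label,properties\n"
    "P00001,protein,\"{'name': 'example A'}\"\n"
    'P00002,protein,"{""name"": ""example B""}"\n'
)
nodes = read_nodes(sample)
```

For a downloaded file, the same function can be given a handle opened with open(path, newline="") instead of the in-memory sample.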
CSV edges
Each edge file contains the following columns:
| Name of the Column | Type | Description |
| Relationship ID | string | relationship identification |
| Source ID | string | identification of the source protein in the relationship |
| Target ID | string | identification of the target protein in the relationship |
| label | string | relationship label (type of relationship) |
| properties | string | a dictionary containing properties related to the relationship. |
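The edge columns above are enough to build an adjacency structure and to check that every edge endpoint also appears in the node file. The sketch below assumes comma-separated files whose header row matches the listed column names; the helper names and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict


def read_edges(handle):
    """Read an edge CSV and return a list of (source, target, label) tuples."""
    return [
        (row["Source ID"], row["Target ID"], row["label"])
        for row in csv.DictReader(handle)
    ]


def build_adjacency(edges):
    """Map each source ID to the list of target IDs it points to."""
    adjacency = defaultdict(list)
    for source, target, _label in edges:
        adjacency[source].append(target)
    return dict(adjacency)


def dangling_endpoints(edges, node_ids):
    """Return endpoint IDs that do not appear in the given node IDs."""
    endpoints = {s for s, _t, _l in edges} | {t for _s, t, _l in edges}
    return endpoints - set(node_ids)


# Made-up rows for illustration only; they are not taken from the repository.
sample = io.StringIO(
    "Relationship ID,Source ID,Target ID,label,properties\n"
    "R1,P00001,P00002,interacts_with,{}\n"
    "R2,P00002,P00003,interacts_with,{}\n"
)
edges = read_edges(sample)
adjacency = build_adjacency(edges)
missing = dangling_endpoints(edges, {"P00001", "P00002"})
```

An empty result from dangling_endpoints means that the node file and the edge file are consistent with each other.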
Metadata
| Graph | Number of Nodes | Number of Edges | Sparse graph |
| dataset_30* | 30 | 47 | Y |
| dataset_60* | 60 | 181 | Y |
| dataset_120* | 120 | 689 | Y |
| dataset_240* | 240 | 2819 | Y |
| dataset_300* | 300 | 4658 | Y |
| dataset_600* | 600 | 18004 | Y |
| dataset_1200* | 1200 | 71785 | Y |
| dataset_2400* | 2400 | 288600 | Y |
| dataset_3000* | 3000 | 449727 | Y |
| dataset_6000* | 6000 | 1799413 | Y |
| dataset_12000* | 12000 | 7199863 | Y |
| dataset_24000* | 24000 | 28792361 | Y |
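One common way to quantify how sparse a graph is uses its density, the ratio of existing edges to possible edges. The sketch below assumes the graphs are directed and have no self-loops, so that n nodes allow n(n-1) edges. This description does not state which criterion was used for the "Sparse graph" column, so the calculation is only an illustration.

```python
def directed_density(num_nodes, num_edges):
    """Fraction of possible directed edges (no self-loops) that are present."""
    if num_nodes < 2:
        return 0.0
    return num_edges / (num_nodes * (num_nodes - 1))


# (nodes, edges) pairs copied from the metadata tables in this description.
GRAPH_SIZES = {
    "dataset_30": (30, 47),
    "dataset_600": (600, 18004),
    "dataset_24000": (24000, 28792361),
    "dataset_dummy": (3, 6),
}

for name, (nodes, edges) in GRAPH_SIZES.items():
    print(f"{name}: density = {directed_density(nodes, edges):.4f}")
```

Under this assumption, dataset_30 has a density of about 0.054, while a graph with 3 nodes and 6 edges has a density of 1.0, meaning every possible directed edge is present.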
This repository includes two additional tiny graph datasets for experimenting before working with the larger datasets.
CSV nodes (tiny graphs)
Each node file contains the following columns:
| Name of the Column | Type | Description |
| ID | string | node identification |
| label | string | node label (type of node) |
| properties | string | a dictionary containing properties related to the node. |
CSV edges (tiny graphs)
Each edge file contains the following columns:
| Name of the Column | Type | Description |
| ID | string | relationship identification |
| source | string | identification of the source node in the relationship |
| target | string | identification of the target node in the relationship |
| label | string | relationship label (type of relationship) |
| properties | string | a dictionary containing properties related to the relationship. |
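The tiny graphs use different column names (ID, source, target) from the larger datasets (UniProt ID, Relationship ID, Source ID, Target ID). To run the same code on both, the headers can be mapped to one set of names while reading. The following sketch assumes comma-separated files with a header row; the alias table, the function name, and the sample rows are hypothetical.

```python
import csv
import io

# Map the column names of both schemas to one common set of names.
COLUMN_ALIASES = {
    "UniProt ID": "id",        # node files of the larger datasets
    "Relationship ID": "id",   # edge files of the larger datasets
    "ID": "id",                # node and edge files of the tiny graphs
    "Source ID": "source",
    "Target ID": "target",
}


def normalized_rows(handle):
    """Yield CSV rows as dicts with unified column names."""
    for row in csv.DictReader(handle):
        yield {COLUMN_ALIASES.get(key, key): value for key, value in row.items()}


# Made-up rows for illustration only; they are not taken from the repository.
large_edges = io.StringIO(
    "Relationship ID,Source ID,Target ID,label,properties\n"
    "R1,P00001,P00002,interacts_with,{}\n"
)
tiny_edges = io.StringIO(
    "ID,source,target,label,properties\n"
    "e1,n1,n2,connects,{}\n"
)
large_rows = list(normalized_rows(large_edges))
tiny_rows = list(normalized_rows(tiny_edges))
```

After normalization, rows from both kinds of files expose the same keys (id, source, target, label, properties), so downstream code does not need to distinguish between them.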
Metadata (tiny graphs)
| Graph | Number of Nodes | Number of Edges | Sparse graph |
| dataset_dummy* | 3 | 6 | N |
| dataset_dummy2* | 3 | 6 | N |
Files
Total size: 2.5 GB
| MD5 checksum | Size |
|---|---|
| 706ffcd42ffe7f2e92f8075ade2f1091 | 456.2 MB |
| a4893164eb2a1933e2b85eec811b62a9 | 1.6 MB |
| 8eb744c4a97ef08a30244c4cab675a19 | 4.5 MB |
| e50ed1280a4aa8c765fa573c15186b20 | 160.8 kB |
| 0ad8c31de37a58127ec9964e869e6096 | 43.3 kB |
| 257c9ff15ca74c531eda044c46cb189d | 15.9 kB |
| 717e5e0df7c13dafc5a5f22d7c1c4d1b | 1.8 GB |
| 78f13907df1769f714a04bda204c715c | 3.2 MB |
| e66db84272a189b8c45c28e4747e8dc9 | 18.3 MB |
| 670f88764ce6563ffecb874adc8e04e1 | 321.8 kB |
| 7743eab46ec905f2937baae71f280365 | 178.7 kB |
| 029fc65525f5ce6fb1f4cabf5660a1a4 | 32.4 kB |
| 309cc520344ff73afb645fb5f0afc272 | 28.5 MB |
| 3758d7ee4ffd9d4113d0a6d668f30eb3 | 402.3 kB |
| 0d10383dc432b5bc4802b861c7256066 | 295.7 kB |
| c0ba0a15d6dbafb6788f6fcf3a832b21 | 40.3 kB |
| f74aac116d34ed40f70b648427d95f51 | 3.1 kB |
| 82115cbeba416de30997043eb6624b03 | 4.1 kB |
| de64176d2a90d5cf5be1d7382ae9f94b | 114.0 MB |
| 8c69308d88db93898fcfd2299e3f803d | 804.5 kB |
| 1c9c40d72fc38299f6d797a12a551bd3 | 1.1 MB |
| 4592389da0845a7e52a78a43b0c271ac | 80.1 kB |
| 8c6a317afd474eaba6daad19da9f5e99 | 12.2 kB |
| 226a21a3464bc61b9aabc1b1638b54fb | 8.0 kB |
| cd84d459d8baf04ceb5b91fad424cce0 | 387 Bytes |
| 4172116787aed5e444a697e21b28183c | 323 Bytes |
| ebb439ea907e01f8cf3d7311deb0dc31 | 362 Bytes |
| 4bfe250abe4d28602475596e67788b1a | 117 Bytes |
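The MD5 checksums listed for the files can be used to check that a download is complete and uncorrupted. Below is a minimal sketch with Python's hashlib that reads the data in chunks, so that the larger files do not have to fit into memory. The function names are hypothetical, and the expected checksum may be passed with or without an md5: prefix.

```python
import hashlib


def md5_of_stream(stream, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a binary stream, one chunk at a time."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Return True if the file at 'path' has the expected MD5 checksum."""
    # Accept both 'md5:<hex>' and a bare hex string.
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    with open(path, "rb") as handle:
        return md5_of_stream(handle) == expected_hex
```

A call such as matches_checksum(path_to_download, checksum_from_the_list) returns False when the file is truncated or altered, in which case it should be downloaded again.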