Published November 10, 2023 | Version v3
Dataset Open

The OREGANO knowledge graph for computational drug repurposing

Description

The files here are data files from the OREGANO project, which consists of building a holistic knowledge graph on drugs, including natural compounds. Here is the list of files:

 

- OREGANO_V2.tsv : The triplet file used for link prediction. 3 columns : Subjet ; Predicate ; Object

- oreganov2.1_metadata_complet.ttl : The OREGANO knowledge graph in turtle format with the names and cross-references of the various integrated entities.

 

The following files contain the cross-references of OREGANO entities according to their type. They are all organised as follows: the external sources are the titles of the columns and each line begins with the identifier of the entity in OREGANO :

- TARGET.tsv: Cross-reference table of the 22,096 targets.
- PHENOTYPES.tsv: Cross-reference table of the 11,605 phenotypes.
- DISEASES.tsv:  Cross-reference table of the 18,333 diseases.
- PATHWAYS.tsv: Cross-reference table of the 2,129 pathways.
- GENES.tsv: Cross-reference table of the 35,794 genes.
- COMPOUND.tsv:  Cross-reference table of the 90,868 compounds.
- INDICATIONS.tsv: Cross-reference table of the 2,714 indications.
- SIDE_EFFECT.tsv: Cross-reference table of the 6,060 side-effects.
- ACTIVITY.tsv: Names of the 78 activities.
- EFFECT.tsv: Names of the 171 effects.

The OREGANO knowledge graph is composed of 11 types of nodes and 19 types of links. The current version of the graph contains 88,937 nodes and 824,231 links.

A SPARQL endpoint has been provided to enable users to retrieve and explore the knowledge graph at OREGANO SPARQL endpoint .

 

The integration files and the knowledge graph are available on the GitHub of the OREGANO project in the Integration folder: Gitub repository .

 

Files

Files (186.6 MB)

Name Size Download all
md5:d1faec938bd32ec9bb45ae5bba14d7a2
4.0 kB Download
md5:d86277d0e849c3774acb3c6be1878b44
8.6 MB Download
md5:747a48c9786fd50912bb8d7f33d630df
998.7 kB Download
md5:9b1656a45df7d0369dc2a8e5f52b5c2b
7.1 kB Download
md5:8fe9a917b23ab715a268feda50b4a8d7
4.0 MB Download
md5:eae2cbaebc64b1387cce1755a386d1cb
120.7 kB Download
md5:3e534cadf7717c705cbd5e6dd8c392a6
34.7 MB Download
md5:b087300f5e597d263a03c02eaae53bc7
114.3 MB Download
md5:d3f0abefe355c4b9cd1eea4cc8d58021
55.5 kB Download
md5:cada3e9316ba4ec654dd2b9ec2afa23c
713.1 kB Download
md5:8388b5a37db250128e50571d42d96cbc
277.2 kB Download
md5:46e189c556b2b010952d4212b8779cef
22.9 MB Download

Additional details

Related works

Is identical to
Dataset: 10.6084/m9.figshare.23553114.v3 (DOI)

Dates

Available
2023-10-16