There is a newer version of the record available.

Published February 11, 2021 | Version v1.0.0_03SEP2021
Dataset Open

PheKnowLator Human Disease Knowledge Graph Benchmarks -- v1.0.0

Authors/Creators

  • 1. University of Colorado Anschutz Medical Campus

Description

PKT Human Disease Knowledge Graph Benchmark Builds (v1.0.0)

Build Date: September 03, 2019

The KG Benchmark Builds can also be downloaded from Zenodo:
👉 KGs: https://doi.org/10.5281/zenodo.7030200
👉 Embeddings: https://zenodo.org/record/7030189

 

Required Input Documents

  • resource_info.txt
  • class_source_list.txt
  • instance_source_list.txt
  • ontology_source_list.txt

 

Data

Data Download Date: November 30, 2018

Ontologies

Classes

Instances

 

Knowledge Graphs

Knowledge Representation
We worked with a PhD-level biologist to develop a knowledge representation (see the figure below) that modeled mechanisms underlying human disease.

 

To do this, we manually mapped all possible combinations of the following six node types:

  • Humans Diseases
  • Human Phenotypes
  • Human Genes
  • Gene Ontology concepts
  • Reactome Pathways
  • Chemicals

As shown in the figure above, the Basic Formal Ontology and Relation Ontology ontologies were then used to create edges between the node types.

 

As shown in this figure, the following edge-types were created:

 

Knowledge Graph
The knowledge graph represented above was built using the following steps: Merge Ontologies: Merge ontologies using the OWL Tools API
Express New Ontology Concept Annotations: Create new ontology annotations by asserting a relation between the instance and an instance of the ontology class. For example to assert the following relations:

Morphine --> is substance that treats --> Migraine

We would need to create two axioms:

  • isSubstanceThatTreats(Morphine, x1)
  • instanceOf(x1, Migraine)

While the instance of the HP class hemiplegic migraines can be treated as an anonymous node in the knowledge graph, we generate a new international resource identifier for each newly generated instance.

Deductively Close Knowledge Graph: The knowledge graph is deductively closed by using the OWL 2 EL reasoner, ELK via Protégé v5.1.1. ELK is able to classify instances and supports inferences over class hierarchies and object properties. inference over disjointness, intersection, and existential quantification (ontology class hierarchies).

Generate Edge List: The final step before exporting the edge list is to remove any nodes that are not biologically meaningful or would otherwise reduce the performance of machine learning algorithms and the algorithm used to generate embeddings.

 

🚨 AVAILABLE FILES 🚨Available KG benchmark files are zipped and listed below. For additional details on what each file contains, please see the associated Wiki page 👉 here.

Files

PheKnowLator_v1_ClassInstancesOnly_KG.owl.zip

Files (493.1 MB)

Name Size Download all
md5:c7b99411be0fc61eb70a834a11772e0a
79.2 MB Preview Download
md5:76f781e3f7b6cd981675bfa16e397882
935.2 kB Preview Download
md5:6d81e9fcf5dac835bf5045ea96cf8daa
1.9 MB Preview Download
md5:7aa165a1b6c400fba031921246a19a6e
36.1 MB Preview Download
md5:3b2a154c509a35c48c0d1d8100cd0852
94.5 kB Preview Download
md5:19d9909c656508921577e65596b68b9c
14.2 MB Preview Download
md5:51d6e5335507912299d6edf8d7c3b3bb
21.1 MB Preview Download
md5:c7ca419f7ec3c3b2ca0d8939ddd961d3
37.9 MB Preview Download
md5:eeaca2a57d91f5e47b76bd0fbd40d8f4
20.9 MB Preview Download
md5:841d00d9dba86fb811a6eed8f7d4d05f
20.7 MB Preview Download
md5:de5bebc6c2a53c60974f5f1815c7a66d
20.9 MB Preview Download
md5:15d95e44a94eae185bf5c86670cfc96c
14.0 MB Preview Download
md5:876380e9221cc679f0b5f86b2c2cf26d
37.5 MB Preview Download
md5:a7a52c8633403360af3c7555adc702d4
1.9 MB Preview Download
md5:3961282608e3e493d94f75c386430812
82.8 MB Preview Download
md5:e4e686cffd0e372c12a1dc657be7a911
82.8 MB Preview Download
md5:056dca446ad480746af903bb462e2a3a
20.0 MB Preview Download

Additional details

Related works

Is referenced by
Dataset: https://zenodo.org/record/8173020 (URL)