Exploring the Role of Generative AI in Constructing Knowledge Graphs for Drug Indications with Medical Context
Creators
Description
The medical context for a drug indication provides crucial information on how the drug can be used in practice. However, the extraction of medical context from drug indications remains poorly explored, as most research concentrates on the recognition of medications and associated diseases. Indeed, most databases cataloging drug indications do not contain their medical context in a machine-readable format. This paper proposes the use of a large language model for constructing DIAMOND-KG, a knowledge graph of drug indications and their medical context. The study 1) examines the change in accuracy and precision in providing additional instruction to the language model, 2) estimates the prevalence of medical context in drug indications, and 3) assesses the quality of DIAMOND-KG against NeuroDKG, a small manually curated knowledge graph. The results reveal that more elaborated prompts improve the quality of extraction of medical context; 71% of indications had at least one medical context; 63.52% of extracted medical contexts correspond to those identified in NeuroDKG. This paper demonstrates the utility of using large language models for specialized knowledge extraction, with a particular focus on extracting drug indications and their medical context. We provide DIAMOND-KG as a FAIR RDF graph supported with an ontology. Openly accessible, DIAMOND-KG may be useful for downstream tasks such as semantic query answering, recommendation engines, and drug repositioning research.
Files
SWAT-KG.pdf
Files
(298.5 kB)
Name | Size | Download all |
---|---|---|
md5:a3dbc1217b8e5e6fc8caac3ae9899ab7
|
298.5 kB | Preview Download |
Additional details
Identifiers
Related works
- Is part of
- Conference proceeding: urn:nbn:de:0074-3890-8 (URN)
Funding
References
- S. I. Avram, et al., Drugcentral 2021 supports drug discovery and repositioning, Nucleic Acids Research 49 (2020) D1160 – D1169.
- S. J. Nelson, et al., Formalizing drug indications on the road to therapeutic intent, JAMIA 24 (2017) 1169–1172.
- S. Marchesin, et al., Building a large gene expression-cancer knowledge base with limited human annotations, Database J. Biol. Databases Curation 2023 (2023).
- M. Trajanoska, et al., Enhancing knowledge graph construction using large language models, 2023. arXiv:2305.04676.
- J. H. Caufield, et al., Ontogpt, 2023. URL: https://monarch-initiative.github.io/ontogpt/.
- D. S. Wishart, et al., Drugbank: a comprehensive resource for in silico drug discovery and exploration, Nucleic Acids Research 34 (2005) D668 – D672.
- A. Névéol, Z. Lu, Automatic integration of drug indications from multiple health resources, in: Proc. of the 1st ACM international health informatics symposium, 2010, pp. 666–673.
- W. Q. Wei, et al., Development and evaluation of an ensemble resource linking medications to their indications, JAMIA 20 (2013) 954 – 961.
- C. Peng, et al., Clinical concept and relation extraction using prompt-based machine reading comprehension, JAMIA 30 (2023) 1486–1493.
- H. Hu, et al., A generative drug–drug interaction triplets extraction framework based on large language models, Proc. of the Association for Information Science and Technology 60 (2023) 980–982.
- R. Khare, et al., Labeledin: Cataloging labeled indications for human drugs, Journal of biomedical informatics 52 (2014) 448–56.
- K. Moodley, et al., InContext: curation of medical context for drug indications, Journal of Biomedical Semantics 12 (2021) 2.
- Hypothesis, Hypothesis.is – Open Annotation Tool, 2023. URL: https://web.hypothes.is, accessed: 2023-06-13.
- BioPortal, Bioportal annotator, 2023. URL: https://bioportal.bioontology.org/annotator, accessed: 2023-06-13.
- J. Yang, et al., Publishing Medical Context of Neurological Drug Indications as a Knoweldge Graph, Technical Report, Institute of Data Science, Maastrich University, Maastricht, the Netherlands, 2021. URL: https://github.com/MaastrichtU-IDS/neuro_dkg/blob/master/ publication.pdf.
- M. D. Wilkinson, et al., The fair guiding principles for scientific data management and stewardship, Scientific data 3 (2016) 1–9.