Published January 3, 2025 | Version v1
Conference paper Open

Exploring the Role of Generative AI in Constructing Knowledge Graphs for Drug Indications with Medical Context

  • 1. ROR icon University of Liverpool
  • 2. ROR icon Università di Camerino
  • 3. ROR icon Vienna University of Economics and Business
  • 4. ROR icon University of Stavanger
  • 5. ROR icon University of Padua
  • 6. ROR icon Linköping University
  • 7. ROR icon Maastricht University

Description

The medical context for a drug indication provides crucial information on how the drug can be used in practice. However, the extraction of medical context from drug indications remains poorly explored, as most research concentrates on the recognition of medications and associated diseases. Indeed, most databases cataloging drug indications do not contain their medical context in a machine-readable format. This paper proposes the use of a large language model for constructing DIAMOND-KG, a knowledge graph of drug indications and their medical context. The study 1) examines the change in accuracy and precision in providing additional instruction to the language model, 2) estimates the prevalence of medical context in drug indications, and 3) assesses the quality of DIAMOND-KG against NeuroDKG, a small manually curated knowledge graph. The results reveal that more elaborated prompts improve the quality of extraction of medical context; 71% of indications had at least one medical context; 63.52% of extracted medical contexts correspond to those identified in NeuroDKG. This paper demonstrates the utility of using large language models for specialized knowledge extraction, with a particular focus on extracting drug indications and their medical context. We provide DIAMOND-KG as a FAIR RDF graph supported with an ontology. Openly accessible, DIAMOND-KG may be useful for downstream tasks such as semantic query answering, recommendation engines, and drug repositioning research.

Files

SWAT-KG.pdf

Files (298.5 kB)

Name Size Download all
md5:a3dbc1217b8e5e6fc8caac3ae9899ab7
298.5 kB Preview Download

Additional details

Related works

Is part of
Conference proceeding: urn:nbn:de:0074-3890-8 (URN)

Funding

European Commission
KnowGraphs - Knowledge Graphs at Scale 860801
European Commission
ODECO - Towards a sustainable Open Data ECOsystem 955569
European Commission
HEREDITARY - HetERogeneous sEmantic Data integratIon for the guT-bRain interplaY 101137074

References

  • S. I. Avram, et al., Drugcentral 2021 supports drug discovery and repositioning, Nucleic Acids Research 49 (2020) D1160 – D1169.
  • S. J. Nelson, et al., Formalizing drug indications on the road to therapeutic intent, JAMIA 24 (2017) 1169–1172.
  • S. Marchesin, et al., Building a large gene expression-cancer knowledge base with limited human annotations, Database J. Biol. Databases Curation 2023 (2023).
  • M. Trajanoska, et al., Enhancing knowledge graph construction using large language models, 2023. arXiv:2305.04676.
  • J. H. Caufield, et al., Ontogpt, 2023. URL: https://monarch-initiative.github.io/ontogpt/.
  • D. S. Wishart, et al., Drugbank: a comprehensive resource for in silico drug discovery and exploration, Nucleic Acids Research 34 (2005) D668 – D672.
  • A. Névéol, Z. Lu, Automatic integration of drug indications from multiple health resources, in: Proc. of the 1st ACM international health informatics symposium, 2010, pp. 666–673.
  • W. Q. Wei, et al., Development and evaluation of an ensemble resource linking medications to their indications, JAMIA 20 (2013) 954 – 961.
  • C. Peng, et al., Clinical concept and relation extraction using prompt-based machine reading comprehension, JAMIA 30 (2023) 1486–1493.
  • H. Hu, et al., A generative drug–drug interaction triplets extraction framework based on large language models, Proc. of the Association for Information Science and Technology 60 (2023) 980–982.
  • R. Khare, et al., Labeledin: Cataloging labeled indications for human drugs, Journal of biomedical informatics 52 (2014) 448–56.
  • K. Moodley, et al., InContext: curation of medical context for drug indications, Journal of Biomedical Semantics 12 (2021) 2.
  • Hypothesis, Hypothesis.is – Open Annotation Tool, 2023. URL: https://web.hypothes.is, accessed: 2023-06-13.
  • BioPortal, Bioportal annotator, 2023. URL: https://bioportal.bioontology.org/annotator, accessed: 2023-06-13.
  • J. Yang, et al., Publishing Medical Context of Neurological Drug Indications as a Knoweldge Graph, Technical Report, Institute of Data Science, Maastrich University, Maastricht, the Netherlands, 2021. URL: https://github.com/MaastrichtU-IDS/neuro_dkg/blob/master/ publication.pdf.
  • M. D. Wilkinson, et al., The fair guiding principles for scientific data management and stewardship, Scientific data 3 (2016) 1–9.