Published June 12, 2025 | Version v1
Conference paper Open

Towards Generating Synthetic EHR Knowledge Graphs – a Probabilistic Approach

  • 1. Institute of Logic and Computation, TU Wien, Austria
  • 2. Faculty of Computer Science and Engineering Ss. Cyril and Methodius University in Skopje, N. Macedonia
  • 3. Aalborg University

Description

Advances in medical AI and data analytics require large amounts of patient data. Due to privacy concerns, such data is not always available. Synthetic data generation promises a solution to provide the required data despite privacy restrictions. In this paper, we therefore introduce SynMedRDF, an open-source tool to generate synthetic Electronic Health Records. It ensures clinical accuracy by using real-world probabilities and correlations. The data is output as an RDF knowledge graph, enabling structure- and semantics-aware sharing, linking, and analysis.

Files

GOBLIN25_Camera-Ready_23.pdf

Files (285.8 kB)

Name Size Download all
md5:45959acb1757f1a24d4d141a4a38ac43
285.8 kB Preview Download

Additional details

Funding

European Cooperation in Science and Technology
GOBLIN: Global Network on Large-Scale, Cross-domain and Multilingual Open Knowledge Graphs CA23147