Published July 7, 2025 | Version v1
Poster Open

RAG-Enhanced LLM Pipeline for Semantic Mapping of Context-based Features to OMOP Vocabulary

  • 1. ROR icon Hasselt University
  • 2. Data Science Institute (DSI), Hasselt University
  • 3. ROR icon Flemish Institute for Technological Research

Description

This work presents a Retrieval-Augmented Generation large language model pipeline that automates the mapping of context-based clinical features to OMOP vocabulary concepts. The system stores OMOP concepts in a vector database, retrieves the most semantically relevant matches based on user input, and uses an LLM to generate context-aware concept suggestions with explanations.

The approach improves mapping accuracy compared to standard tools while enhancing transparency and usability. It supports efficient feature extraction and contributes to safer and more effective evaluation of AI applications in healthcare.

Original abstract included.

Files

A_102_RAG-LLM_Poster_Sariga Kakkamani - sariga k.pdf

Additional details

Additional titles

Alternative title
Accelerating Feature Extraction with AI-Powered RAG-LLM: Automated Concept Mapping to OMOP-CDM Vocabulary.

Funding

European Health and Digital Executive Agency
Real-world-data Enabled Assessment for heaLth regulatory decision-Making (REALM) 101095435