Published July 11, 2025 | Version v1
Presentation Open

Inference Training with Retrieval Augmented Generation - EXAKI training

Description

EXAKI training number 2.

This webinar, led by METU, introduces the EXA4MIND AI Inference Service, which supports advanced querying capabilities in high-performance computing (HPC) environments.

During the training session, a local implementation of Retrieval-Augmented Generation (RAG) is showcased, demonstrating how it can enhance the precision of large language model (LLM) responses.

The session walks through a complete RAG pipeline—from preprocessing a university knowledge base into manageable chunks, generating vector embeddings using an embedding model, to storing these embeddings in a vector database for semantic similarity search.

The training concludes by demonstrating how to send an API request to the inference service using the enriched prompt.

Files

2 Session Inference Training _ SLIDES.pptx.pdf

Files (1.4 MB)

Name Size Download all
md5:8bdefe9f8bc814ef747436b5bb430fe6
1.4 MB Preview Download

Additional details

Funding

European Commission
EXA4MIND - EXtreme Analytics for MINing Data spaces 101092944

Dates

Accepted
2025-07-11