Published April 2, 2025 | Version v1
Presentation Open

Bridging the Gap between Private Data and AI: A Beginner's Guide to Retrieval Augmented Generation (RAG)

  • 1. ROR icon Forschungszentrum Jülich

Description

In April 2025, Fritz Niesel, Mascha Schmidt and Victor Haguet presented about RAG and their work. 

Abstract: AI is transforming administrative and scientific workflows by automating tedious cognitive tasks. However, one of our most valuable resources - private and sensitive documents - often remains inaccessible to AI solutions. Retrieval Augmented Generation (RAG) offers a way to bridge this gap. It enables AI models to efficiently process and search large private data sources while maintaining control over data access and security.

In this talk, we’ll guide you through the process of building a RAG system, from understanding its core components to integrating it into real-world applications. We’ll begin by introducing RAG and demonstrating how it enhances knowledge retrieval where standard LLMs fall short. Then, we’ll break down key concepts such as chunking, embedding, vector storage and retrieval. Finally, we’ll explore how to tailor RAG for advanced use cases and integrate it into data-driven applications.

Whether you’re a developer, researcher, or AI enthusiast, this session will provide a clear roadmap for using RAG effectively. Join us to learn when to use RAG, what to expect, and how to take your LLM applications to the next level.

Link to the notebook: https://codebase.helmholtz.cloud/sdlaml-public-ressources/RAG_tutorial_langchain_blablador/

The presentation recording is available on the HiRSE YouTube Channel: https://www.youtube.com/watch?v=L-3ypGheljs&t=2136s 

Learn more about the HiRSE Seminar Series: https://www.helmholtz-hirse.de/series.html

Files

RAG_A_Beginners_Guide.pdf

Files (2.7 MB)

Name Size Download all
md5:96559970a343b6bb2037d7a6fca37b69
2.7 MB Preview Download