RISE Crash Course: "Information Extraction from Images with AI"

Hindermann, Maximilian

doi:10.5281/zenodo.17416426

Published October 22, 2025 | Version v3

Presentation | Open

Hindermann, Maximilian (Contact person)¹

1. University of Basel

Slides for RISE Crash Course: "Information Extraction from Images with AI"

In this two-hour course, you will learn how multimodal large language models, such as GPT-4o, Gemini 1.5, or Claude 3.5 Sonnet, can be used to extract structured information directly from images. This approach skips the intermediate step of text recognition and transcription that traditional workflows (for example, those built on Transkribus) usually require.

Using concrete examples from ongoing research projects, the course will demonstrate the practical possibilities and limitations of this technology and address the technical and methodological prerequisites for successful implementation. It will also consider data quality, the FAIRness (Findability, Accessibility, Interoperability, Reusability) of the extracted data, and the associated costs.

Files

Files (2.4 MB)

Name	Size	Checksum
RISE Crash Course Information Extraction from Images with AI.pdf.pdf	2.4 MB	md5:bad0501a13c895dce467cdbefdfa1674

	All versions	This version
Views	280	62
Downloads	250	26
Data volume	796.7 MB	71.5 MB
