Published March 24, 2026
| Version v0.5.0
Software
Open
RISE-UNIBAS/humanities_data_benchmark
Description
This repository contains benchmark datasets (images and text), prompts, ground truths, and evaluation scripts for assessing the performance of large language models (LLMs) on humanities-related tasks. The suite is designed as a resource for researchers and practitioners interested in systematically evaluating how well various LLMs perform on digital humanities (DH) tasks involving visual and text-like materials. For detailed test results and model comparisons, visit our results dashboard at https://rise-services.rise.unibas.ch/benchmarks/.
Notes
Files
RISE-UNIBAS/humanities_data_benchmark-v0.5.0.zip
Files
(474.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:f8aafb8bef18cf27645e483bf8724265
|
474.1 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/RISE-UNIBAS/humanities_data_benchmark/tree/v0.5.0 (URL)
Software
- Repository URL
- https://github.com/RISE-UNIBAS/humanities_data_benchmark