Norwegian Medical Question Answering Dataset - NorMedQA

Riegler, Michael A.

doi:10.5281/zenodo.15346637

There is a newer version of the record available.

Published May 5, 2025 | Version v5

Dataset Open

Riegler, Michael A.¹

1. Simula Research Laboratory

Contributors

Contact person:

Riegler, Michael A.¹

1. Simula Research Laboratory

This benchmark dataset consists of 1401 medical question-and-answer pairs, primarily in Norwegian (Bokmål and Nynorsk), designed for evaluating large language models (LLMs). The content originates from publicly available sources containing medical exam questions and has been cleaned and preprocessed. The dataset is structured in JSON format: each record contains the source document name, the question number (where available), the question text, the reference answer text, and, for multiple-choice questions, the text of the wrong answers. It is suitable for use within evaluation frameworks such as lm-evaluation-harness (a GitHub repository with configuration and example code is available at https://github.com/kelkalot/normedqa) to assess model capabilities in medical knowledge retrieval and reasoning specific to the Norwegian context.
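As a rough illustration of the record structure described above, the following Python sketch separates multiple-choice records from open-ended ones. The key names (`question`, `answer`, `wrong_answers`) and the assumption that the file holds a top-level JSON list are hypothetical and not confirmed by this record; inspect the actual file before relying on them.

```python
import json
from collections import Counter

# ASSUMPTION: these key names are hypothetical placeholders; the real
# file may use different names for the same fields.
QUESTION_KEY = "question"
ANSWER_KEY = "answer"
WRONG_KEY = "wrong_answers"


def load_records(path):
    """Load the dataset, assuming a top-level JSON list of record objects."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def count_question_types(records):
    """Count records that carry wrong answers (multiple choice) versus those that do not."""
    counts = Counter()
    for record in records:
        kind = "multiple_choice" if record.get(WRONG_KEY) else "open_ended"
        counts[kind] += 1
    return counts


# Tiny made-up sample standing in for the real file contents.
sample = [
    {QUESTION_KEY: "Q1?", ANSWER_KEY: "A", WRONG_KEY: ["B", "C"]},
    {QUESTION_KEY: "Q2?", ANSWER_KEY: "Free-text answer", WRONG_KEY: []},
]
sample_counts = count_question_types(sample)
print(dict(sample_counts))
```

With the real file, `count_question_types(load_records("norwegian_medical_qa_v2.json"))` would give the same breakdown, provided the assumed key names match.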

Files

Name	Size	MD5
norwegian_medical_qa_v2.json	1.1 MB	3bac1a4f73270db2c701c37d8c2315d0
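
The MD5 checksum listed for the file can be used to confirm that a download is intact. A minimal sketch using Python's standard `hashlib` module follows; the helper names `md5_of_file` and `verify_download` are illustrative, and the local path is whatever location the file was saved to.

```python
import hashlib

# Checksum as listed for norwegian_medical_qa_v2.json in this record.
EXPECTED_MD5 = "3bac1a4f73270db2c701c37d8c2315d0"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True when the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_download("norwegian_medical_qa_v2.json")` should return `True` for an unmodified copy of this version of the file.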

	All versions	This version
Views	287	15
Downloads	48	6
Data volume	48.7 MB	6.7 MB

DOI: 10.5281/zenodo.15346637
Resource type: Dataset
Publisher: Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: May 5, 2025
Modified: May 5, 2025
