Published October 17, 2025 | Version v1
Dataset Open

Fornorsking og fjernlesing

  • 1. EDMO icon University of Oslo
  • 2. National library of Norway

Description

Dataset underpinning the article “Fornorsking og fjernlesing. Annengjøring av samisk og kvensk kultur og identitet i norsk prosa og drama (1850–1950)”.

Contains:
Korpus_same_kven.xlsx (list of 40 works, metadata, NB links)
all_etnonyms_after_llm_luminon.xlsx (concordances and GPT-4 results)
Analyze_data_from_luminons.html (semantic groupings, chi-square analysis)
make_concordances_run_LLM_lapp.html (example of concordance extraction and LLM run)
README.md (metadata and citation)

Built using the NB DH-Lab API and analyzed with OpenAI GPT-4 for classification into four categories: annengjøring, innlemming, nøytral, and uklart.
The dataset documents corpus, concordances, and quantitative results supporting the published study.

Files

Readme (2).md

Files (1.6 MB)

Name Size Download all
md5:fe84f409981f7be8199342f88d652cda
211.7 kB Download
md5:9954df9128df8b8aeaaaad6b38907436
482.8 kB Download
md5:50b03e8ced5ac311d33d1aa96de4a428
11.9 kB Download
md5:e99c8bfe2d1dfd6c1228e2f174f8211d
875.4 kB Download
md5:f474e4d61e106cfac9cf01840b534e02
1.8 kB Preview Download