Conference paper Open Access

Software demonstration: meaning-based querying of historical corpora with MacBERTh

Fonteyn, Lauren; Manjavacas, Enrique

This is an abstract for a software demonstration at DHBenelux 2022.

This software demonstration focuses on MacBERTh, a BERT-based model pre-trained on Early Modern and Late Modern English (3.9B tokenized words; time span: 1450-1950; Manjavacas & Fonteyn 2021, 2022). We will demonstrate how MacBERTh can help researchers (i) access and (ii) analyse the semantic information encoded in linguistic corpus data in a (semi-)automatic way.
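
As a rough illustration of what meaning-based querying can look like, the sketch below ranks corpus sentences by cosine similarity to a query vector. It is not the demonstrated software: the function names `cosine` and `rank_by_meaning` are hypothetical, the sentences are invented, and the three-dimensional vectors are hand-made placeholders standing in for the contextual embeddings that a model such as MacBERTh would produce.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_by_meaning(query_vec, corpus):
    """Return (score, sentence) pairs, most similar to the query first.

    `corpus` is a list of (sentence, vector) pairs. In a real setting the
    vectors would be embeddings taken from a language model; here they are
    toy values chosen by hand (an assumption made for illustration only).
    """
    scored = [(cosine(query_vec, vec), sentence) for sentence, vec in corpus]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Invented toy corpus with placeholder vectors (not real model output).
corpus = [
    ("He was a gay and merry fellow.", [0.9, 0.1, 0.0]),
    ("The mouse ran behind the wainscot.", [0.0, 0.2, 0.9]),
    ("A cheerful company sat at table.", [0.8, 0.3, 0.1]),
]

# Placeholder vector for the queried sense, e.g. "gay" meaning "cheerful".
query = [1.0, 0.2, 0.0]

ranking = rank_by_meaning(query, corpus)
for score, sentence in ranking:
    print(f"{score:.3f}  {sentence}")
```

The point of the sketch is that retrieval operates on vector similarity rather than on string matching, so a sentence can rank highly without sharing any word with the query.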
