Published May 3, 2022 | Version v1
Conference paper Open

Software demonstration: meaning-based querying of historical corpora with MacBERTh

  • 1. Leiden University

Description

This is an abstract for a software demonstration at DHBenelux 2022.

This software demonstration focuses on MacBERTh, a BERT-based language model pre-trained on Early Modern and Late Modern English texts dated 1450-1950 (3.9B tokenized words; Manjavacas & Fonteyn 2021, 2022). We will demonstrate how MacBERTh can help researchers (i) access and (ii) analyse the semantic information encoded in linguistic corpus data in a (semi-)automatic way.
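
The abstract does not say how the querying is implemented. One common approach to meaning-based retrieval with contextual embeddings is nearest-neighbour search by cosine similarity, sketched below under that assumption. The function names (`cosine`, `rank_by_meaning`), the example sentences, and the three-dimensional vectors are all hypothetical placeholders. In practice, each vector would be the model's contextual embedding of the target word in its sentence.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_by_meaning(query_vec, corpus):
    """Return (score, sentence) pairs, most similar to the query first."""
    scored = [(cosine(query_vec, vec), sentence) for sentence, vec in corpus]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Invented sentences with toy vectors standing in for contextual
# embeddings of the word "nice"; these are NOT real model outputs.
corpus = [
    ("a nice distinction between the two terms", [0.9, 0.1, 0.0]),
    ("he drew a nice line in the argument", [0.8, 0.2, 0.1]),
    ("we had a nice day in the country", [0.1, 0.9, 0.2]),
]

# Toy vector for a query usage of "nice" in the sense 'precise, subtle'.
query_vec = [1.0, 0.0, 0.0]

ranking = rank_by_meaning(query_vec, corpus)
for score, sentence in ranking:
    print(f"{score:.3f}  {sentence}")
```

With real embeddings, the same ranking step would retrieve corpus attestations whose usage is closest in meaning to the query example, rather than those that merely share a word form.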

Files

Fonteyn_Manjavacas_DHBENE_2022.pdf (184.7 kB)
md5:bfe59edbf4643c72a0f6e968d8ebb519