Conference paper Open Access

Clustering Ideological Terms in Historical Newspaper Data with Diachronic Word Embeddings

Jani Marjanen; Lidia Pivovarova; Elaine Zosa; Jussi Kurunmaki

During the course of the nineteenth century, ideological language mostly expressed through isms such as liberalism, socialism or conservatism, entered the lexicon in most European languages. Previous research has based on reading key texts claimed that the
suffix ism was introduced to new linguistic domains during the period up to WWI, many ofwhich do not relate to ideology. This paper uses a data-driven way to study the emergence of isms in nineteenth-century Finnish newspapers and uses word embeddings to cluster them and to trace their thematic expansion in the period. As such, the study provides a quantitatively sound way of tracking how isms relate to ideological language and more generally contributes to the understanding of the development of political language in
Finland.

Files (569.3 kB)
Name Size
Isms_clustering_for_HistoInformatics2019.pdf
md5:4aac859b0e166e8f638dfcbcf741ada7
569.3 kB Download
54
43
views
downloads
All versions This version
Views 5424
Downloads 4315
Data volume 23.8 MB8.5 MB
Unique views 4620
Unique downloads 3815

Share

Cite as