Conference paper Open Access

Word Clustering for Historical Newspapers Analysis

Lidia Pivovarova; Jani Marjanen; Elaine Zosa

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Lidia Pivovarova</dc:creator>
  <dc:creator>Jani Marjanen</dc:creator>
  <dc:creator>Elaine Zosa</dc:creator>
  <dc:description>This paper is part of a collaboration between computer scientists and historians aimed at developing novel methods for the analysis of historical newspapers. We present a case study of ideological terms ending with the suffix -ism in nineteenth-century Finnish newspapers. We propose a two-step procedure to trace differences in word usage over time: training diachronic embeddings on several time slices, and then clustering the embeddings of selected words together with their neighbours to obtain historical context. The obtained clusters turn out to be useful for historical studies. The paper also discusses specific difficulties related to the development of historian-oriented tools.</dc:description>
  <dc:title>Word Clustering for Historical Newspapers Analysis</dc:title>
</oai_dc:dc>