Published November 29, 2019 | Version v1
Other Open

Embeddings built on 19th century newspapers from Finland

  • 1. University of Helsinki

Description

Embeddings built on 19th century Finnish and Swedish newspapers from Finland.

These embeddings were used in the following papers; please cite at least one of them:

@inproceedings{pivovarova2019word,
  title={Word Clustering for Historical Newspapers Analysis},
  author={Pivovarova, Lidia and Marjanen, Jani and Zosa, Elaine},
  booktitle={Ranlp Workshop on Language technology for Digital Humanities},
  year={2019}
}
@inproceedings{marjanen2019clustering,
  title={Clustering ideological terms in historical newspaper data with diachronic word embeddings},
  author={Marjanen, Jani and Pivovarova, Lidia and Zosa, Elaine and Kurunm{\"a}ki, Jussi},
  booktitle={5th International Workshop on Computational History, HistoInformatics 2019},
  year={2019},
  organization={CEUR-WS}
}

 

Files

Files (474.3 MB)

Size: 474.3 MB
md5:0e4b0e7d0885fd706acf76045efbbf30
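
The MD5 checksum listed for the file can be used to confirm that a download completed without corruption. The following is a minimal sketch, not part of the original record: the helper names `md5_of_file` and `matches_record` are hypothetical, and the local file path is left to the reader, since the record's file name is not shown here.

```python
import hashlib

# Checksum as listed in this record's file table.
EXPECTED_MD5 = "0e4b0e7d0885fd706acf76045efbbf30"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a file of several hundred MB is never held in memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_record` with the path of the downloaded file returns `True` only if the local copy matches the published checksum; a `False` result suggests re-downloading the file.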

Additional details

Funding

  • NewsEye: A Digital Investigator for Historical Newspapers (grant 770299), European Commission
  • EMBEDDIA: Cross-Lingual Embeddings for Less-Represented Languages in European News Media (grant 825153), European Commission

References

  • Pivovarova, Lidia, Jani Marjanen, and Elaine Zosa. "Word Clustering for Historical Newspapers Analysis." Ranlp Workshop on Language technology for Digital Humanities. 2019.
  • Marjanen, Jani, et al. "Clustering ideological terms in historical newspaper data with diachronic word embeddings." 5th International Workshop on Computational History, HistoInformatics 2019. CEUR-WS, 2019.