Thesis Open Access

A Method for Wavelet-Based Time Series Analysis of Historical Newspapers

Avikainen, Jari

Thesis supervisor(s)

Toivonen, Hannu

This thesis presents a wavelet-based method for detecting moments of fast change in the textual contents of historical newspapers. The method works by generating time series of the relative frequencies of different words in the newspaper contents over time, and calculating their wavelet transforms. Wavelet transform is essentially a group of transformations describing the changes
happening in the original time series at different time scales, and can therefore be used to pinpoint moments of fast change in the data. The produced wavelet transforms are then used to detect fast changes in word frequencies by examining products of multiple scales of the transform.

The aim of this thesis is to examine the applicability of a wavelet transform-based method for change detection in time series generated from historical newspaper data. The change detection method examined in the thesis was developed as a part of NewsEye, an EU-funded project that aims to provide improved tools and methods for performing historical research using newspaper archives as the source material.

Files (1.3 MB)
Name Size
A Method for Wavelet-Based Time Series Analysis of Historical Newspapers.pdf
md5:26d489e8136e426cdc38ac4b3760d63f
1.3 MB Download
121
65
views
downloads
All versions This version
Views 121121
Downloads 6565
Data volume 86.9 MB86.9 MB
Unique views 107107
Unique downloads 5757

Share

Cite as