Published November 1, 2022 | Version v1
Journal article Open

Nonstationary data stream classification with online active learning and siamese neural networks

  • 1. University of Cyprus

Description

We have witnessed in recent years an ever-growing volume of information becoming available in a streaming manner in various application areas. As a result, there is an emerging need for online learning methods that train predictive models on-the-fly. A series of open challenges, however, hinder their deployment in practice. These are, learning as data arrive in real-time one-by-one, learning from data with limited ground truth information, learning from nonstationary data, and learning from severely imbalanced data, while occupying a limited amount of memory for data storage. We propose the ActiSiamese algorithm, which addresses these challenges by combining online active learning, siamese networks, and a multi-queue memory. It develops a new density-based active learning strategy which considers similarity in the latent (rather than the input) space. We conduct an extensive study that compares the role of different active learning budgets and strategies, the performance with/without memory, the performance with/without ensembling, in both synthetic and real-world datasets, under different data nonstationarity characteristics and class imbalance levels. ActiSiamese outperforms baseline and state-of-the-art algorithms, and is effective under severe imbalance, even only when a fraction of the arriving instances’ labels is available. We publicly release our code to the community.

Notes

This work has been supported by the European Research Council (ERC) under grant agreement No 951424 (Water-Futures), by the European Union's Horizon 2020 research and innovation programme under grant agreements No 883484 (PathoCERT) and No 739551 (TEAMING KIOS CoE), and from the Republic of Cyprus through the Deputy Ministry of Research, Innovation and Digital Policy.

Files

Neurocomputing_ActiSiamese.pdf

Files (11.3 MB)

Name Size Download all
md5:64fd19f17027c7c9385b175cb55c727e
11.3 MB Preview Download

Additional details

Funding

European Commission
Water-Futures – Smart Water Futures: designing the next generation of urban drinking water systems 951424
European Commission
KIOS CoE – KIOS Research and Innovation Centre of Excellence 739551
European Commission
PathoCERT – Pathogen Contamination Emergency Response Technologies 883484