Published March 7, 2018 | Version v1
Journal article Open

Harnessing Sliding-Window Execution Semantics for Parallel Stream Processing

  • 1. University of Pisa
  • 2. University of Naples Federico II
  • 3. University of Turin

Description

Following recent trends in data acquisition and processing technology, big data are increasingly available as unbounded streams of elementary data items to be processed in real time. In this paper we study in detail the sliding-window paradigm, a well-known technique for approximate queries that update their results continuously as fresh data arrive from the stream. We focus on the relationship between the existing sliding-window semantics and the way query processing is performed from the parallelism perspective. From this study we identify two alternative parallel models, each covering semantics with precise properties. Each model is described in terms of its pros and cons, and parallel implementations in the FastFlow framework are analyzed by discussing the layout of the concurrent data structures used to represent windows efficiently in each model.
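To make the notion of a continuously updated sliding-window query concrete, here is a minimal sequential sketch of one common semantics: a count-based window of `length` items that advances by `slide` items. It is an illustration only, not one of the paper's parallel models and not FastFlow code. The function name `sliding_window_sums`, its parameters, and the choice of a sum as the query are assumptions introduced for this example.

```python
from collections import deque
from typing import Iterable, Iterator


def sliding_window_sums(stream: Iterable[float], length: int, slide: int) -> Iterator[float]:
    """Yield the sum of each count-based window over `stream`.

    Window k covers items [k * slide, k * slide + length) of the stream
    (0-based positions). Hypothetical helper, for illustration only.
    """
    if length <= 0 or slide <= 0:
        raise ValueError("length and slide must be positive")

    # The deque keeps only the most recent `length` items; older ones are evicted.
    window = deque(maxlen=length)
    for count, item in enumerate(stream, start=1):
        window.append(item)
        # A window closes once it is full and its start is a multiple of `slide`.
        if count >= length and (count - length) % slide == 0:
            yield sum(window)


if __name__ == "__main__":
    # Windows over 1..8 with length 4 and slide 2: [1..4], [3..6], [5..8].
    print(list(sliding_window_sums(range(1, 9), length=4, slide=2)))  # [10, 18, 26]
```

With `slide` smaller than `length`, consecutive windows overlap, so the same item contributes to several results.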

Files

Exp.zip (84.7 kB)
md5:7fdd33b4827bda5b39b7321cba28a3e8

Additional details

Funding

European Commission
RePhrase - REfactoring Parallel Heterogeneous Resource-Aware Applications - a Software Engineering Approach (grant no. 644235)