Published August 24, 2017 | Version v1
Journal article Open

Continuous Skyline Queries on Multicore Architectures

  • 1. University of Pisa, Italy

Description

The emergence of real-time decision-making applications in domains like high-frequency trading, emergency management, and service level analysis in communication networks has led to the definition of new classes of queries. Skyline queries are a notable example. Their results consist of all the tuples whose attribute vector is not dominated (in the Pareto sense) by one of any other tuple. Because of their popularity, skyline queries have been studied in terms of both sequential algorithms and parallel implementations for multiprocessors and clusters. Within the Data Stream Processing paradigm, traditional database queries on static relations have been revised in order to operate on continuous data streams. Most of the past papers propose sequential algorithms for continuous skyline queries, whereas there exist very few works targeting implementations on parallel machines. This paper contributes to fill this gap by proposing a parallel implementation for multicore architectures. We propose (i) a parallelization of the eager algorithm based on the notion of Skyline Influence Time, (ii) optimizations of the reduce phase and load-balancing strategies to achieve near-optimal speedup, and (iii) a set of experiments with both synthetic benchmarks and a real dataset in order to show our implementation effectiveness.

Files

Exp.zip

Files (105.8 MB)

Name Size Download all
md5:9961c1654e5b3754f0cb5384d317390a
19.4 MB Preview Download
md5:9ebe096011def01b3d398fb627488882
1.2 MB Preview Download
md5:47c9bf0b70ceb5dc181335726f107ccf
85.2 MB Preview Download

Additional details

Related works

Is supplement to
10.1002/cpe.3866 (DOI)

Funding

RePhrase – REfactoring Parallel Heterogeneous Resource-Aware Applications - a Software Engineering Approach 644235
European Commission