Published April 19, 2021 | Version v1
Journal article Open

WindFlow: High-Speed Continuous Stream Processing With Parallel Building Blocks

Description

Nowadays, we are witnessing the diffusion of Stream Processing Systems (SPSs) able to analyze data streams in near real time. Traditional SPSs like Storm and Flink target distributed clusters and adopt the continuous streaming model, where inputs are processed as soon as they are available while outputs are continuously emitted. Recently, there has been a great focus on SPSs for scale-up machines. Some of them (e.g., BriskStream) still use the continuous model to achieve low latency. Others optimize throughput with batching approaches that are, however, often inadequate to minimize latency for live-streaming applications. Our contribution is a novel software engineering approach to designing the runtime system of SPSs targeting multicores, with the aim of providing a uniform solution able to optimize both throughput and latency. The approach is formal in nature and based on the assembly of components called building blocks, whose composition allows optimizations to be expressed easily. We use this methodology to build a new SPS called WindFlow. Our evaluation showcases the benefits of WindFlow: it provides lower latency than SPSs for continuous streaming, and it can be configured to optimize throughput, performing similarly to, and even better than, batch-based scale-up SPSs.
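The difference between the continuous model and a batching approach can be illustrated with a minimal sketch. This is not WindFlow's API or the paper's implementation; the function names `continuous` and `micro_batched`, and the use of an arrival index as a stand-in for time, are assumptions made only for illustration. Each output is paired with the index of the input whose arrival triggered its emission, so the gap between the two shows how long an item waited.

```python
from typing import Callable, Iterable, Iterator, List, Tuple


def continuous(stream: Iterable[int],
               op: Callable[[int], int]) -> Iterator[Tuple[int, int]]:
    """Continuous model: each input is processed and emitted on arrival."""
    for arrival_index, item in enumerate(stream):
        # The result is emitted at the same step the input arrived.
        yield arrival_index, op(item)


def micro_batched(stream: Iterable[int],
                  op: Callable[[int], int],
                  batch_size: int) -> Iterator[Tuple[int, int]]:
    """Batching model: inputs are buffered and emitted once a batch is full."""
    buffer: List[int] = []
    last_index = -1
    for arrival_index, item in enumerate(stream):
        buffer.append(item)
        last_index = arrival_index
        if len(buffer) == batch_size:
            # All buffered results are emitted at the step that filled the batch.
            for buffered in buffer:
                yield arrival_index, op(buffered)
            buffer.clear()
    # Flush any incomplete batch when the (finite) stream ends.
    for buffered in buffer:
        yield last_index, op(buffered)


def square(x: int) -> int:
    return x * x


if __name__ == "__main__":
    data = [1, 2, 3, 4, 5]
    print(list(continuous(data, square)))
    print(list(micro_batched(data, square, batch_size=3)))
```

With a batch size of 3, the result for the first input only appears once the third input has arrived, whereas in the continuous version it appears immediately. Batching amortizes per-item overhead in a real system, which is the throughput side of the trade-off the abstract describes.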

Files

Preprint-TPDS-2021.pdf (3.0 MB)
md5:adf210760c5e5bc7b8f9ac7ef9297741

Additional details

Funding

European Commission
TEACHING – A computing toolkit for building efficient autonomous applications leveraging humanistic intelligence (grant 871385)