Published April 21, 2021 | Version v1
Conference paper Open

Profiling dataflow systems on multiple abstraction levels

Description

 

Dataflow graphs are a popular abstraction for describing computation, used in many systems for high-level optimization. For execution, dataflow graphs are lowered and optimized through layers of program representations down to machine instructions. Unfortunately, performance profiling such systems is cumbersome, as today's profilers present results merely at instruction and function granularity. This obfuscates the connection between profiles and high-level constructs, such as operators and pipelines, making interpretation of profiles an exercise in puzzling and deduction.

In this paper, we show how to profile compiling dataflow systems at higher abstraction levels. Our approach tracks the code generation process and aggregates profiling data to any abstraction level. This bridges the semantic gap to match the engineer's current information need and even creates a comprehensible way to report timing information within profiling data. We have evaluated this approach within our compiling DBMS Umbra, showing that the approach is generally applicable for compiling dataflow systems and can be implemented with high accuracy and reasonable overhead.

 

Files

profiling.pdf

Files (2.3 MB)

Name Size Download all
md5:13ad56a8794f94325f05dd4f1387ecd8
2.3 MB Preview Download

Additional details

Funding

CompDB – The Computational Database for Real World Awareness 725286
European Commission