Published October 22, 2019 | Version v1
Conference paper Open

Global optimization of operand transfer fusion in heterogeneous computing

  • 1. Linköping University

Description

We consider the problem of minimizing the overall number of operand data transfers for a dataflow graph of kernel calls, and thus the accumulated transfer startup overhead, in heterogeneous systems with non-shared memory. Our approach analyzes the kernel-operand dependence graph and reorders the operand arrays in memory so that transfers and memory allocations of multiple operands that are adjacent in memory can be merged, saving transfer startup costs and memory allocation overheads.
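To make the idea of transfer fusion concrete, the following is a minimal sketch under simplifying assumptions; it is not the algorithm from the paper. It assumes each kernel call transfers all of its operands, that operands lying next to each other in memory can be sent as one merged transfer, and that only the number of transfers (the startup overhead) matters, ignoring operand sizes, data reuse on the device, and the ordering constraints of the dependence graph. The names `count_transfers` and `best_layout` are hypothetical, and the exhaustive search is only practical for a handful of operands.

```python
from itertools import permutations


def count_transfers(layout, kernels):
    """Count merged transfers for a given memory layout (toy model).

    layout  -- sequence of operand names in memory order
    kernels -- iterable of operand-name sets, one set per kernel call
    Each kernel needs one transfer per maximal run of its operands
    that occupy consecutive slots in the layout.
    """
    position = {name: i for i, name in enumerate(layout)}
    total = 0
    for operands in kernels:
        slots = sorted(position[name] for name in operands)
        # A new transfer starts at the first slot and wherever the next
        # slot is not directly adjacent to the previous one.
        total += sum(
            1 for i, s in enumerate(slots) if i == 0 or s != slots[i - 1] + 1
        )
    return total


def best_layout(operands, kernels):
    """Brute-force search over all layouts (exponential, illustration only)."""
    return min(
        permutations(operands),
        key=lambda layout: count_transfers(layout, kernels),
    )


# Hypothetical example: three kernel calls sharing four operand arrays.
kernels = [{"A", "B"}, {"B", "C"}, {"A", "D"}]
naive = ("A", "C", "B", "D")
improved = best_layout(["A", "B", "C", "D"], kernels)

print(count_transfers(naive, kernels))     # transfers with the naive layout
print(count_transfers(improved, kernels))  # transfers after reordering
```

In this toy instance the reordered layout lets every kernel fetch its operands in a single merged transfer, whereas the naive layout splits two of the three kernels' operands into separate transfers.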

Files

global_optimization.pdf (5.4 MB)
md5:f658ca9e00a2c91479568f39439dc743

Additional details

Funding

European Commission: EXA2PRO – Enhancing Programmability and boosting Performance Portability for Exascale Computing Systems (grant 801015)