Published October 22, 2019
| Version v1
Conference paper
Open
Global optimization of operand transfer fusion in heterogeneous computing
Description
We consider the problem of minimizing, for a dataflow graph of kernel calls, the overall number of operand data transfers, and thus, the accumulated transfer startup overhead, in heterogeneous systems with non-shared memory. Our approach analyzes the kernel-operand dependence graph and reorders the operand arrays in memory such that transfers and memory allocations of multiple operands adjacent in memory can be merged, saving transfer startup costs and memory allocation overheads.
Files
global_optimization.pdf
Files
(5.4 MB)
Name | Size | Download all |
---|---|---|
md5:f658ca9e00a2c91479568f39439dc743
|
5.4 MB | Preview Download |