Conference paper Open Access

Distributing and Parallelizing Non-canonical Loops

Aubert, Clément; Rubiano, Thomas; Rusch, Neea; Seiller, Thomas

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
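<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">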
  <dc:creator>Aubert, Clément</dc:creator>
  <dc:creator>Rubiano, Thomas</dc:creator>
  <dc:creator>Rusch, Neea</dc:creator>
  <dc:creator>Seiller, Thomas</dc:creator>
  <dc:description>This work leverages an original dependency analysis to parallelize loops regardless of their form in imperative programs. Our algorithm distributes a loop into multiple parallelizable loops, resulting in gains in execution time comparable to state-of-the-art automatic source-to-source code transformers when both are applicable. Our graph-based algorithm is intuitive, language-agnostic, proven correct, and applicable to all types of loops. Importantly, it can be applied even if the loop iteration space is unknown statically or at compile time, or more generally if the loop is not in canonical form or contains loop-carried dependencies. As contributions, we deliver the computational technique, a proof of its preservation of semantic correctness, and experimental results to quantify the expected performance gains. We also show that many comparable tools cannot distribute the loops we optimize, and that our technique can be seamlessly integrated into compiler passes or other automatic parallelization suites.</dc:description>
  <dc:source>Lecture Notes in Computer Science 1-24</dc:source>
  <dc:title>Distributing and Parallelizing Non-canonical Loops</dc:title>
</oai_dc:dc>