Report Open Access

Vectorization with Haswell and CilkPlus

Juan José, Fumero Alfonso; Andrzej, Nowak


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.7594</identifier>
  <creators>
    <creator>
      <creatorName>Juan José, Fumero Alfonso</creatorName>
      <affiliation>CERN openlab Summer Student</affiliation>
    </creator>
    <creator>
      <creatorName>Andrzej, Nowak</creatorName>
      <affiliation>Summer Student Supervisor</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Vectorization with Haswell and CilkPlus</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2013</publicationYear>
  <dates>
    <date dateType="Issued">2013-09-01</date>
  </dates>
  <resourceType resourceTypeGeneral="Text">Report</resourceType>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/7594</alternateIdentifier>
  </alternateIdentifiers>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-sa/4.0/">Creative Commons Attribution Share-Alike 4.0</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;&lt;strong&gt;Project Specification:&lt;/strong&gt;&lt;br /&gt;
This project addresses parallel computing and vectorization for physics computing at CERN. The document summarises the results and experience gained from vectorization activities, together with an initial evaluation of the CilkPlus technology on two benchmarks from CERN.&lt;br /&gt;
&lt;strong&gt;Abstract:&lt;/strong&gt;&lt;br /&gt;
With the release of the Intel Sandy Bridge processor, vectorization ceased to be a &amp;ldquo;nice to have&amp;rdquo; feature and became a necessity. This work focuses on optimization through comparative measurements of the vectorization technologies currently under investigation by the CERN Concurrency Forum. In particular, the project involves an assessment of the limits of autovectorization in two compilers, an evaluation of CilkPlus as implemented in ICC/GCC, and an evaluation of the benefits of AVX/AVX2 relative to legacy SSE workloads.&lt;/p&gt;</description>
  </descriptions>
</resource>
