Software Open Access

The GPU version of LICOM3 under HIP framework and its large-scale application (updated)

Hailong Liu; Pengfei Wang; Jinrong Jiang; Pengfei Lin


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">LASG/IAP Climate system Ocean Model version 3 (LICOM3)</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Heterogeneous-compute Interface for Portability (HIP)</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Graphics processing units (GPUs)</subfield>
  </datafield>
  <controlfield tag="005">20201209081128.0</controlfield>
  <controlfield tag="001">4302813</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">LASG, Institute of Atmospheric Physics, Chinese Academy of Sciences</subfield>
    <subfield code="0">(orcid)0000-0001-8561-6733</subfield>
    <subfield code="a">Pengfei Wang</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Computer Network Information Center, Chinese Academy of Sciences</subfield>
    <subfield code="0">(orcid)0000-0003-4463-8666</subfield>
    <subfield code="a">Jinrong Jiang</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">LASG, Institute of Atmospheric Physics, Chinese Academy of Sciences</subfield>
    <subfield code="0">(orcid)0000-0003-2361-0066</subfield>
    <subfield code="a">Pengfei Lin</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2992010704</subfield>
    <subfield code="z">md5:fedab598d523d0a03caa81a48aad7a66</subfield>
    <subfield code="u">https://zenodo.org/record/4302813/files/gmd-licom3-hip-codedata.tar.gz</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2020-12-03</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">software</subfield>
    <subfield code="o">oai:zenodo.org:4302813</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">LASG, Institute of Atmospheric Physics, Chinese Academy of Sciences</subfield>
    <subfield code="0">(orcid)0000-0002-8780-0398</subfield>
    <subfield code="a">Hailong Liu</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">The GPU version of LICOM3 under HIP framework and its large-scale application (updated)</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;A high-resolution (1/20&amp;deg;) global ocean general circulation model with Graphics processing units (GPUs) code implementations is developed based on the LASG/IAP Climate system Ocean Model version 3 (LICOM3) under Heterogeneous-compute Interface for Portability (HIP) framework. The dynamic core and physics package of LICOM3 are both ported to the GPU, and 3-dimensional parallelization is applied. The HIP version of the LICOM3 (LICOM3-HIP) is 42 times faster than what the same number of CPU cores dose, when 384 AMD GPUs and CPU cores are used. The LICOM3-HIP has excellent scalability; it can still obtain speedup of more than four on 9216 GPUs comparing to 384 GPUs. In this phase, we successfully performed a test of 1/20&amp;deg; LICOM3-HIP using 6550 nodes and 26200 GPUs, and at the grand scale, the model&amp;rsquo;s time to solution can still obtain an increasing, about 2.72 simulated years per day (SYPD). The high performance was due to putting almost all of computation processes inside GPUs, and thus greatly reduces the time cost of data transfer between CPUs and GPUs. At the same time, a 14-year spin-up integration following the phase 2 of Ocean Model Intercomparison Project (OMIP-2) protocol of surface forcing has been conducted, and the preliminary results have been evaluated. We found that the model results have little differences from the CPU version. Further comparison with observations and lower-resolution LICOM3 results suggests that the 1/20&amp;deg; LICOM3-HIP can not only reproduce the observations, but also produce much smaller scale activities, such as submesoscale eddies and frontal scales structures.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4302812</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4302813</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">software</subfield>
  </datafield>
</record>
42
8
views
downloads
All versions This version
Views 4242
Downloads 88
Data volume 23.9 GB23.9 GB
Unique views 3535
Unique downloads 77

Share

Cite as