Presentation Open Access

flox: Fast & furious GroupBy reductions with Dask at Pangeo-scale

Cherian, Deepak


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Cherian, Deepak</dc:creator>
  <dc:date>2021-11-17</dc:date>
  <dc:description>The "groupby" or the "split-apply-combine" paradigm is ubiquitous in scientific analysis, though it may be named differently e.g. "binning", "histogramming", "resampling", "compositing", or "climatology reductions". Xarray implements the groupby paradigm through a "GroupBy" object. Historically the underlying algorithm is not dask-aware, and tends to fail disastrously with large Pangeo-scale distributed workflows. Here I present "flox": a new package that explores effective strategies for groupby reductions at scale with dask. Ongoing work will plug this package in to xarray in a backwards-compatible manner, allowing the community to seamlessly benefit from significantly more efficient groupby computations.See https://flox.readthedocs.io for more.</dc:description>
  <dc:identifier>https://zenodo.org/record/5772165</dc:identifier>
  <dc:identifier>10.5281/zenodo.5772165</dc:identifier>
  <dc:identifier>oai:zenodo.org:5772165</dc:identifier>
  <dc:relation>doi:10.5281/zenodo.5772164</dc:relation>
  <dc:relation>url:https://zenodo.org/communities/pangeo</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>Pangeo</dc:subject>
  <dc:subject>Xarray</dc:subject>
  <dc:title>flox: Fast &amp; furious GroupBy reductions with Dask at Pangeo-scale</dc:title>
  <dc:type>info:eu-repo/semantics/lecture</dc:type>
  <dc:type>presentation</dc:type>
</oai_dc:dc>
77
25
views
downloads
All versions This version
Views 7777
Downloads 2525
Data volume 293.4 MB293.4 MB
Unique views 7070
Unique downloads 2323

Share

Cite as