There is a newer version of the record available.

Published April 15, 2023 | Version py-0.17.3
Software Open

pola-rs/polars: Python Polars 0.17.3

  • 1. @aertslab
  • 2. Quansight
  • 3. University of California, Berkeley
  • 4. @coralogix
  • 5. Munin Data ApS
  • 6. @twin-labs
  • 7. Universidad Carlos III(student)
  • 8. @QuantCo
  • 9. @Quantco
  • 10. Jupiter Intelligence
  • 11. @solokeys @trussed-dev @yamnord @ycrypto
  • 12. Pharmatronics (a one man consultancy)
  • 13. Aguila Engineering

Description

🏆 Highlights

  • support DataFrame init from pydantic model data (#8178)
🚀 Performance improvements
  • fail fast on non-inferable strings in strptime if no fmt is provided (#8111)
  • make chunks search more resilient (#8229)
  • SIMD accelerated arg_min/arg_max (via argminmax) (#8074)
  • speed up csv parsing for slower datetimes formats (#8213)
  • improve datetime interpret perf (#8209)
  • arr.eval run on groupby expression engine when possible (#8199)
  • ~2-3x speedup for DataFrame init from pydantic models (#8181)
✨ Enhancements
  • add use_earliest argument to replace_time_zone for dealing with ambiguous datetimes (#8087)
  • fail loudly on .%f directive, as it differs from the Python standard library (#8237)
  • SQL CTE's (#8208)
  • automatically convert series OP expr -> pl.lit(series) OP expr where OP is arithmetic (#8225)
  • add pickle support for LazyFrame (#8220)
  • add duration cumsum and remainder (#8219)
  • support DataFrame init from nested dataclass, pydantic, and NamedTuple objects (#8185)
  • better algorithm for streaming unique (#8003)
  • Add approx distinct count via approx_unique() (#7937)
  • add percentiles to describe methods (#8169)
  • support DataFrame init from pydantic model data (#8178)
  • display skipped row if same number of rows (#8170)
🐞 Bug fixes
  • add special numpy float branch in anyvalue conversion (#8259)
  • fix boolean par materialization (#8257)
  • improve null/empty list construction (#8255)
  • fix offsets in parallel utf8 materialization (#8254)
  • nested struct logical type consistency (#8249)
  • keep literal state if elementwise function is applied (#8195)
  • decimal ensure backed arrow arrays have correct dtype (#8193)
🛠️ Other improvements
  • parametric/hypothesis testing code cleanups (#8253)
  • Rename strptime/strftime args (#8221)
  • change sampling ratio for groupby strategy (#8223)
  • Rename Expr.list to implode (#8165)
  • don't panic on err in offset_by (#8210)
  • re-enable test parallization for Windows tests (#8214)
  • Fix small typo: "im memory" -> "in memory" (#8187)
  • remove unused dtype_to_arrow_type (#8177)

Thank you to all our contributors for making this release possible! @MarcoGorelli, @alexander-beedie, @avimallu, @borchero, @chitralverma, @clickingbuttons, @ghuls, @josh, @jvdd, @rben01, @ritchie46, @stinodego and @universalmind303

Files

pola-rs/polars-py-0.17.3.zip

Files (2.2 MB)

Name Size Download all
md5:5a34d294d588481acfe2d9c28c95ffc7
2.2 MB Preview Download

Additional details

Related works