Published November 10, 2023 | Version py-0.19.13
Software
Open
pola-rs/polars: Python Polars 0.19.13
Authors/Creators
- Ritchie Vink
- Stijn de Gooijer [1]
- Alexander Beedie
- Marco Edward Gorelli [2]
- J van Zundert
- Gert Hulselmans [3]
- Weijie Guo [4]
- universalmind303
- Orson Peters
- Marshall [5]
- Matteo Santamaria [6]
- Daniël Heres [7]
- Josh Magarick
- ibENPC
- Moritz Wilksch [8]
- Jorge Leitao [9]
- chielP
- Jonas Haag [10]
- nameexhaustion
- Marc van Heerden
- Oliver Borchert [10]
- Colin Jermain
- Joshua Peek
- Ryan Russell
- Chris Pryer [11]
- Adrián Gallego Castellanos [12]
- Jeremy Goh
- Liam Brannigan
- illumination-k
- 1. @pola-rs
- 2. Quansight
- 3. @aertslab
- 4. @alibaba
- 5. The Rockefeller University
- 6. University of California, Berkeley
- 7. @coralogix
- 8. @QuantCo
- 9. Munin Data ApS
- 10. @Quantco
- 11. @twin-labs
- 12. Universidad Carlos III (student)
Description
🏆 Highlights
- improve join performance through radix partitioned join (#12270)
⚠️ Deprecations
- Rename `write_csv` parameter `has_header` to `include_header` (#12351)
- deprecate `_saturating` in duration string language, make it the default (#12301)
- Switch args for `Decimal` and set default `scale=0` (#12224)
- Rename `dt.seconds` to `dt.total_seconds` (likewise for days, hours, minutes, milliseconds, microseconds, and nanoseconds) (#12179)
- Deprecate `DataFrame.as_dict` positional input (#12131)
🚀 Performance improvements
- indexvec in group-by (#12371)
- reduce allocations in hash join (#12368)
- change concurrency parameters (#12321)
- improve join performance through radix partitioned join (#12270)
- remove extra multiplication in hash_to_partition (#12233)
- allow non-power-of-two partitions (#12225)
- Reduce compute in error message for failed datetime parsing (#12147)
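For readers unfamiliar with the idea behind a partitioned hash join, here is a minimal pure-Python sketch. It is a conceptual illustration only and not Polars' Rust implementation: the helpers `hash64`, `hash_to_partition`, `partition`, and `partitioned_inner_join` are hypothetical names written for this example, and the multiply-and-shift range reduction is shown as one common way to support a partition count that is not a power of two.

```python
from collections import defaultdict

MASK64 = (1 << 64) - 1


def hash64(key):
    """Hypothetical 64-bit hash: fold Python's hash() into 64 bits and mix it."""
    return ((hash(key) & MASK64) * 0x9E3779B97F4A7C15) & MASK64


def hash_to_partition(h, n_partitions):
    """Map a 64-bit hash to [0, n_partitions); works for any partition count."""
    return (h * n_partitions) >> 64


def partition(rows, n_partitions):
    """Split rows (tuples keyed on their first element) into partitions."""
    parts = [[] for _ in range(n_partitions)]
    for row in rows:
        parts[hash_to_partition(hash64(row[0]), n_partitions)].append(row)
    return parts


def partitioned_inner_join(left, right, n_partitions=3):
    """Inner-join two lists of tuples on their first element."""
    joined = []
    # Equal keys always land in the same partition index on both sides,
    # so each pair of partitions can be joined independently.
    for lpart, rpart in zip(partition(left, n_partitions), partition(right, n_partitions)):
        table = defaultdict(list)  # small per-partition build table
        for row in rpart:
            table[row[0]].append(row)
        for lrow in lpart:
            for rrow in table.get(lrow[0], ()):
                joined.append(lrow + rrow[1:])
    return joined


left = [(1, "a"), (2, "b"), (3, "c"), (2, "d")]
right = [(2, "x"), (3, "y"), (4, "z"), (2, "w")]
result = sorted(partitioned_inner_join(left, right))
print(result)
```

Because each partition pair is independent, the per-partition build and probe steps can run in parallel, and each build table stays small enough to be cache-friendly.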
✨ Enhancements
- updated `BytecodeParser` for Python 3.12 (#12348)
- Add `round_sig_figs` expression for rounding to significant figures (#11959)
- change concurrency parameters (#12321)
- deprecate `_saturating` in duration string language, make it the default (#12301)
- auto infer `ambiguous` for truncate and round (#12204)
- allow construction of `Datetime` series from `datetime.date` array (#12175)
- New `Config` options for numeric formatting: digit grouping and thousands/decimal separator (#12099)
- allow non-aggregation predicate in ternary groupby (#12286)
- Add `name=` in `.write_avro` to set schema name (#12255)
- Update `write_delta` to write large arrow types without casting (#12260)
- Add support for reading zstd compressed files (no-options) in read_csv (#12214)
- start prefetching all files immediately (#12201)
- expose more options to plugin registration (#12197)
- Add `.list.to_array` expression (#12192)
- consolidate & improve all casting failure error messages (#12168)
- Add Binary dtype to hypothesis tests (#12140)
- tunable concurrency (#12171)
- support reverse sort in streaming (#12169)
- Add `.arr.to_list` expression (#12136)
- Support decimals in assert utils (#12119)
- add concurrency budget (#12117)
- improved support for use of file-like objects with `DataFrame` "write" methods (#12113)
- Introduce ignore_nulls for str.concat (#12108)
🐞 Bug fixes
- Do not cast lit if has same dtype (#12342)
- Fix index column name of rolling/dynamic group by (#12365)
- ternary broadcasting with empty truthy or falsy and agg predicate (#12357)
- uint64 should be correctly extracted from python object (#12338)
- ignore IDE-mediated DeprecationWarning when debugging tests under 3.12 (#12343)
- expr_output_name include literal (#12335)
- Fix Decimal dtype table repr (#12318)
- Fix behavior of month intervals in `date_range` (#12317)
- scan empty csv miss row_count (#12316)
- zip_with also broadcast mask (#12309)
- respect hive_partitioning flag when dealing with multiple files (#12315)
- parquet, add row_count to empty file materialization (#12310)
- Fix invalid DeprecationWarning generated from `date_range` defined with 'saturating' interval (#12311)
- fix download ranges in parquet (#12313)
- object store path derivation for local URL (#12308)
- don't move right endpoint of windows in rolling in default `offset==-period` case (#12267)
- Raise more informative error on invalid `reshape` input (#12288)
- incorrect super type for literals in nested binary exprs (#12238)
- typo in exception message (#12278)
- fix ambiguous aggregation type (#12269)
- return frames from `read_excel` in the originally specified order (#12243)
- Consistently propagate nulls for `numpy` ufuncs (#12212)
- respect return_scalar of list scalars (#12251)
- fix plugins system on Windows (#12230)
- potential overflow (#12206)
- always start a new thread if the thread is already blocking (#12202)
- with_row_count should block predicate push down for lazy csv (#12187)
- rechunk failed-list series before iterate (#12189)
- Fix interchange protocol boolean buffer size (#12177)
- fix incorrect desc sort behavior (#12141)
- `take` should block predicate pushdown (#12130)
- use null type when read from unknown row (#12128)
- boundary predicate to block all accumulated predicates in push down (#12105)
- make python `schema_overrides` information available to the rust-side inference code when initialising from records/dicts (#12045)
- fix panic when initializing Series with array of list dtype (#12148)
- Fix schema of arr.min/max (#12127)
- ensure filter predicate inputs exist in schema (#12089)
🛠️ Other improvements
- updated `BytecodeParser` for Python 3.12 (#12348)
- Workaround for maturin issue (#12370)
- Fix incorrect boundary column name in `group_by_dynamic` docstrings (#12366)
- Fix typo in `rolling_*` docstrings (#12362)
- Fix ruff linting invocation (#12350)
- Clean up conversion utils (#11789)
- Organize Cargo.toml (#12323)
- Consolidate "getting started" and "user guide" sections (#12246)
- Minor updates to prepare for Python 3.12 support (#12314)
- Move script for testing map warning (#12306)
- simplify expr checking in predicate push down (#12287)
- Remove external link (#12223)
- Fix rebase issue breaking CI (#12296)
- Add top-level `make clippy`, simplify Rust linting workflows (#12290)
- ensure we git-ignore ALL `.venv` dirs (#12289)
- incorrect super type for literals in nested binary exprs (#12238)
- Remove recommended setting from IDE docs (#12275)
- Clean up Python test workflow (#12261)
- clarify contains selector (#12265)
- Add `py-polars` to Cargo workspace (#12256)
- Use `.with_columns` in some docstrings (#12250)
- Add test for `scan_csv` plus `slice` (#12239)
- Fix emphasis formatting in docstring (#12240)
- Fix emphasis formatting in docstring (#12237)
- add deprecation notices to the docs for expressions moved into the new `name` namespace (#12236)
- update Cargo.lock (#12226)
- make sort test work with unstable sort (#12221)
- Build Python wheels on `manylinux_2_28` (#12211)
- Include `rust-toolchain.toml` with sdist/wheels (#12184)
- Standardize project name formatting across docs (#12185)
- Update `sqlparser` to `0.39` (#12173)
- pin ring (#12176)
- Improve `strip_{prefix, suffix}` & `strip_chars_{start, end}` (#12161)
- Fix tests for pyarrow 14 (#12170)
- Fix rendering of note in `DataFrame.fold` (#12164)
- Fix triggers for docs deployment (#12159)
- Refactor some tests (#12121)
- Consolidate contributing info (#12109)
- Fix typo in user-guide/expressions/plugins.md (#12115)
- Render docstring text in single backticks as code (#12096)
- use more ergonomic syntax in select/with_columns where possible (#12101)
- Update CODEOWNERS (#12107)
- visualize plugin directory layout in user guide (#12092)
- Minor tweak in code example in section Expressions/Aggregation (#12033)
- Minor tweak in code example in section Expressions/Missing data (#12080)
- Minor improvements to the docs website (#12084)
Thank you to all our contributors for making this release possible! @JulianCologne, @MarcoGorelli, @Priyansh121096, @alexander-beedie, @cmdlineluser, @daviskirk, @dependabot, @dependabot[bot], @dgilman, @hirohira9119, @ion-elgreco, @jrycw, @mcrumiller, @moritzwilksch, @nameexhaustion, @orlp, @owrior, @rancomp, @reswqa, @ritchie46, @rob-sil, @stefmolin, @stinodego and @wsyxbcl
Files
| Name | Size |
|---|---|
| pola-rs/polars-py-0.19.13.zip (md5:02ea8fe330fc9b095add23f364953f2e) | 3.8 MB |
Additional details
Related works
- Is supplement to
- Software: https://github.com/pola-rs/polars/tree/py-0.19.13 (URL)