Published March 28, 2024
| Version py-0.20.17
Software
Open
pola-rs/polars: Python Polars 0.20.17
Authors/Creators
- Ritchie Vink
- Stijn de Gooijer1
- Alexander Beedie
- Marco Edward Gorelli2
- Weijie Guo3
- J van Zundert
- Gert Hulselmans4
- universalmind303
- Orson Peters
- Marshall
- chielP
- Simon Lin
- Matteo Santamaria5
- Daniรซl Heres6
- Josh Magarick
- ibENPC
- Moritz Wilksch7
- Jorge Leitao8
- Mick van Gelderen
- Petros Barbagiannis
- Jonas Haag9
- Oliver Borchert10
- Karl Genockey
- Marc van Heerden
- Liam Brannigan
- Colin Jermain
- Joshua Peek
- Ryan Russell
- Chris Pryer
- 1. @pola-rs
- 2. Quansight
- 3. @alibaba
- 4. @aertslab
- 5. University of California, Berkeley
- 6. @coralogix
- 7. @QuantCo
- 8. Munin Data ApS
- 9. forml.eu
- 10. @Quantco
Description
๐ Highlights
- Full plan CSE (#15264)
โ ๏ธ Deprecations
- Rename parameter
bytogroup_byinDataFrame.upsample/group_by_dynamic/rolling(#14840) - Rename
from_reprparameter fromtbltodata(#15156)
๐ Performance improvements
- load bits one word at a time for BitmapIter (#15333)
- Ipc exec multiple paths (#15040)
- add SIMD support for if-then-else kernels (#15131)
โจ Enhancements
- Remove 'FileCacher' optimization (#15357)
- check input type in entropy (#15351)
- Implements
arr.n_unique(#15296) - CSE don't scan share if predicate pushdown predicates don't match (#15328)
- Add
read_databasesupport forSurrealDB("ws" and "http") (#15269) - Only allow inputs of type
Sequenceinfrom_records(#15329) - In hypothesis testing strategies, enable Decimal strategy by default (#15321)
- Remove cached nodes when finished (#15310)
- Full plan CSE (#15264)
- More robust handling of
asyncdatabase calls (#15202) - Add
nameparameter toGroupBy.lenmethod (#15235) - Add IR for expressions. (#15168)
- Improve
read_databasewhen reading from Kรนzu graph database (#15218) - Warn if
map_elementsis called withoutreturn_dtypespecified (#15188) - Add support for
asyncSQLAlchemy connections toread_database(#15162) - Infer
time_unitinpl.durationwhen nanoseconds is specified (#14987) - Add
strictparameter tofrom_dict/from_records(#15158)
๐ Bug fixes
- Return correct dtype for
s.clear()when dtype isObject(#15315) - ensure first datapoint is always included in group_by_dynamic (#15312)
- Non-exhaustive patterns: arrow-schema::DataType in polars-arrow (#15250)
- use dynamic stacks for problematic recursive functions (#15355)
- Adding default ddof for
Series.list.stdandSeries.list.var(#15267) - Raise properly for slices not supported by
LazyFrame(#15331) - Propagate strictness in
from_dicts(#15344) - Raise error when
schema_overridescontains nonexistent column name (#15290) - Enforce integer
dtypeinput forint_rangeandint_ranges(#15339) - Preserve Decimal precision when constructing empty Series (#15320)
- Fix cache dot visualization (#15311)
- Handle special case correctly when slicing a
LazyFrame(#15297) - Properly propagate
strictflag when constructing a Struct Series from any values (#15302) - Consistent expansion of nested struct data during
DataFrameinit from dict (#15217) - Raise when join projects name with suffix that doesn't exist (#15256)
- Ensure ooc_start is set (#15255)
- Fix bug where rolling operations were ignoring
check_sortedin some cases (#15227) - Fix lazy schema for
rleexpression (#15248) - incorrect negative offset in multi-byte string slicing (#15140)
- do not clamp negative offsets to start of array prematurely (#15242)
- allow null index in list.get and array.get (#15239)
- Avoid loading all columns in
read_parquetwhencolumnsparameter is specified (#15229) - properly support nulls_last + descending (#15212)
- fix nested runtime panic (#15216)
- Block rounding/truncating to negative durations (#15175)
- Ensure the
cs.temporal()selector uses wildcard time zone matching forDatetime(#13683) - Consistently raise
TypeErroron constructor failure (#15178) - Properly propagate strictness in some constructor cases (#15166)
- Fix constructing a Series from a list of Series with given dtype (#15144)
๐ Documentation
- Fix time unit in
timestampexample (#15281) - Fix link to renamed method (.list.lengths -> .list.len) (#15228)
- Update Excel and database pages in user guide (#14721)
- Add examples for
Series.search_sorted(#14737) - Correct the definition of an expression in the user guide (#14750)
- Add a note about the behaviour of lower/upper bounds for
is_between, and add an example (#15197)
๐ฆ Build system
- Update Cargo lock (#15370)
๐ ๏ธ Other improvements
- Memory usage test infrastructure, plus a test for #15098 (#15285)
- Update CODEOWNERS (#15352)
- remove try_apply_values (#15336)
- always use non-legacy float_sum for mean (#15343)
- remove legacy bitmap module (#15335)
- Fix test not writing to temporary directory (#15318)
- Reorganize tests for
clearoperation (#15304) - Rename
Cache[count]toCache[cache_hits](#15300) - Cleanup file_caching optimization call (#15299)
- Minor refactor of
PyDataFrame.from_dicts(#15274) - remove 'fast-projection' node (#15253)
- cleanup column names in optimizer (#15252)
- remove left_most_input_name from expr ir (#15251)
- add AlignedBitmapSlice (#15171)
- Run non-benchmark tests in benchmark workflow (#15207)
- Add
wrapping_absto arithmetic kernel (#15210) - remove raw buffers from BinViewArray (#15206)
- Enable
RUST_BACKTRACE=1in the CI test suite (#15204) - Split
read_databasefunctionality into cleaner module structure (#15201) - Clean up some of the AnyValue conversion logic (#15190)
- remove parts of legacy bit_util (#15169)
- remove legacy arrow compute (#15164)
- Split up
dataframemodule in PyO3 bindings (#15165) - Remove unused private constructors (#15160)
Thank you to all our contributors for making this release possible! @MarcoGorelli, @alexander-beedie, @braaannigan, @c-peters, @cojmeister, @deanm0000, @dependabot, @dependabot[bot], @itamarst, @kszlim, @mbuhidar, @mcrumiller, @mickvangelderen, @orlp, @petrosbar, @reswqa, @ritchie46, @rob-sil, @sportfloh, @stinodego and @yutannihilation
Files
pola-rs/polars-py-0.20.17.zip
Files
(4.4 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:7513dbe0b8b4c6f824dd18fa189cd833
|
4.4 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/pola-rs/polars/tree/py-0.20.17 (URL)