Published November 22, 2018
| Version v0.15.0
Software
Open
CamDavidsonPilon/lifelines: v0.15.0
Authors/Creators
- Cameron Davidson-Pilon1
- Jonas Kalderstam2
- Ben Kuhn3
- Paul Zivich
- Andrew Fiore-Gartland4
- Luis Moneda
- Daniel WIlson5
- Alex Parij6
- Kyle Stark
- Steven Anton7
- Lilian Besson8
- Jona9
- Harsh Gadgil10
- Dave Golland
- Sean Hussey11
- Javad Noorbakhsh12
- Andreas Klintberg
- Matt Braymer-Hayes13
- Lukasz14
- Jonathan Séguin15
- Jeff Rose16
- Isaac Slavitt
- Eric Martin
- Eduardo Ochoa
- Dylan Albrecht
- dhuynh17
- Denis Zgonjanin1
- Daniel Chen18
- Chris Fournier1
- André F. Rendeiro19
- 1. Shopify
- 2. @neo4j
- 3. Wave
- 4. Fred Hutchinson Cancer Research Center
- 5. @Esri
- 6. Autodesk
- 7. ID Analytics
- 8. ENS de Cachan - Paris Saclay University
- 9. Berlin Institute for Medical Systems Biology
- 10. Bell
- 11. Ampion, Inc.
- 12. @TheJacksonLaboratory
- 13. @lytics
- 14. Axelspace
- 15. IRIC | Plateforme de bioinformatique
- 16. Origin Rose
- 17. @Microsoft
- 18. @VirginiaTech - @bi-sdal - GBCB
- 19. CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences
Description
- adding
robustparams toCoxPHFitter'sfit. This enables atleast i) using non-integer weights in the model (these could be sampling weights like IPTW), and ii) mis-specified models (ex: non-proportional hazards). Under the hood it's a sandwich estimator. This does not handle ties, so if there are high number of ties, results may significantly differ from other software. standard_errors_is now a property on fittedCoxPHFitterwhich describes the standard errors of the coefficients.variance_matrix_is now a property on fittedCoxPHFitterwhich describes the variance matrix of the coefficients.- new criteria for convergence of
CoxPHFitterandCoxTimeVaryingFittercalled the Newton-decrement. Tests show it is as accurate (w.r.t to previous coefficients) and typically shaves off a single step, resulting in generally faster convergence. See https://www.cs.cmu.edu/~pradeepr/convexopt/Lecture_Slides/Newton_methods.pdf. Details about the Newton-decrement are added to theshow_progressstatements. - Minimum suppport for scipy is 1.0
- Convergence errors in models that use Newton-Rhapson methods now throw a
ConvergenceError, instead of aValueError(the former is a subclass of the latter, however). AalenAdditiveModelraisesConvergenceWarninginstead of printing a warning.KaplanMeierFitternow has a cumulative plot option. Examplekmf.plot(invert_y_axis=True)- a
weights_coloption has been added toCoxTimeVaryingFitterthat allows for time-varying weights. WeibullFitterhas a newshow_progressparam and additional information if the convergence fails.CoxPHFitter,ExponentialFitter,WeibullFitterandCoxTimeVaryFittermethodprint_summaryis updated with new fields.WeibullFitterhas renamed the incorrect_jacobianto_hessian_.variance_matrix_is now a property on fittedWeibullFitterwhich describes the variance matrix of the parameters.- The default
WeibullFitter().timelinehas changed from integers between the min and max duration to n floats between the max and min durations, where n is the number of observations. - Performance improvements for
CoxPHFitter(~20% faster) - Performance improvements for
CoxTimeVaryingFitter(~100% faster) - In Python3, Univariate models are now serialisable with
pickle. Thanks @dwilson1988 for the contribution. For Python2,dillis still the preferred method. baseline_cumulative_hazard_(and derivatives of that) onCoxPHFitternow correctly incorporate theweights_col.- Fixed a bug in
KaplanMeierFitterwhen late entry times lined up with death events. Thanks @pzivich - Adding
cluster_colargument toCoxPHFitterso users can specify groups of subjects/rows that may be correlated. - Shifting the "significance codes" for p-values down an order of magnitude. (Example, p-values between 0.1 and 0.05 are not noted at all and p-values between 0.05 and 0.1 are noted with
., etc.). This deviates with how they are presented in other software. There is an argument to be made to remove p-values from lifelines altogether (become the changes you want to see in the world lol), but I worry that people could compute the p-values by hand incorrectly, a worse outcome I think. So, this is my stance. P-values between 0.1 and 0.05 offer very little information, so they are removed. There is a growing movement in statistics to shift "signficant" findings to p-values less than 0.01 anyways. - New fitter for cumulative incidence of multiple risks
AalenJohansenFitter. Thanks @pzivich! See "Methodologic Issues When Estimating Risks in Pharmacoepidemiology" for a nice overview of the model.
Files
CamDavidsonPilon/lifelines-v0.15.0.zip
Files
(3.6 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:6aafa024deb247ad3b0db41fc017234f
|
3.6 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/CamDavidsonPilon/lifelines/tree/v0.15.0 (URL)