Does Coding in Pythonic Zen Peak Performance? Preliminary Experiments of Nine Pythonic Idioms at Scale
Creators
- 1. Faculty of Engineering, Kasetsart University
- 2. Nara Institute of Science and Technology
Description
Link to pre-print: https://arxiv.org/abs/2203.14484
How to run
- Extract pythonnic_performance.zip and move into the extracted directory.
- Execute install.sh to install required dependencies.
- Execute run.sh to start the experiment.
- Execute python statistic.py to perform the statistical test and show data statistics.
Note that files in submitted_output are our experimental results shown in the paper.
Abstract
In field of data science, and for academics in general, the Python programming language is a popular choice, mainly because of its libraries for storing, manipulating, and gaining insight from data.
Evidence includes the versatile set of machine learning, data visualization, and manipulation packages used for the ever-growing size of available data.
The Zen of Python is a set of guiding design principles that developers use to write acceptable and elegant Python code.
Most principles revolve around simplicity.
However, as the need to compute on large amounts of data, performance has become a necessity for the Python programmer.
The new idea in this paper is to empirically confirm whether writing the Pythonic way peaks performance at scale.
As a starting point, we conduct a set of preliminary experiments to evaluate nine Pythonic code examples by comparing the performance of both Pythonic and Non-Pythonic code snippets.
Our results reveal that writing in Pythonic idioms may save memory and time.
We show that incorporating list comprehension, generator expression, zip, and itertools.zip\_longest can save up to 7,000 MB and 32.25 seconds.
The results open more questions on how they could be utilized in a real-world setting.
Files
pythonic_performance.zip
Files
(575.1 MB)
Name | Size | Download all |
---|---|---|
md5:68ce77b0e045bee55db48bdac9bc434d
|
575.1 MB | Preview Download |
md5:cf8a56ad70998fb28c857c3e04e0a7ee
|
229 Bytes | Download |