There is a newer version of the record available.

Published August 12, 2022 | Version v1.0.1-book
Book Open

Minimalist Data Wrangling with Python

Authors/Creators

Description

Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results.

This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.


Dr Marek Gagolewski is currently a Senior Lecturer in Applied AI at Deakin University in Melbourne, Australia and an Associate Professor in Data Science (on leave) at the Faculty of Mathematics and Information Science, Warsaw University of Technology, Poland. His research interests are related to data science, in particular: modelling complex phenomena, developing usable, general purpose algorithms, studying their analytical properties, and finding out how people use, misuse, understand, and misunderstand methods of data analysis in research, commercial, and decision making settings. In his spare time, he writes books for his students and develops free (libre) data analysis software, such as stringi – one of the most often downloaded R packages, and genieclust – a fast and robust clustering algorithm in both Python and R.

Notes

Please cite this book as: Gagolewski M. (2022), Minimalist Data Wrangling with Python, Zenodo, Melbourne, DOI: 10.5281/zenodo.6451068, ISBN: 978-0-6455719-1-2, URL: https://datawranglingpy.gagolewski.com/

Files

datawranglingpy-screen-v1.0.1-20220812.pdf

Files (31.9 MB)

Name Size Download all
md5:56454c4665835b99d35db59d41a6e15c
6.2 MB Preview Download
md5:8d191fc0d3e919f1b7f28293767cc3b1
25.7 MB Preview Download

Additional details

Related works

Is published in
Book: 978-0-645-57191-2 (ISBN)
Presentation: https://datawranglingpy.gagolewski.com (URL)
Is supplement to
Software: https://github.com/gagolews/datawranglingpy/tree/v1.0.1 (URL)