Published February 20, 2025 | Version 0.1.0
Computational notebook Open

Preparing the Palmer Penguins Data for the datasets Package in R

  • 1. ROR icon University of Warwick
  • 2. Freelance
  • 3. ROR icon University of Alaska Fairbanks
  • 4. UC Santa Barbara
  • 1. Universität Innsbruck

Description

The Palmer Penguins data, contained in the palmerpenguins R package as the penguins and penguins_raw data frames, have become popular for data exploration and visualisation, particularly in an educational context.
 
With a view to adding this data to the datasets package that is included in the base R distribution, this project contains the following:
 
  • "./data-raw/penguins.R": a script to create penguins and penguins_raw from the data made available through the Environmental Data Initiative
  • "./data/penguins.rda": the data file created from running "./data-raw/penguins.R"
  • "./man/penguins.Rd": A documentation file for the penguins and penguins_raw datasets
  • "./analysis/palmerpenguins-comparison.R": A script comparing our versions of penguins and penguins_raw with those in the palmerpenguins package
  • "./analysis/Gorman_2014_reproduction.Rmd": A computational notebook reproducing Tables 1 and 2 from Gorman et al. (2014)
  • "./analysis/Gorman_2014_reproduction.html": The result of rendering the above-mentioned notebook to html

The data was originally published in Gorman et al. (2014). Their inclusion in the datasets package included in the base R distribution was motivated by Horst et al. (2022).

Files

penguins-datasets-R.zip

Files (253.0 kB)

Name Size Download all
md5:38c3280aa6f6c294fec2c6246c220eb4
253.0 kB Preview Download

Additional details

Funding

Engineering and Physical Sciences Research Council
Sustainability and EDI (Equality, Diversity, and Inclusion) in the R Project EP/V052128/1

Software

Repository URL
https://github.com/EllaKaye/penguins-datasets-r
Programming language
R, RMarkdown

References

  • Gorman, K. B., Williams, T. D., Fraser, W. R. (2014) Ecological sexual dimorphism and environmental variability within a community of Antarctic penguins (genus Pygoscelis). PLoS ONE 9(3):e90081. https://doi.org/10.1371/journal.pone.0090081
  • Horst, A. M., Hill, A. P. and Gorman, K. B. (2022) Palmer Archipelago Penguins Data in the palmerpenguins R Package - An Alternative to Anderson's Irises. R Journal, 14(10). https://doi.org/10.32614/RJ-2022-020