There is a newer version of the record available.

Published August 17, 2021 | Version 2.0.0
Dataset Open

PUDL Data Release v2.0.0

  • 1. UC Berkeley
  • 2. Carbon Impact Consulting
  • 3. Carnegie Mellon University

Description

PUDL Data Release 2.0.0

This is a data release from the Public Utility Data Liberation (PUDL) project.

Using This Data

The data in this archive is stored in a combination of SQLite database files and Apache Parquet datasets. It can be used as a standalone resource or in conjunction with the PUDL software. The PUDL documentation contains data dictionaries for many of the data tables.
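As a rough illustration of standalone use, the SQLite files can be inspected with nothing but the Python standard library. The sketch below is not part of the PUDL software; the helper names `list_tables` and `preview_rows` are hypothetical, and the path you pass in depends on where the SQLite files sit inside the extracted archive.

```python
import sqlite3
from pathlib import Path


def _connect_read_only(db_path):
    """Open a SQLite file read-only so a mistyped path raises an error
    instead of silently creating a new, empty database."""
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    return sqlite3.connect(uri, uri=True)


def list_tables(db_path):
    """Return the names of all tables in the database, sorted alphabetically."""
    conn = _connect_read_only(db_path)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]


def preview_rows(db_path, table, limit=5):
    """Return (column_names, rows) for the first `limit` rows of `table`."""
    # Only query tables that actually exist; table names cannot be bound
    # as SQL parameters, so validate before building the statement.
    if table not in list_tables(db_path):
        raise ValueError(f"No table named {table!r} in {db_path}")
    quoted = '"' + table.replace('"', '""') + '"'
    conn = _connect_read_only(db_path)
    try:
        cursor = conn.execute(f"SELECT * FROM {quoted} LIMIT ?", (limit,))
        columns = [description[0] for description in cursor.description]
        rows = cursor.fetchall()
    finally:
        conn.close()
    return columns, rows
```

Calling `list_tables` on one of the extracted `.sqlite` files gives the table names to look up in the data dictionaries. The Parquet datasets need a third-party reader; `pandas.read_parquet` with `pyarrow` installed is one common option.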

If you want to use the data in conjunction with the PUDL software, we've included a Docker image within the archive that will run a Jupyter Notebook Server containing examples of use based on our PUDL Examples repository. This Docker image contains all of the required software, and can access the associated archived data.

Make sure that Docker is installed and running, and that docker-compose is also available. You'll want to allocate at least 8 GB of memory to Docker.
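One quick way to confirm that both commands are available is to look them up on your PATH. This is only a sketch with a hypothetical helper name, `missing_tools`. It does not check that the Docker daemon is actually running or how much memory it has been given.

```python
import shutil


def missing_tools(tools=("docker", "docker-compose")):
    """Return the command names from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("docker and docker-compose were both found.")
```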

To use the Docker container to access and work with the data, download and extract the compressed tar archive on your computer.
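Any tool that handles compressed tar files can do the extraction. For readers who prefer to script it, here is a minimal Python sketch; the function name `extract_archive` is hypothetical, and the archive path is whatever you named the downloaded file.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir="."):
    """Extract a tar archive into dest_dir and return its top-level entries."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    top_level = set()
    # "r:*" lets tarfile detect gzip, bzip2, or xz compression on its own.
    with tarfile.open(archive_path, "r:*") as tar:
        for member in tar.getmembers():
            parts = Path(member.name).parts
            if parts:
                top_level.add(parts[0])
        # Use the safer "data" extraction filter where this Python supports it.
        kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
        tar.extractall(path=dest, **kwargs)
    return sorted(top_level)
```

The returned list names the directory (or directories) created by the extraction, which is where the remaining steps are run.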

Inside the directory that is created when you extract the archive, you will find a Docker image. Load that image into your Docker environment locally with:  

docker load -i pudl-jupyter.tar

Then within that same directory, run:

docker-compose up

This should start a Jupyter Notebook Server, and provide you with a link to connect to the server running on your local computer, beginning with https://127.0.0.1:48512 or https://localhost:48512
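If the link does not open, it can help to check whether anything is accepting connections on that port at all. The following sketch, with a hypothetical helper name `server_is_listening`, only tests that a TCP connection succeeds. It does not confirm that the listener is the Jupyter server or that your access token is valid.

```python
import socket


def server_is_listening(host="127.0.0.1", port=48512, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    if server_is_listening():
        print("Something is listening on port 48512.")
    else:
        print("Nothing is listening on port 48512; check the docker-compose output.")
```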

You can select the tutorial notebooks from within the notebook interface. The README file contained in the archive and the PUDL Examples repository both provide more details on how to access and work with the data.

Contact Us

If you're using PUDL, we would love to hear from you, even if it's just a note to let us know that you exist and how you're using the software or data.

Files

Files (6.4 GB)

md5:b8602d2597e1dd07d67e99cc7c4b259c (6.4 GB)
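
The MD5 checksum listed above can be used to confirm that a 6.4 GB download arrived intact before extracting it. A minimal sketch, assuming the archive has been saved locally; the helper names `md5_of_file` and `matches_expected` are hypothetical.

```python
import hashlib

# Checksum published with this record.
EXPECTED_MD5 = "b8602d2597e1dd07d67e99cc7c4b259c"


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks so that
    a multi-gigabyte archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected.strip().lower()
```

A `False` result from `matches_expected` usually means the download was truncated or corrupted and should be repeated.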

Additional details

Related works

Is compiled by
Software: 10.5281/zenodo.5207986 (DOI)
Software: https://github.com/catalyst-cooperative/pudl/tree/v0.4.0 (URL)
Is documented by
Software documentation: https://catalystcoop-pudl.readthedocs.io/en/v0.4.0/ (URL)