Dataset Open Access
PUDL Data Release 2.0.0
This is a data release from the Public Utility Data Liberation (PUDL) project.
Using This Data
The data in this archive is stored in a combination of SQLite database files, and Apache Parquet datasets. It can be used as a standalone resource, or in conjunction with the PUDL software. The PUDL documentation contains data dictionaries for many of the data tables.
If you want to use the data in conjunction with the PUDL software, we've included a Docker image within the archive that will run a Jupyter Notebook Server containing examples of use based on our PUDL Examples repository. This Docker image contains all of the required software, and can access the associated archived data.
To use the Docker container to access and work with the data, download and extract the compressed tar archive on you computer.
Inside the directory that is created when you extract the archive, you will find a Docker image. Load that image into your Docker environment locally with:
docker load -i pudl-jupyter.tar
Then within that same directory, run:
This should start a Jupyter Notebook Server, and provide you with a link to connect to the server running on your local computer, beginning with
https://127.0.0.1:48512 or https://localhost:48512
You can select the tutorial notebooks from within the notebook interface. The README file contained in the archive and the PUDL Examples repository both provide more details on how to access and work with the data.
If you're using PUDL, we would love to hear from you! Even if it's just a note to let us know that you exist, and how you're using the software or data. You can also: