PUDL Data Release v2.0.0
Creators
- 1. Catalyst Cooperative
Contributors
Project members:
- 1. UC Berkeley
- 2. Carbon Impact Consulting
- 3. Carnegie Mellon University
Description
PUDL Data Release 2.0.0
This is a data release from the Public Utility Data Liberation (PUDL) project.
- GitHub repository for the software used to generate this data.
- Zenodo archive of the particular version (v0.4.0) of the software that went into this release. For use in citations & long-term accessibility you can use this doi: https://doi.org/10.5281/zenodo.5207986
- Documentation and release notes for the software and data.
- The software can be installed via the Python Package Index (PyPI) or from conda-forge.
Using This Data
The data in this archive is stored in a combination of SQLite database files, and Apache Parquet datasets. It can be used as a standalone resource, or in conjunction with the PUDL software. The PUDL documentation contains data dictionaries for many of the data tables.
If you want to use the data in conjunction with the PUDL software, we've included a Docker image within the archive that will run a Jupyter Notebook Server containing examples of use based on our PUDL Examples repository. This Docker image contains all of the required software, and can access the associated archived data.
Make sure that you've got Docker installed and running, and also have docker-compose. You'll want to allocate at least 8GB of memory to Docker.
To use the Docker container to access and work with the data, download and extract the compressed tar archive on you computer.
Inside the directory that is created when you extract the archive, you will find a Docker image. Load that image into your Docker environment locally with:
docker load -i pudl-jupyter.tar
Then within that same directory, run:
docker-compose up
This should start a Jupyter Notebook Server, and provide you with a link to connect to the server running on your local computer, beginning with https://127.0.0.1:48512 or https://localhost:48512
You can select the tutorial notebooks from within the notebook interface. The README file contained in the archive and the PUDL Examples repository both provide more details on how to access and work with the data.
Contact Us
If you're using PUDL, we would love to hear from you! Even if it's just a note to let us know that you exist, and how you're using the software or data. You can also:
- Subscribe to our announcements list for email updates.
- Use the Github issue tracker to file bugs, suggest improvements, or ask for help.
- Email the project team at pudl@catalyst.coop for private communications.
- Follow @CatalystCoop on Twitter.
Files
Files
(6.4 GB)
Name | Size | Download all |
---|---|---|
md5:b8602d2597e1dd07d67e99cc7c4b259c
|
6.4 GB | Download |
Additional details
Related works
- Is compiled by
- Software: 10.5281/zenodo.5207986 (DOI)
- Software: https://github.com/catalyst-cooperative/pudl/tree/v0.4.0 (URL)
- Is documented by
- Software documentation: https://catalystcoop-pudl.readthedocs.io/en/v0.4.0/ (URL)