Public Utility Data Liberation Project (PUDL) Data Release
Authors/Creators
- 1. Catalyst Cooperative
Description
v2026.2.0 (2026-02-12)
This is a quarterly PUDL data release, and includes quarterly updates to data sources that are released continuously, like EIA-930, bulk EIA electricity API data, EPA CEMS hourly emissions and EIA-860M. This is also our first release of the FERC EQR company identifiers table, and tables from USDA’s Rural Utility Service (RUS) forms 7 and 12, which collect financial and operational information about rural utilities in a manner similar to EIA Form 861 and FERC Form 1. FERC EQR data is now available for download, though in a slightly different location due to its scale. Along for the ride are improvements to accuracy, memory performance, and Zenodo handling. See below for all the details.
New Data
RUS 7
-
Extracted data for ten USDA RUS tables. See #4897 and PR #4906.
-
Transformed and published USDA RUS tables. See #4885, PR #4939, PR #4971 and PR #4974.
RUS-12
-
Extracted data for twelve USDA RUS tables. See #4900 and PR #4916.
-
Transformed and published USDA RUS tables. See #4901, PR #4970 and PR #4979.
FERC EQR
-
Added the company identifiers (CID) table from EQR. See #4851 and #4967. Also, note that the actual FERC EQR data is available on PUDL Viewer as well as on S3 for direct download
Expanded Data Coverage
-
Updated DOIs for the EIA-191 and EIA-757a (they pertain to natural gas) since we extract them, even though we don’t process the data yet. This added 2 more years to the EIA-191 data. See PR #4879.
EPA CEMS
EIA-860M
EIA-923
EIA-930
EIA Bulk Electricity API
EIA-176
Documentation
-
Added a data source documentation page for the FERC EQR. See #4852 and PR #4879.
-
Added data access instructions for the FERC EQR and created examples specific to our larger (>1GB) and partitioned tables in the PUDL Data Dictionary. See issues #4869, #4951 and PR #4958. Affected tables include:
Bug Fixes & Data Cleaning
-
We added an automatic script to help match FERC and EIA utilities with near-identical utility names as part of our ongoing data updates. As a result, we have matched an additional 115 utilities and resolved a small handful of cases where a FERC utility was mapped to more than one PUDL ID. Through this process, we also identified a bug that was resulting in us assigning the least common utility name and prime mover code to records to harvested EIA records when there were inconsistent values reported. Fixing this resulted in overall improved accuracy of the data. 3,650 utilities were reassigned names, resulting in approximately 150 additional matches to SEC 10K filings. 86 generators were reassigned prime mover codes, resulting in re-allocated net generation. See #1317, #4934 and #4913, as well as PR #4975.
Performance Improvements
-
Improved memory performance of EIA-930 by translating transforms to use
duckdb.
Quality of Life Improvements
-
Consolidated local and remote Zenodo cache management under a single API that uses the high-level abstraction of the
upath.UPathclass. See issue #4860 and PR #4870. -
Pulled the list of Zenodo DOIs that define the raw input data used by PUDL out into a stand-alone settings file, rather than hard-coding them in the PUDL Datastore module. This makes the DOIs more easily accessible for use in other contexts, such as when calculating the GitHub Actions cache hash. Also made the GitHub Actions cache more lenient, so that if it misses on an exact cache key, it will just download the most recent cache of inputs. This should reduce the amount of data we need to download to run the CI on GitHub and speed things up slightly. It also means we can be more selective about when the
zenodo-cache-syncworkflow is run. Now it is only triggered when thezenodo_dois.ymlfile is changed, not any time the Datastore module is changed. See issue #4494 and PR #4870. -
Modernized the
datapackage.jsonmetadata stored on Zenodo for the Census DP1 data source, enabling the removal of a special case in the Datastore that only existed to deal with very old archive metadata. See PR #4879. -
Data source documentation pages now display the source data concept DOI with a link to the archive on Zenodo. See PR #4879.
-
Made a change to the Datastore that allows it to obtain metadata from a
datapackage.jsonfile stored on Zenodo, even if the data referenced by the data package is stored on GCS, as is the case with FERC EQR. See the FERC EQR archive on Zenodo as an example. See PR #4879. -
Added handling to
pudl.transform.classes.StringCategoriesso that thecategorieskey of transform params can be specified as a Path to a YAML file. This will make it possible to include large categorization sets without the params Python files becoming unwieldy. See PR #4978.
Other PUDL v2026.2.0 Resources
- PUDL v2026.2.0 Data Dictionary
- PUDL v2026.2.0 Documentation
- PUDL in the AWS Open Data Registry
- PUDL v2026.2.0 in a free, public AWS S3 bucket: s3://pudl.catalyst.coop/v2026.2.0/
- PUDL v2026.2.0 in a requester-pays GCS bucket: gs://pudl.catalyst.coop/v2026.2.0/
- Zenodo archive of the PUDL GitHub repo for this release
- PUDL v2026.2.0 release on GitHub
Contact Us
If you're using PUDL, we would love to hear from you! Even if it's just a note to let us know that you exist, and how you're using the software or data. Here's a bunch of different ways to get in touch:
- Follow us on GitHub
- Use the PUDL Github issue tracker to let us know about any bugs or data issues you encounter
- GitHub Discussions is where we provide user support.
- Watch our GitHub Project to see what we're working on.
- Email us at hello@catalyst.coop for private communications.
- On Mastodon: @CatalystCoop@mastodon.energy
- On BlueSky: @catalyst.coop
- On Twitter: @CatalystCoop
- Connect with us on LinkedIn
- Play with our data and notebooks on Kaggle
- Combine our data with ML models on HuggingFace
- Learn more about us on our website: https://catalyst.coop
- Subscribe to our announcements list for email updates.
Files
pudl_parquet.zip
Files
(17.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:baa43b57904e4716bafa0189bfb56f87
|
18.3 MB | Download |
|
md5:9d8232252995b90251b80920d28cb030
|
506.7 MB | Preview Download |
|
md5:9f0e58247263e5e4cbf0abf5f7aa3b2c
|
271.3 MB | Preview Download |
|
md5:2bca347e657609d6e599fcade53c426a
|
1.0 GB | Download |
|
md5:17217098f059d35bdeb418e9ea5b781f
|
190.8 MB | Preview Download |
|
md5:d3c473fd28b996914eb1c7cf9c6484dd
|
2.3 MB | Preview Download |
|
md5:02c69518e60b733979671e73d2e12f9f
|
7.3 MB | Preview Download |
|
md5:3626e528897adfd2f39bb488aa8d1603
|
73.9 MB | Preview Download |
|
md5:36e32f853e68343d9add15b52cb044da
|
153.1 MB | Download |
|
md5:a96436475c858c5a788b711687b03811
|
28.1 MB | Preview Download |
|
md5:5b1e651722093b643575010291f6e106
|
2.6 MB | Preview Download |
|
md5:086bdfaeb6293ed05e3bc6748f0f9e6c
|
7.2 MB | Preview Download |
|
md5:9946cc0f001401f07cc74997b2bf773b
|
2.9 MB | Preview Download |
|
md5:09dca83035b948a936ab6f1421d49ba5
|
52.7 MB | Download |
|
md5:af67eac0002ef4655a847366c31545c3
|
4.4 MB | Preview Download |
|
md5:be82fe5ea6396fdb067b9b6d6e07c0bb
|
964.5 kB | Preview Download |
|
md5:4667b8113e5e6d61af554a6304bf5ebc
|
1.9 MB | Preview Download |
|
md5:8913677c69ccc42523a6cd435d070d2e
|
43.5 MB | Preview Download |
|
md5:451243e423431c4daa4466cbdf801f49
|
75.5 MB | Download |
|
md5:71812b1d3a4ddb275c2e469812d574d2
|
22.7 MB | Preview Download |
|
md5:6904bf33a56d6cca8928be054dc1b00d
|
1.4 MB | Preview Download |
|
md5:22324cdc278878cd884e4caf9971d903
|
3.0 MB | Preview Download |
|
md5:dc83d5aceb3e6c398bac4b47931dcaa3
|
65.8 MB | Download |
|
md5:f963c8a53a7af2ce70650fbf0ecce70d
|
193.6 MB | Preview Download |
|
md5:5403ba305d91359ed220c2025954d43f
|
85.3 kB | Preview Download |
|
md5:694686904ca1871e1b96f2002d2a136d
|
192.9 kB | Preview Download |
|
md5:b28d3f8d5e678a249cd8b73897c74eca
|
3.0 GB | Preview Download |
|
md5:0b3771169376f805a58ad6e59afc45b7
|
11.5 GB | Preview Download |