catalyst-cooperative/pudl: PUDL v2025.2.0
Creators
- Catalyst Cooperative
Description
This is our regular quarterly release for 2025Q1. It includes updates to all the datasets that are published with quarterly or higher frequency, plus initial versions of a few new data sources that have been in the works for a while.
One major change this quarter is that we are now publishing all processed PUDL data as Apache Parquet files, alongside our existing SQLite databases. See Data Access for more on how to access these outputs.
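As a rough illustration of working with the SQLite outputs mentioned above, the sketch below lists the tables in a database using only Python's standard library. The table names and the in-memory database are hypothetical stand-ins, not actual PUDL tables; with a downloaded file you would pass its path to `sqlite3.connect()` instead.

```python
import sqlite3


def list_tables(conn):
    """Return the names of all tables in a SQLite database, sorted by name."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [name for (name,) in rows]


# Stand-in for a downloaded database file. The tables created here are
# made up for illustration and are not part of the PUDL schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE example_plants (plant_id INTEGER, plant_name TEXT)")
conn.execute("CREATE TABLE example_generators (plant_id INTEGER, generator_id TEXT)")

tables = list_tables(conn)
print(tables)
```

The Parquet outputs can be loaded in a similar spirit with `pandas.read_parquet()`, which needs a Parquet engine such as `pyarrow` installed.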
Some potentially breaking changes to be aware of:
- In the EIA Form 930 – Hourly and Daily Balancing Authority Operations Report, a number of new energy sources have been added, and some old energy sources have been split into more granular categories. See Changes in energy source granularity over time.
- We are now running the EPA’s CAMD to EIA unit crosswalk code for each individual year starting from 2018, rather than just 2018 and 2021, resulting in more connections between these two datasets and changes to some sub-plant IDs.
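If downstream code depends on the older, coarser EIA 930 energy source categories, one way to cope with the first change above is to roll the granular categories back up before aggregating. The following is a minimal sketch of that idea: the category labels and the mapping are assumptions made for illustration and are not taken from the EIA or PUDL documentation, so check the actual categories before relying on anything like this.

```python
from collections import defaultdict

# Hypothetical mapping from newer, more granular energy source labels to
# older, coarser ones. These labels are illustrative only.
GRANULAR_TO_COARSE = {
    "battery_storage": "other",
    "pumped_storage": "hydro",
    "solar_with_battery": "solar",
    "wind_with_battery": "wind",
}


def coarsen_generation(records, mapping=GRANULAR_TO_COARSE):
    """Sum generation by (hour, coarse energy source).

    records: iterable of (hour, energy_source, mwh) tuples.
    Sources missing from the mapping are passed through unchanged.
    """
    totals = defaultdict(float)
    for hour, source, mwh in records:
        totals[(hour, mapping.get(source, source))] += mwh
    return dict(totals)


# Made-up sample rows, not real EIA 930 data.
sample = [
    ("2025-01-01T00", "solar", 10.0),
    ("2025-01-01T00", "solar_with_battery", 2.5),
    ("2025-01-01T00", "wind", 7.0),
]
result = coarsen_generation(sample)
print(result)
```

Keeping the mapping in a plain dictionary makes it easy to review and adjust as the set of reported energy sources changes over time.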
Many thanks to the organizations that make these regular updates possible, especially GridLab, RMI, and the ZERO Lab at Princeton University. If you rely on PUDL and would like to help ensure that the data keeps flowing, please consider joining them as a PUDL Sustainer, as we are still fundraising for 2025.
See the v2025.2.0 release notes in our documentation for more details.
What's Changed
New & Updated Data
- Integrate EIA 861 2023 final release data by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/3911
- Integrate EIA 930 data up through October by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/3946
- Add Q3 2024 CEMS data to ETL by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/3948
- EIA bulk elec q3 2024 integration by @aesharpe in https://github.com/catalyst-cooperative/pudl/pull/3951
- Eia860m q3 24 by @aesharpe in https://github.com/catalyst-cooperative/pudl/pull/3949
- Eia923 q3 24 by @aesharpe in https://github.com/catalyst-cooperative/pudl/pull/3950
- Eia176 wide table by @davidmudrauskas in https://github.com/catalyst-cooperative/pudl/pull/3590
- EIA 176 wide tables - follow-up fixes by @jdangerx in https://github.com/catalyst-cooperative/pudl/pull/3978
- Add EIA 176 2023 data by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/4004
- add censusfips metadata by @cmgosnell in https://github.com/catalyst-cooperative/pudl/pull/4006
- Enable sec10k distribution by @zschira in https://github.com/catalyst-cooperative/pudl/pull/4026
- Integrate EIA 860 multifuel table by @jmelot in https://github.com/catalyst-cooperative/pudl/pull/3988
- Add Q4 2024 CEMS data to ETL by @cmgosnell in https://github.com/catalyst-cooperative/pudl/pull/4052
- Update Q1 2025 EIA bulk electricity data by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/4051
- Eia860m eia923 q1 2025 by @aesharpe in https://github.com/catalyst-cooperative/pudl/pull/4047
Other Changes
- Post v2024.10.0 release errata cleanup by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/3917
- Add some links to the VCE RARE data source page. by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/3921
- Close out PUDL v2024.11.0 release notes by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/3961
- Reduce peak memory use of VCE RARE assets by @zschira in https://github.com/catalyst-cooperative/pudl/pull/3959
- Clean up some nightly build infrastructure cruft by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/3962
- Fix nested list formatting in CONTRIBUTING by @crd477 in https://github.com/catalyst-cooperative/pudl/pull/3974
- Updates to splink FERC to EIA record linkage notebook by @katie-lamb in https://github.com/catalyst-cooperative/pudl/pull/3976
- Correct flipped starting/ending balance column rename. by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/3979
- Add FERC1 validation tests by @jdangerx in https://github.com/catalyst-cooperative/pudl/pull/3860
- Fix pudl.settings class doc strings by @bendnorman in https://github.com/catalyst-cooperative/pudl/pull/3986
- Add parquet download link to data dictionary by @bendnorman in https://github.com/catalyst-cooperative/pudl/pull/3984
- Refactor entity matching name cleaner to be more efficient by @katie-lamb in https://github.com/catalyst-cooperative/pudl/pull/3953
- Fix row count asset check for vcerare by @zschira in https://github.com/catalyst-cooperative/pudl/pull/3993
- Make asset checks run in integration test by @jdangerx in https://github.com/catalyst-cooperative/pudl/pull/3990
- Make the reconstructable job not take any arguments at all. by @jdangerx in https://github.com/catalyst-cooperative/pudl/pull/3996
- Add NREL GHA runner WIF setup by @jdangerx in https://github.com/catalyst-cooperative/pudl/pull/3952
- Add boolean column naming standard to naming_conventions doc by @aesharpe in https://github.com/catalyst-cooperative/pudl/pull/3998
- Make it possible to pass another sources dict to DataSource by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/4003
- Finish unittest -> pytest conversion by @jdangerx in https://github.com/catalyst-cooperative/pudl/pull/4014
- Pass sources to DataSource.from_id() by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/4013
- Acknowledge PUDL Sustainers by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/4025
- Sec distribution fix by @zschira in https://github.com/catalyst-cooperative/pudl/pull/4031
- Add sec10k metadata directly in PUDL by @zschira in https://github.com/catalyst-cooperative/pudl/pull/4035
- Update PUDL bibliography by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/4049
- Change SEC 10-K table schemas to fix FK errors and use quarterly naming. by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/4046
- fix: Normalize VCE RARE spellings of great lakes. by @krivard in https://github.com/catalyst-cooperative/pudl/pull/4029
- Update Q1 2025 EIA 930 data by @krivard in https://github.com/catalyst-cooperative/pudl/pull/4054
- Update EIA-EPA crosswalk to include multiple years of crosswalk data by @e-belfer in https://github.com/catalyst-cooperative/pudl/pull/4056
- Publish yearly detailed FERC1 accounting tables by @cmgosnell in https://github.com/catalyst-cooperative/pudl/pull/4057
- Close out release notes for PUDL v2025.2.0 by @zaneselvans in https://github.com/catalyst-cooperative/pudl/pull/4062
New Contributors
- @crd477 made their first contribution in https://github.com/catalyst-cooperative/pudl/pull/3974
- @jmelot made their first contribution in https://github.com/catalyst-cooperative/pudl/pull/3988
- @krivard made their first contribution in https://github.com/catalyst-cooperative/pudl/pull/4029
Full Changelog: https://github.com/catalyst-cooperative/pudl/compare/v2024.10.0...v2025.2.0
Files
Name | Size | Checksum |
---|---|---|
catalyst-cooperative/pudl-v2025.2.0.zip | 47.1 MB | md5:5eee27ded216bc2eccc1e9962a338ed4 |
Additional details
Related works
- Is supplement to: Software: https://github.com/catalyst-cooperative/pudl/tree/v2025.2.0 (URL)
Software
- Repository URL: https://github.com/catalyst-cooperative/pudl