There is a newer version of the record available.

Published May 25, 2024 | Version v2024.5.0
Dataset Open

Public Utility Data Liberation Project (PUDL) Data Release

Description

PUDL v2024.5.0 Data Release

We've just completed our quarterly integration of EIA data sources for 2024Q2 (in support of RMI's Utility Transition Hub) and have also added a bunch of new tables over the last few months in an effort to better support energy system modelers (with support from GridLab).

New Data Coverage

EIA-860 & EIA-923

GridPath RA Toolkit

EIA AEO

  • Extracted tables 13, 15, 20, and 54 from the EIA Annual Energy Outlook 2023, which include future projections related to electric power and renewable energy through the year 2050, across a variety of scenarios. See issue #3368 and PR #3538.
  • Added new tables from EIA AEO table 54:
    • :ref:`core_eiaaeo__yearly_projected_generation_in_electric_sector_by_technology` contains generation capacity & generation projections for the electric sector, broken out by technology type. See issue #3581 and PR #3582.
    • :ref:`core_eiaaeo__yearly_projected_generation_in_end_use_sectors_by_fuel_type` contains generation capacity & generation projections for the electric sector, broken out by technology type. See issue #3581 and PR #3598.
    • :ref:`core_eiaaeo__yearly_projected_electric_sales` contains electric sales projections until 2050, broken out by customer type. See issue #3581 and PR #3617.

NREL ATB

  • Added new NREL ATB tables with annual technology cost and performance projections. See issue #3465 and PRs #3498, #3570.

EIA-930

EPA CEMS

  • Added 2024 Q1 of CEMS data. See issue #3620 and PR #3624.

EIA Bulk Electricity Data

  • Updated the EIA Bulk Electricity data archive to include data that was available as of 2024-05-01, which covers up through 2024-02-01 (3 months more than the previously used archive). See PR #3615.

FERC Form 1

Data Cleaning

  • When generator_operating_date values are too inconsistent to be harvested successfully, we now take the max date within a year and attempt to harvest again, to rescue records lost because of inconsistent month reporting in EIA 860 and 860M. See issue #3340 and PR #3419. This change also fixed a bug that was preventing other columns harvested with a special process from being saved.
  • When ingesting FERC 1 XBRL filings, we now take the most recent non-null value instead of the value from the latest filing that applies for a specific row. This means that we no longer lose data if a utility posts a FERC filing with only a small number of updated values. See issue #3309 and PR #3545.

EIA - FERC1 Record Linkage Model Update

We merged in a refactor of the EIA plant parts to FERC1 plants record linkage model, which was generously supported by a CCAI Innovation Grant. This replaced the linear regression model with a model built with the Python package Splink. Splink provides helpful visualizations to understand model performance and parameter tuning, which can be generated with devtools/splink-ferc1-eia-match.ipynb. We measured model performance with precision - a measure of accuracy when the model makes a prediction, recall - a measure of coverage of FERC records model predicted a match for, and accuracy - a measure of overall correctness of the predictions. Model performance improved and now has a precision of .94, recall of .9, and overall accuracy of .85.

Schema Changes

Bug Fixes

Ensure that all columns fed into the harvesting / reconciliation process are encoded before harvesting takes place, improving the consistency of harvested fields. See issue #3542 and PR #3558. This change also simplifies the encoding process in the vast majority of cases, since the same global set of encoders can be used on any dataframe, with every column encoded based on the field definitions and FK constraints associated with the column name.

CLI Changes

Removed the --clobber option from the ferc_to_sqlite command and associated assets. We rebuild these databases infrequently, and needing to either edit the runtime parameters in Dagster's Launchpad or remove the existing databases from the filesystem manually are brittle. Partly in response to issue #3612; see PR #3622.

Other PUDL v2024.5.0 Resources

Contact Us

If you're using PUDL, we would love to hear from you! Even if it's just a note to let us know that you exist, and how you're using the software or data. Here's a bunch of different ways to get in touch:

Files

ferc1_xbrl_datapackage.json

Files (9.4 GB)

Name Size Download all
md5:560862b6eda63dd9c99034ec4995cf14
6.4 MB Download
md5:a4bea8119e67502dfdf503f953fc8179
506.7 MB Download
md5:fa4f3586790ed4438b7e3acc61df8f78
67.1 MB Download
md5:c63b3e9c43574e2497b027c65f397f73
64.9 MB Download
md5:cc6cf41a9e8a93d425d0ea3712b61539
79.4 MB Download
md5:7f4d5ff59151fc95fd61fc6cfc54025f
9.8 MB Download
md5:bf5f4213416070970d1df4e497b5e622
5.4 GB Download
md5:ea6dea30c134d4ee702a7350534cc5aa
275.5 MB Download
md5:a607d95b3e90a7adbbe0f60f08dc82da
97.2 MB Download
md5:e96db21413b81ea068ec44ac6c42b6fb
1.7 MB Preview Download
md5:026ad62c418e5e8aab4a85e6a68d628a
7.3 MB Preview Download
md5:a59aedfeb1d2e0498786680fc3e61bba
74.5 MB Download
md5:1536e1eec1ebf2a1c28de6188c23da38
13.8 MB Download
md5:efcd21f96a10fb21e4c62c671f6371f0
2.0 MB Preview Download
md5:fbbd750509118029d7d675e573f6ad5a
7.1 MB Preview Download
md5:f4512fd566a296a1905872ce29752492
2.9 MB Download
md5:04569ee928ec487858b448dd1dd7d652
2.3 MB Download
md5:a3cdc95139ea96e46b4b2a231ef785a7
748.9 kB Preview Download
md5:0cc4fca785314082b8f514bae888600e
1.9 MB Preview Download
md5:0cc8f68819a45378e4c9c0ecf884de78
43.9 MB Download
md5:571b1e3649011d4acf3f1475a4d09966
10.6 MB Download
md5:74fa9bde52a973826ecef44c65249e7a
1.1 MB Preview Download
md5:2c85dae41667448ea32066726f76ae5e
2.9 MB Preview Download
md5:73f2028f0d495d13e287287ee49473a7
102.2 MB Download
md5:1243f03c0121d5a7993696ef9afca655
59.8 kB Preview Download
md5:9fef935f9a970839319ada082d6c9672
192.4 kB Preview Download
md5:c6b3276d15f1a5fa75c58565dda138d2
105.1 MB Download
md5:31c234dda989ccdfb474af27fe6034e8
61.1 MB Download
md5:51c4824c58b0273cc2cb91e31a605fcc
55.4 MB Download
md5:7ee6dce8806629d7a3ab3c4e992373a1
2.4 GB Download