Published November 19, 2025 | Version v1
Dataset Open

emburden: Processed Energy Burden Datasets (US Nationwide)

Authors/Creators

  • 1. Emergi Foundation, UNC Chapel Hill

Description

PROCESSED, analysis-ready household energy burden datasets from the DOE Low-Income Energy Affordability Data (LEAD) Tool, formatted for the emburden R package. Scope: All 51 US states and territories (50 states + DC) IMPORTANT: These are PRE-PROCESSED datasets, not raw OpenEI data. They have been:
  • Aggregated by census tract + income bracket
  • Enriched with computed energy burden metrics (EROI, NER, DEAR)
  • Standardized for immediate analysis
  • Quality-checked and validated
This repository provides census tract-level data on household energy burden for the entire United States, covering ~73,000 census tracts. Data includes both Area Median Income (AMI) and Federal Poverty Line (FPL) cohort analyses for 2018 and 2022 vintages.

Files Included:

  • lead_ami_cohorts_2022_us.csv.gz: 2022 AMI cohort data (701,490 records, 148 MB)
  • lead_fpl_cohorts_2022_us.csv.gz: 2022 FPL cohort data (588,163 records, 52 MB)
  • lead_ami_cohorts_2018_us.csv.gz: 2018 AMI cohort data (530,500 records, 54 MB)
  • lead_fpl_cohorts_2018_us.csv.gz: 2018 FPL cohort data (514,893 records, 53 MB)
  • checksums.txt: MD5 checksums for verification
Total size: 307 MB compressed

Data Processing

  • Source: Raw LEAD Tool data from OpenEI
  • Processing: emburden R package v0.4.8 data pipeline
  • Format: CSV (aggregated tract-level cohorts with computed metrics)
  • Ready for: Immediate analysis, no additional processing required

Coverage

  • States: All 51 (50 states + DC, excludes PR)
  • Census Tracts: ~73,000 nationwide
  • Total Records: 2.3+ million cohort observations
  • Income Brackets: 4-6 per dataset/vintage

Data Sources

Original raw data from:
  • DOE LEAD Tool 2022: https://data.openei.org/submissions/6219
  • DOE LEAD Tool 2018: https://data.openei.org/submissions/573
Processed using: emburden R package v0.4.8 (https://github.com/ScheierVentures/emburden)

Citation

When using this data, please cite:
  1. This Zenodo repository (DOI provided)
  2. The emburden R package v0.4.8
  3. The original DOE LEAD Tool publications

License

CC-BY-4.0 (same as source data)

Files

checksums.txt

Files (80.6 MB)

Name Size Download all
md5:6f6ddd167d95cce9c97c5db4221fa26e
202 Bytes Preview Download
md5:5aefd8e2ef0a63089b68977579d9df86
18.0 MB Download
md5:d3b30d9d0009032ebb1b9228e44d0e2d
24.1 MB Download
md5:3da8be8c8628656b7772df4c4e7c4e04
18.2 MB Download
md5:767f2ff27193116f61e893999eb8bcf1
20.2 MB Download

Additional details