Published November 20, 2025 | Version v1
Dataset Open

emburden: Processed Energy Burden Datasets (US Nationwide)

Authors/Creators

  • 1. Emergi Foundation, UNC Chapel Hill

Description

PROCESSED, analysis-ready household energy burden datasets from the DOE Low-Income Energy Affordability Data (LEAD) Tool, formatted for the emburden R package. Scope: All 51 US states and territories (50 states + DC) IMPORTANT: These are PRE-PROCESSED datasets, not raw OpenEI data. They have been:
  • Aggregated by census tract + income bracket
  • Enriched with computed energy burden metrics (EROI, NER, DEAR)
  • Standardized for immediate analysis
  • Quality-checked and validated
This repository provides census tract-level data on household energy burden for the entire United States, covering ~73,000 census tracts. Data includes both Area Median Income (AMI) and Federal Poverty Line (FPL) cohort analyses for 2018 and 2022 vintages.

Files Included:

  • lead_ami_cohorts_2022_us.csv.gz: 2022 AMI cohort data (701,490 records, 148 MB)
  • lead_fpl_cohorts_2022_us.csv.gz: 2022 FPL cohort data (588,163 records, 52 MB)
  • lead_ami_cohorts_2018_us.csv.gz: 2018 AMI cohort data (530,500 records, 54 MB)
  • lead_fpl_cohorts_2018_us.csv.gz: 2018 FPL cohort data (514,893 records, 53 MB)
  • checksums.txt: MD5 checksums for verification
Total size: 307 MB compressed

Data Processing

  • Source: Raw LEAD Tool data from OpenEI
  • Processing: emburden R package v0.4.8 data pipeline
  • Format: CSV (aggregated tract-level cohorts with computed metrics)
  • Ready for: Immediate analysis, no additional processing required

Coverage

  • States: All 51 (50 states + DC, excludes PR)
  • Census Tracts: ~73,000 nationwide
  • Total Records: 2.3+ million cohort observations
  • Income Brackets: 4-6 per dataset/vintage

Data Sources

Original raw data from:
  • DOE LEAD Tool 2022: https://data.openei.org/submissions/6219
  • DOE LEAD Tool 2018: https://data.openei.org/submissions/573
Processed using: emburden R package v0.4.8 (https://github.com/ScheierVentures/emburden)

Citation

When using this data, please cite:
  1. This Zenodo repository (DOI provided)
  2. The emburden R package v0.4.8
  3. The original DOE LEAD Tool publications

License

CC-BY-4.0 (same as source data)

Files

checksums.txt

Files (80.6 MB)

Name Size Download all
md5:28ce17263d648263a417e54c923a7c85
404 Bytes Preview Download
md5:4941e3566daec1badc53eb44f34d95a8
18.0 MB Download
md5:cc847d89119a374bede6ee7f41060506
24.1 MB Download
md5:85ef6b7b4de244e80ff700f3d5becf3a
18.2 MB Download
md5:767f2ff27193116f61e893999eb8bcf1
20.2 MB Download

Additional details