Published December 22, 2023 | Version v1
Dataset Open

Supporting Data for "Local exposure misclassification in national models: relationships with urban infrastructure and demographics"

  • 1. ROR icon The University of Texas at Austin
  • 2. ROR icon University of California, Berkeley
  • 3. Harvard T.H. Chan School of Public Health

Description

Overview: This dataset accompanies the recent publication "Local exposure misclassification in national models: relationships with urban infrastructure and demographics" (DOI: 10.1038/s41370-023-00624-z). It provides essential data for replicating and extending the analysis conducted in the study. The script for the analysis is available at https://github.com/SEChambliss-AQ/LD-analysis. The dataset consists of four key files.

Files Included:

  1. Gridded OSM and GSV Data (gridded_OSM_GSV.RDS):

    • This R object offers a 100mx100m grid covering select neighborhoods in the San Francisco Bay Area.
    • Each grid cell includes average air pollution levels (Ultrafine Particle Count in thousand count per cubic meter; Nitrogen Dioxide in ppb) from mobile monitoring.
    • The file also provides normalized z-scores of local density of urban infrastructure related to air pollutants based on OpenStreetMap (OSM) data.
    • Further details can be found in the associated publication.
  2. BartMachine Outputs (bartMachine R objects.zip):

    • A collection of R object outputs from the bartMachine package, an R-Java Bayesian Additive Regression Trees implementation.
    • These objects, which can be regenerated using the provided script, are included to reduce computational requirements for future analyses.
    • The script for generating these outputs is available at the above GitHub Repository.
  3. NO2 Predictions (CACES_criteria.csv):

    • Land Use Regression (LUR) data for Nitrogen Dioxide, provided by the Center for Air, Climate, and Energy Solutions.
    • Methodologies for integrating these data with the gridded dataset are described in the publication and script.
  4. UFP Predictions (CACES_UFP.csv):

    • Similar to the NO2 dataset, this file contains LUR data for Ultrafine Particle predictions.
    • Methods for data integration are detailed in the publication and available script.

Usage: These files are intended for researchers and analysts aiming to replicate or build upon the study's findings. They provide a source of data for exploring the complex interplay between air pollution, urban infrastructure, and demographic factors in urban environments. For detailed methodology and analysis, refer to the original publication and the accompanying GitHub repository.

Files

bartMachine R objects.zip

Files (142.8 MB)

Name Size Download all
md5:9e5d74f8213696e51bf55d6a5dd73b01
128.2 MB Preview Download
md5:72cb49394159e82740dc7502732f9034
8.8 MB Preview Download
md5:8ec37793699be24c02513f981dc1422d
1.4 MB Preview Download
md5:8c0161705cdcf5490e2a5216d35d54fe
4.4 MB Download

Additional details

Related works

Is described by
Publication: 10.1038/s41370-023-00624-z (DOI)