Published 2026 | Version v1

Supporting data repository for "Data fusion for fine-scale ozone mapping in the New York City metropolitan area using low-cost sensors and model information"

  • 1. EDMO icon State University of New York at Albany, Atmospheric Sciences Research Center
  • 2. ROR icon University Corporation for Atmospheric Research
  • 3. ROR icon University at Albany, State University of New York
  • 4. ROR icon NSF National Center for Atmospheric Research
  • 5. ROR icon Research Applications Laboratory
  • 6. Joint Center for Satellite Data Assimilation

Description

Supporting data repository for "Data fusion for fine-scale ozone mapping in the New York City metropolitan area using low-cost sensors and model information" by Hojeily et al. (2026) published in Atmospheric Environment. 

Repository created and managed by Ellie Hojeily (ehojeily@albany.edu), last updated: 23 January 2026

Important! The New York State Mesonet (NYSM) observed low-cost sensor data must be retrieved from the NYSM website (https://www2.nysmesonet.org/) following acceptance of a data use agreement. As such, the NYSM low-cost sensor observations are omitted from files in this repository. Please contact Ellie with questions or if additional data are required. Additionally, NYSM meteorological data can be requested from the Mesonet website (https://nysmesonet.org/about/data). 

The following files are available to download from this repository: 

  1. airnow_results.csv - results for the bias correction on the AirNow sites within the model domain. 
  2. nysm_testing_results.csv - results for the bias correction on the NYSM testing sites. Note that the observed O3 values are omitted in this file. Please reach out directly or retrieve data from the NYSM if these data are needed. 
  3. queens_college_results.csv - results for the Queens College NYSDEC site.
  4. sample_wrf.zip - sample output from Wx-AQ/WRF-Chem model, data for the July 1 2024 are provided due to file size restrictions. If you would like the full week of data to use with the sample_bc.zip, please reach out and it can be provided. These files include all forecast fields from WRF. The files will start with 'NYSM_merged' indicating they are bias corrected files. 
  5. sample_bc.zip - sample gridded dataset using the NYSM and AirNow testing sites for bias correction, hourly gridded data for the first week of July are provided. The sedi output variable in the WRF files is 'wrf_pm25_m' since the original SEDI code was designed for PM, these variables are however surface O3 in ppm. Reach out if assistance is needed. Please note that this file is 1.2 gb. 

For additional data, please contact Ellie Hojeily (ehojeily@albany.edu)

Metadata. All .csv files are structured similarly using the following column names: 

Column/Header Name Explanation
site (site.1)  Site name, for the NYSM sites, this is a short identifier (4 letters for standard sites, 6 for Micronet sites) 
lat Latitude in degrees. The latitude values have been rounded to 3 decimal places. More detailed lat/lon information can be found from AirNow and the NYSM. 
lon Longitude in degrees. The longitude values have been rounded to 3 decimal places. More detailed lat/lon information can be found from AirNow and the NYSM. 
O3 Observed ozone in ppb (omitted in nysm_testing_results.csv)
Source Source of observed data (NYSM, AirNow)
wxaq-O3 WRF predicted ozone in ppb
sedi-O3 Bias corrected ozone in ppb
datetime Datetime in UTC
Period Indicates if the site was used to Train or Test the BC. 
Run Indicates what bias correction was used (nysm_only, airnow_only, nysm_airnow) 

Files

airnow_results.csv

Files (2.0 GB)

Name Size
md5:2832d491346ff2c34233e7771eaed553
633.6 MB Preview Download
md5:18638ca2552f5605435234055892508d
82.3 MB Preview Download
md5:1fed67e9c249616df943cd659b5d52e0
2.9 MB Preview Download
md5:0a78be8951b3e3794db6b804de520d88
25.6 MB Preview Download
md5:b560fe328d43d43a069e92166b43ed44
1.3 GB Download

Additional details

Funding

New York State Energy Research and Development Authority
156228
New York State Energy Research and Development Authority
100417

References

  • Ellie Hojeily, Cheng-Hsuan Lu, Stefano Alessandrini, Ju-Hye Kim, Rajesh Kumar, Shih-Wei Wei, Liam Sheji, Md.Aynul Bari, Scott Miller, Data fusion for fine-scale ozone mapping in the New York City metropolitan area using low-cost sensors and model information, Atmospheric Environment, 2026, 121805, ISSN 1352-2310, https://doi.org/10.1016/j.atmosenv.2026.121805.
  • A Network Calibration Approach Improves the Accuracy and Long-Term Stability of a Low-Cost Air Quality Mesonet in New York City Ellie H. Hojeily, Jason M. Covert, Margaret J. Schwab, Clover Moore, Cheng-Hsuan Lu, Md. Aynul Bari, and Scott D. Miller ACS ES&T Air 2026 3 (1), 58-72 DOI: 10.1021/acsestair.5c00205
  • Combining Low-Cost Sensors with the New York State Mesonet for Continuous Fine-Scale Air Quality Monitoring in the New York City Metropolitan Area Scott D. Miller, Ellie H. Hojeily, Jason M. Covert, Cheng-Hsuan Lu, Md. Aynul Bari, Margaret J. Schwab, Clover Moore, and Matthew Brooking ACS ES&T Air 2026 3 (1), 44-57 DOI: 10.1021/acsestair.5c00200