File uploads: We have fixed an issue which caused file uploads to fail. We apologise for the inconvenience it may have caused.

There is a newer version of the record available.

Published March 12, 2024 | Version v1
Dataset Open

High-quality daily PM2.5 datasets at a 10 km resolution for India

  • 1. Emmett Interdisciplinary Program in Environment and Resources, Stanford University, Stanford, CA, USA
  • 2. Department of Earth System Science, Stanford University, Stanford, CA, USA
  • 3. Center for Innovation in Global Health, Stanford University, Stanford, CA, USA
  • 4. Department of Energy Science and Engineering, Stanford University, Stanford, CA, USA
  • 5. Doerr School of Sustainability, Stanford University, Stanford, CA, USA
  • 6. Center on Food Security and the Environment, Stanford University, Stanford, CA, USA
  • 7. National Bureau of Economic Research, Cambridge, MA, USA

Description

This is the older version. Please see the version 2

 
 
Open-source daily fine particulate matter (PM2.5) datasets at a 10 km resolution for India from 2005 to 2023, using a region-specific two-stage machine learning model carefully validated on held-out monitor data that it was not trained on. Our model demonstrates robust out-of-sample performance, substantially outperforming existing publicly-available monthly PM2.5 datasets.
 
To take advantage of both the longer available time series of Aerosol Optical Depth (AOD) data and information from newer sensors such as TROPOspheric Monitoring Instrument (TROPOMI), we developed two separate machine learning models - the "Full model" and the "AOD model".
 
Full model:
  • Predictive performance (spatial cross-validation): R2 value of 0.67, RMSE of 27.79 μg/m3
  • Input features: Moderate Resolution Imaging Spectroradiometer (MODIS) AOD and TROPOMI satellite inputs along with other remote sensing data
  • Daily PM2.5 predictions for: July 10, 2018 - September 30, 2023
AOD model: 
  • Predictive performance (spatial cross-validation): R2 value of 0.64, RMSE of 32.08 μg/m3
  • Input features: all inputs except TROPOMI used for the Full model
  • Daily PM2.5 predictions for: January 1, 2005 - September 30, 2023
 
Please note that we employed spatial cross-validation (CV) rather than more conventional random CV to be responsible for predicting daily PM2.5 concentrations for locations without air quality monitors across India. When the above Full model was evaluated using 10-fold random CV, it showed notably higher performance (R2 of 0.85 and RMSE of 18.48 μg/m3). This highlights the potential of random CV to overstate model performance on critical real-world applications.
 
The paper has been submitted for publication in a peer reviewed journal, but has yet to be formally accepted for publication.
You can find a preprint on EarthArXiv: https://doi.org/10.31223/X5H40F

Files

aod_model_2005.zip

Files (2.0 GB)

Name Size Download all
md5:192c5e497e148194380617e4a785dbef
82.5 MB Preview Download
md5:dc7215a41f530d07201cc8af83131ffc
82.5 MB Preview Download
md5:cc8546487f10d21c391d6d2eedf552fc
82.4 MB Preview Download
md5:31700368d05e87647e287040aa5e767e
82.5 MB Preview Download
md5:c0f676d6473d90eb31ec8dbdd239070c
82.1 MB Preview Download
md5:81395808dce6d6e45e2b0077618e6dc0
82.5 MB Preview Download
md5:a67f3e2183e62a8afde4b07d5f869b43
81.9 MB Preview Download
md5:9ce5bea08eb51be0feaef443d9fbf88a
82.3 MB Preview Download
md5:035122099514f2db169fbad86c5ac176
82.3 MB Preview Download
md5:1fbf353d535ea387611dea28d451488f
82.1 MB Preview Download
md5:1ce1c4c97745a748c5e3975a4dcec933
82.1 MB Preview Download
md5:c371c134520b0e7571a6680a7400e2d7
82.4 MB Preview Download
md5:fd47a69acc3144b4ac8c3dfdf380ef1d
82.2 MB Preview Download
md5:07df8f969fe64ebea825f13cf62dd83a
81.9 MB Preview Download
md5:787e348f8a4160ad47bff91b060164cf
82.5 MB Preview Download
md5:7f317dc8a64920aa43691378cc787a05
83.1 MB Preview Download
md5:ed336006702e1bec608e52292f6c53e2
82.5 MB Preview Download
md5:ccf0c72be23d1e36ba9794736b8e3d58
82.6 MB Preview Download
md5:db0cb56771241685d24a01f82634af12
62.3 MB Preview Download
md5:8cec0f2598b1e24e1917d6ac7979a031
39.7 MB Preview Download
md5:528181329fd082902b5a01df9888d796
82.8 MB Preview Download
md5:0abc9f7265c8aebdff597e6957d7a311
83.4 MB Preview Download
md5:1a2062b42787d85aabe7a4da47e75e7e
82.8 MB Preview Download
md5:87ec2c53dd33bdc906f74433e0e748a4
82.9 MB Preview Download
md5:9e964a2a5286020924c2b4cabda77c1a
62.5 MB Preview Download

Additional details

Identifiers