Published March 12, 2024
| Version v1
Dataset
Open
High-quality daily PM2.5 datasets at a 10 km resolution for India
Creators
- 1. Emmett Interdisciplinary Program in Environment and Resources, Stanford University, Stanford, CA, USA
- 2. Department of Earth System Science, Stanford University, Stanford, CA, USA
- 3. Center for Innovation in Global Health, Stanford University, Stanford, CA, USA
- 4. Department of Energy Science and Engineering, Stanford University, Stanford, CA, USA
- 5. Doerr School of Sustainability, Stanford University, Stanford, CA, USA
- 6. Center on Food Security and the Environment, Stanford University, Stanford, CA, USA
- 7. National Bureau of Economic Research, Cambridge, MA, USA
Description
This is the older version. Please see the version 2
Open-source daily fine particulate matter (PM2.5) datasets at a 10 km resolution for India from 2005 to 2023, using a region-specific two-stage machine learning model carefully validated on held-out monitor data that it was not trained on. Our model demonstrates robust out-of-sample performance, substantially outperforming existing publicly-available monthly PM2.5 datasets.
To take advantage of both the longer available time series of Aerosol Optical Depth (AOD) data and information from newer sensors such as TROPOspheric Monitoring Instrument (TROPOMI), we developed two separate machine learning models - the "Full model" and the "AOD model".
Full model:
- Predictive performance (spatial cross-validation): R2 value of 0.67, RMSE of 27.79 μg/m3
- Input features: Moderate Resolution Imaging Spectroradiometer (MODIS) AOD and TROPOMI satellite inputs along with other remote sensing data
- Daily PM2.5 predictions for: July 10, 2018 - September 30, 2023
AOD model:
- Predictive performance (spatial cross-validation): R2 value of 0.64, RMSE of 32.08 μg/m3
- Input features: all inputs except TROPOMI used for the Full model
- Daily PM2.5 predictions for: January 1, 2005 - September 30, 2023
Please note that we employed spatial cross-validation (CV) rather than more conventional random CV to be responsible for predicting daily PM2.5 concentrations for locations without air quality monitors across India. When the above Full model was evaluated using 10-fold random CV, it showed notably higher performance (R2 of 0.85 and RMSE of 18.48 μg/m3). This highlights the potential of random CV to overstate model performance on critical real-world applications.
The paper has been submitted for publication in a peer reviewed journal, but has yet to be formally accepted for publication.
You can find a preprint on EarthArXiv: https://doi.org/10.31223/X5H40F
Files
aod_model_2005.zip
Files
(2.0 GB)
Name | Size | Download all |
---|---|---|
md5:192c5e497e148194380617e4a785dbef
|
82.5 MB | Preview Download |
md5:dc7215a41f530d07201cc8af83131ffc
|
82.5 MB | Preview Download |
md5:cc8546487f10d21c391d6d2eedf552fc
|
82.4 MB | Preview Download |
md5:31700368d05e87647e287040aa5e767e
|
82.5 MB | Preview Download |
md5:c0f676d6473d90eb31ec8dbdd239070c
|
82.1 MB | Preview Download |
md5:81395808dce6d6e45e2b0077618e6dc0
|
82.5 MB | Preview Download |
md5:a67f3e2183e62a8afde4b07d5f869b43
|
81.9 MB | Preview Download |
md5:9ce5bea08eb51be0feaef443d9fbf88a
|
82.3 MB | Preview Download |
md5:035122099514f2db169fbad86c5ac176
|
82.3 MB | Preview Download |
md5:1fbf353d535ea387611dea28d451488f
|
82.1 MB | Preview Download |
md5:1ce1c4c97745a748c5e3975a4dcec933
|
82.1 MB | Preview Download |
md5:c371c134520b0e7571a6680a7400e2d7
|
82.4 MB | Preview Download |
md5:fd47a69acc3144b4ac8c3dfdf380ef1d
|
82.2 MB | Preview Download |
md5:07df8f969fe64ebea825f13cf62dd83a
|
81.9 MB | Preview Download |
md5:787e348f8a4160ad47bff91b060164cf
|
82.5 MB | Preview Download |
md5:7f317dc8a64920aa43691378cc787a05
|
83.1 MB | Preview Download |
md5:ed336006702e1bec608e52292f6c53e2
|
82.5 MB | Preview Download |
md5:ccf0c72be23d1e36ba9794736b8e3d58
|
82.6 MB | Preview Download |
md5:db0cb56771241685d24a01f82634af12
|
62.3 MB | Preview Download |
md5:8cec0f2598b1e24e1917d6ac7979a031
|
39.7 MB | Preview Download |
md5:528181329fd082902b5a01df9888d796
|
82.8 MB | Preview Download |
md5:0abc9f7265c8aebdff597e6957d7a311
|
83.4 MB | Preview Download |
md5:1a2062b42787d85aabe7a4da47e75e7e
|
82.8 MB | Preview Download |
md5:87ec2c53dd33bdc906f74433e0e748a4
|
82.9 MB | Preview Download |
md5:9e964a2a5286020924c2b4cabda77c1a
|
62.5 MB | Preview Download |
Additional details
Identifiers
- DOI
- 10.31223/X5H40F