Dataset Open Access

Britain Breathing 2020 Air Quality and Meteorological Dataset

Lowe, Douglas; Gledson, Ann; Topping, David; Jay, Caroline; Reani, Manuele

This data set is a collection of daily mean and maximum values for a range of air quality and meterological measurements and model forecasts for the UK for the year 2020. The dataset contains Temperature, Relative Humidity, and Pressure data, downloaded from the Met Office MIDAS archives via the MEDMI server ( Also downloaded from the MEDMI server are daily pollen measurements for the UK. PM10, PM2.5, NO2, NOx (as NO2), O3, and SO2 measurements from the DEFRA AURN network, and also model forecasts of the same made using the EMEP model.

The tools used to download and process these measurement datasets are available here:

The dataset is designed for use with the region estimator toolset, available here:

Emissions over the UK for the EMEP model runs were generated using the NAEI 2016 UK emission dataset, available in netcdf form here: The running scripts, and operation inputs for EMEP, are available here: and

This dataset is an extension of the Britain Breathing 2016-2019 Air Quality and Meteorological Dataset, available here: The file 'site_location_data.csv' in this dataset contains the location metadata (site_id, latitude, longitude, postcode area) for the dataset here too.

The dataset is presented in CSV format as the file 'turing_aq_daily_met_pollen_pollution_data_2020.csv', described below.

The columns intended to be used as indexes are:

  • timestamp,
    • date of measurements on that row
  • site_id,
    • measurement site ID, corresponding to sites in the three networks:
      • AURN [indicated by AQ],
      • MIDAS [indicated by WEATHER],
      • or pollen [indicated by POLLEN].

The data columns are:

  • O3, PM10, PM2.5, NO2, NOXasNO2, SO2, 
    • daily mean and maximum values in ug/m3 (all with "_max", "_mean", and "_flag" tags)
    • AURN measurement data
    • daily mean and maximum values in ug/m3 (all with "_max", and "_mean" tags)
    • EMEP model forecasts
  • alnus, ambrosia, artemisia, betula, corylus, fraxinus, platanus, poaceae, quercus, salix, ulmus, urtica,
    • daily pollen grain counts
  • temperature, relativehumidity, pressure,
    • daily mean and maximum values in degC, %, and hPa (all with "_max", "_mean", and "_flag" tags).
    • Met Office measurement data

The "_flag" columns indicate data points which have been partially, or fully, imputed. The values for these will be in the range 0-1, and indicate the fraction of the hourly values within that day that are imputed (0 = none, 1 = all 24 hourly datapoints are imputed). No imputation has been carried out for this dataset, so the "_flag" data in this dataset will always be zero (the number of hourly data points used to calculate the daily mean and maximum are not recorded in this dataset).


Files (25.3 MB)
Name Size
25.3 MB Download
All versions This version
Views 137110
Downloads 8666
Data volume 2.2 GB1.7 GB
Unique views 12199
Unique downloads 7861


Cite as