There is a newer version of the record available.

Published January 16, 2023 | Version 2.0.0
Dataset Open

AgrImOnIA: Open Access dataset correlating livestock and air quality in the Lombardy region, Italy

  • 1. University of Bergamo
  • 2. University of Turin
  • 3. Leibniz University Hannover
  • 4. University of Milano-Bicocca

Contributors

Project manager:

  • 1. University of Bergamo

Description

OBSOLETE VERSION - This is not the last update of the dataset, we strongly suggest to use the last version

The AgrImOnIA dataset is a comprehensive dataset relating air quality and livestock (expressed as the density of bovines and swine bred) along with weather and other variables. The AgrImOnIA Dataset represents the first step of the AgrImOnIA project. The purpose of this data set is to give the opportunity to assess the impact of agriculture on air quality in Lombardy through statistical techniques capable of highlighting the relationship between the livestock sector and air pollutants concentrations.

This dataset is a collection of estimated daily values for a range of measurements of different dimensions as: air quality, meteorology, emissions, livestock animals and land use. Data are related to Lombardy and the surrounding area for 2016-2021, inclusive. The surrounding area is obtained by applying a 0.3° buffer on Lombardy borders.

The data uses several aggregation and interpolation methods to estimate the measurement for all days.

The files in the folder are:

Agrimonia_Dataset.csv(.Rdata,.mat) which is built by joining the daily time series related to the AQ, WE, EM, LI and LA variables. In order to simplify access to variables in the Agrimonia dataset, the variable name starts with the dimension of the variable, i.e., the name of the variables related to the AQ dimension start with 'AQ_'. This file is archived also in the and format for MATLAB and R software, respectively. 

Metadata_Agrimonia.csv which provides further information for the sources used, variables imported, transformations applied, and about the Agrimonia variables.

Metadata_AQ_imputation_uncertainty.csv which contains the daily uncertainty estimate of the imputed observation for the AQ to mitigate missing data in the hourly time series.  

Metadata_LA_CORINE_labels.csv which contains the label and the description associated with the CLC class.  

Metadata_monitoring_network_registry.csv which contains all details about the AQ monitoring station used to build the dataset. Information about pollutant stations includes: station type, municipality code, environment type, altitude, pollutants sampled and other information. Each row represents a single sensor.

Metadata_LA_SIARL_labels.csv which contains the label and the description associated with the SIARL class.

The dataset can be reproduced using the code available at the GitHub page: https://github.com/AgrImOnIA-project/AgrImOnIA_Data

UPDATE 16/01/2023 - NEW RELEASE

A new version of the dataset is released, Agrimonia_Dataset_v_2_0_0.csv (.Rdata and .mat) and Metadata_monitoring_network_registry_v_2_0_0.csv. Some minor points have been addressed:

  • Added values for LA_land_use variable for Switzerland stations (in Agrimonia Dataset_v_2_0_0.csv)
  • Deleted incorrect values for LA_soil_use variable for stations outside Lombardy region during 2018 (in Agrimonia Dataset_v_2_0_0.csv)
  • Fixed duplicate sensors corresponding to the same pollutant within the same station (in Metadata_monitoring_network_registry_v_2_0_0.csv)

Files

Agrimonia_Dataset_v_2_0_0.csv

Files (189.7 MB)

Name Size Download all
md5:f5d56ace29a8050a0c329c56ef1371d7
114.3 MB Preview Download
md5:d2b538e22762c703a0c871a7b99ca18c
26.0 MB Download
md5:7fd10ce6e129a2f509991addaa44f4c5
19.0 MB Download
md5:c04b75394c8f7db0c1c31d668bc5c6f7
23.6 kB Preview Download
md5:d159664aa135832b930ca3fe3a65ef52
30.2 MB Preview Download
md5:cdad9d722504d97ff0576d85859deb10
4.0 kB Preview Download
md5:cee341b55b17639eba34ae51d1f8e17f
475 Bytes Preview Download
md5:0d58c8f792ef46acfc6d1eb615b580f7
178.4 kB Preview Download