Published August 18, 2020 | Version v1
Dataset Open

Reliably predicting pollinator abundance: challenges of calibrating process-based ecological models

  • 1. University of Reading
  • 2. Lund University
  • 3. Northumbria University
  • 4. Brazilian Agricultural Research Corporation
  • 5. Western Norway University of Applied Sciences
  • 6. University of Leeds
  • 7. University of Worcester
  • 8. University of Bristol
  • 9. University of Edinburgh
  • 10. Lancaster University

Description

1. Pollination is a key ecosystem service for global agriculture but evidence of pollinator population declines is growing. Reliable spatial modelling of pollinator abundance is essential if we are to identify areas at risk of pollination service deficit and effectively target resources to support pollinator populations. Many models exist which predict pollinator abundance but few have been calibrated against observational data from multiple habitats to ensure their predictions are accurate.

2. We selected the most advanced process-based pollinator abundance model available and calibrated it for bumblebees and solitary bees using survey data collected at 239 sites across Great Britain. We compared three versions of the model: one parameterised using estimates based on expert opinion, one where the parameters are calibrated using a purely data-driven approach and one where we allow the expert opinion estimates to inform the calibration process.

3. All three model versions showed significant agreement with the survey data, demonstrating this model's potential to reliably map pollinator abundance. However, there were significant differences between the nesting/floral attractiveness scores obtained by the two calibration methods and from the original expert opinion scores.

4. Our results highlight a key universal challenge of calibrating spatially-explicit, process-based ecological models. Notably, the desire to reliably represent complex ecological processes in finely mapped landscapes necessarily generates a large number of parameters, which are challenging to calibrate with ecological and geographical data that is often noisy, biased, asynchronous and sometimes inaccurate. Purely data-driven calibration can therefore result in unrealistic parameter values, despite appearing to improve model-data agreement over initial expert opinion estimates. We therefore advocate a combined approach where data-driven calibration and expert opinion are integrated into an iterative Delphi-like process, which simultaneously combines model calibration and credibility assessment. This may provide the best opportunity to obtain realistic parameter estimates and reliable model predictions for ecological systems with expert knowledge gaps and patchy ecological data.

Notes

TransectSurveyData.csv

We strongly recommend using the original source datasets (see Gardner et al. 2020 for references), rather than our processed composite dataset, since these are far more detailed, not amalgamated to survey-level totals, generally include higher taxonomic resolution and often include ancillary data on survey weather conditions etc.

The column contents of the file are as follows:

1. Site - name of survey site

2. X - British National Grid x co-ordinate for survey site

3. Y - British National Grid y co-ordinate for survey site

4. Method - survey methodology (transect in all cases)

5. TransectDuration_min - total duration of transect survey in minutes. In many (but not all) cases, this is obtained by converting the reported transect length into a duration assuming a 50m transect takes 10min to survey.

6. Year - the year the survey was carried out

7. Week - the week of the year the survey was carried out, where this value ranges in integers between 1 and 52 and the week beginning 1st January is designated as week 1.

8. Day - the day of the year the survey was carried out, where this value ranges in integers between 1 and 365 and 1st January is designated as day 1.

9. GroundNestingBumblebees - the total number of ground nesting bumblebees observed during the survey

10. TreeNestingBumblebees - the total number of tree nesting bumblebees observed during the survey

11. GroundNestingSolitaryBees - the total number of ground nesting solitary bees observed during the survey

12. CavityNestingSolitaryBees - the total number of cavity nesting solitary bees observed during the survey

NB: Columns 9-12 can contain non-integer values. This occurs when bees were recorded as 'Bombus unknown' or 'solitary unknown'. In which case, they were apportioned between the nesting guilds according to the proportion of known species observed from each guild on that survey, as described in Gardner et al. (2020).

 

lfn.csv

This model input file lists the numeric code assigned to each landcover class (lu).

poll_names.csv

This model input file lists the species code assigned to each bee guild, its life history (solitary or social) and the floral periods (P1, P2, P3) during which its reproductive females (q) and workers (w; if social) are actively foraging.

av.csv

This model input file lists the maximum nest density used for each species code. Units = nests per ha.

distances.csv

This model input file lists the foraging and dispersal (activity = 'nesting') distances used for each species code. Units = m.

growth.csv

This model input file lists the growth parameters used for each species code. See Gardner et al. (2020) for parameter descriptions.

floralCover.csv

This model input file lists the expert opinion floral cover scores used for each landcover code in each floral period (P1, P2, P3). Columns with *_b represent mean scores across all experts. Columns with *_l represent mean value minus standard error. Columns with *_u represent mean value plus standard error. The permitted range for scores is 0.0 - 100.0. 

attract.csv

This model input file lists the expert opinion floral attractiveness scores (Flor_*) and nesting attractiveness scores (Nest_*) used for each species code with each landcover code in each floral period (P1, P2, P3). Columns with *_b represent mean scores across all experts. Columns with *_l represent mean value minus standard error. Columns with *_u represent mean value plus standard error. The permitted range for floral attractiveness scores is 0.0 - 20.0. The permitted range for nesting attractiveness scores is 0.0 - 1.0.

Funding provided by: Global Food Security `Food System Resilience' Programme*
Crossref Funder Registry ID:
Award Number: BB/R00580X/1

Funding provided by: Global Food Security `Food System Resilience' Programme
Crossref Funder Registry ID:
Award Number: BB/R00580X/1

Files

attract.csv

Files (100.8 kB)

Name Size Download all
md5:47025cec284c43b48559b8dd644fe272
21.7 kB Preview Download
md5:bf976eab6f1e05b7daf9f9a92ac3bce6
60 Bytes Preview Download
md5:b6e9599e1b3f58d23d469bf7a3e02313
177 Bytes Preview Download
md5:c519226f80abfadad6adef8d305ae10b
5.0 kB Preview Download
md5:1f6d4a0a9e2ba89aa8b4cc5ce4ab42ff
535 Bytes Preview Download
md5:bd431e9d05246da1fbcef3d5fa366ba8
745 Bytes Preview Download
md5:31af388adbd8f5ff6ce597d0cf166dbe
347 Bytes Preview Download
md5:f1f500a1bc709115f7bc06bb4f3dda40
72.3 kB Preview Download

Additional details

Related works