Published February 4, 2026 | Version v1
Preprint Open

A factorial comparison of excess mortality models for Germany

  • 1. University of Koblenz
  • 2. University of Osnabrück
  • 3. University of Stuttgart

Description

Reliable estimates of excess mortality (EM) are essential both for actuarial applications
and for informing public debate and policy. Yet, estimates published for Germany during
the COVID-19 pandemic varied by tens of thousands of deaths. This discrepancy raises the
question of how sensitive EM estimates are to methodological choices.

We therefore constructed a comprehensive factorial framework of models for calculating
expected mortality, systematically varying six components: sex treatment, demographic
dataset, age cohort resolution, temporal resolution, forerun length, and mortality model form.
The full design yields 1152 candidate models, of which 764 remained feasible after
excluding unstable combinations. These models were fitted to German mortality data
for 2000–2019 and extrapolated to 2020–2024.
We then assessed model fit using residuals and the fraction of variance unexplained (FVU), and assessed the extrapolations by analyzing the predicted EM.
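The two quantities named above can be illustrated with a minimal sketch. It assumes the common definition of FVU as the residual sum of squares divided by the total sum of squares (equivalently 1 - R^2 under the usual definition of R^2) and takes EM as observed minus expected deaths; the preprint may define either differently. The function names and the toy numbers are hypothetical and are not taken from the study.

```python
def fraction_of_variance_unexplained(observed, predicted):
    """FVU = residual sum of squares / total sum of squares (assumed definition)."""
    mean_observed = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_observed) ** 2 for o in observed)
    return ss_res / ss_tot


def excess_mortality(observed, expected):
    """Excess deaths per period: observed minus expected (baseline) deaths."""
    return [o - e for o, e in zip(observed, expected)]


# Synthetic toy numbers, for illustration only (not German mortality data).
observed_fit = [100.0, 104.0, 107.0, 112.0]   # deaths in the fitting period
baseline_fit = [100.0, 104.0, 108.0, 112.0]   # model values in the fitting period
fvu = fraction_of_variance_unexplained(observed_fit, baseline_fit)

observed_future = [120.0, 118.0]              # deaths in the extrapolation period
baseline_future = [116.0, 120.0]              # extrapolated baseline
em = excess_mortality(observed_future, baseline_future)
```

A small FVU indicates that the baseline reproduces most of the variation in the fitting period; it says nothing by itself about how well the baseline extrapolates, which is why the extrapolations are assessed separately.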

Our results demonstrate that EM estimates for the German population are surprisingly robust to the choice of sex treatment and temporal resolution, but highly sensitive to age cohort resolution, forerun length, and model form. In particular, models without age stratification produce implausibly high EM due to Simpson’s paradox, and constant or quadratic models yield divergent, unreliable extrapolations. By contrast, actuarial models with moderate forerun lengths provide robust and thus interpretable results.
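The effect of omitting age stratification can be sketched with a toy population. The example below is only one way such an aggregation effect can arise, under stated assumptions: two hypothetical age groups, age-specific death rates that stay constant, and a population that ages between a reference year and a target year. All names and numbers are invented for illustration and are not taken from the study.

```python
# Hypothetical age-specific death rates per 100,000, held constant over time.
RATE_PER_100K = {"under_65": 200, "65_plus": 4000}

# Hypothetical population by age group; the total stays the same, the share aged 65+ grows.
pop_reference = {"under_65": 800_000, "65_plus": 200_000}
pop_target = {"under_65": 750_000, "65_plus": 250_000}


def deaths_by_group(population):
    """Deaths per age group implied by the constant age-specific rates."""
    return {g: population[g] * RATE_PER_100K[g] / 100_000 for g in population}


observed_reference = sum(deaths_by_group(pop_reference).values())
observed_target = sum(deaths_by_group(pop_target).values())

# Unstratified baseline: one crude rate from the reference year, applied to the total.
crude_rate = observed_reference / sum(pop_reference.values())
expected_unstratified = crude_rate * sum(pop_target.values())

# Stratified baseline: reference-year rate per age group, applied group by group.
ref_deaths = deaths_by_group(pop_reference)
expected_stratified = sum(
    ref_deaths[g] / pop_reference[g] * pop_target[g] for g in pop_target
)

excess_unstratified = observed_target - expected_unstratified
excess_stratified = observed_target - expected_stratified
```

In this toy setup no age group's mortality changes, so the stratified baseline reports essentially zero excess, while the unstratified baseline attributes roughly 1,900 deaths to excess mortality purely because the population has aged.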

Based on these findings, we strongly recommend excluding constant and quadratic baselines, avoiding unstratified models, and using updated demographic data. Among the feasible candidates, only 208 models are recommended for estimating EM of a stratified population, mainly those with actuarial form, demographic resolution, and moderate to long forerun lengths.

By highlighting the methodological sensitivity of
estimates of expected mortality, this study aims to guide actuaries, statisticians, and
public-health modelers in constructing meaningful, reliable baselines for EM estimates.

Files

Excess_Models_submitted_20260203.pdf (3.2 MB, md5:7040938c5fedbb86e8da727972b6fe0e)