Published February 18, 2026 | Version v1
Preprint Open

Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects

  • 1. ROR icon Athena Research and Innovation Center In Information Communication & Knowledge Technologies
  • 2. ROR icon Harokopio University of Athens
  • 3. ATHENA research center
  • 4. ATHENA Research Center
  • 5. Max Planck ISS
  • 6. LMU

Description

Generalized additive models (GAMs) offer interpretability through independent univariate feature effects but underfit when interactions are present in data. GA2Ms add selected pairwise interactions which improves accuracy, but sacrifices interpretability and limits model auditing. We propose \emph{Conditionally Additive Local Models} (CALMs), a new model class, that balances the interpretability of GAMs with the accuracy of GA2Ms. CALMs allow multiple univariate shape functions per feature, each active in different regions of the input space. These regions are defined independently for each feature as simple logical conditions (thresholds) on the features it interacts with. As a result, effects remain locally additive while varying across subregions to capture interactions. We further propose a principled distillation-based training pipeline that identifies homogeneous regions with limited interactions and fits interpretable shape functions via region-aware backfitting. Experiments on diverse classification and regression tasks show that CALMs consistently outperform GAMs and achieve accuracy comparable with GA2Ms. Overall, CALMs offer a compelling trade-off between predictive accuracy and interpretability.

Files

2602.16503v1.pdf

Files (917.4 kB)

Name Size Download all
md5:a5d4bc626d35e4504726cb8fca1dd66e
917.4 kB Preview Download

Additional details

Funding

European Commission
AI-DAPT - AI-Ops Framework for Automated, Intelligent and Reliable Data/AI Pipelines Lifecycle with Humans-in-the-Loop and Coupling of Hybrid Science-Guided and AI Models 101135826