Published March 18, 2026 | Version 0.1.0
Preprint Open

Robustness Under Noise: A Comprehensive Analysis of Latent Factor Posterior Models in Noisy Compliance Environments

  • 1. Epalea

Description

We present a comprehensive empirical study examining the robustness of Latent Posterior Factor (LPF) models under varying degrees of data corruption in tax compliance classification tasks. Our experiments systematically evaluate model performance across five noise configurations ranging from clean data to extreme corruption (70% feature noise, 40% contradictory evidence). Results demonstrate that LPF models with Sum-Product Network (SPN) aggregation provide interpretable uncertainty quantification through predictable degradation curves, though they achieve lower absolute accuracy than BERT baselines and alternative architectures across all noise levels—a gap of 1–8 percentage points depending on noise severity. Through extensive seed testing (15 seeds per configuration) and multi-metric evaluation including Expected Calibration Error (ECE), Negative Log-Likelihood (NLL), and Brier scores, we establish that probabilistic evidence aggregation provides measurable robustness advantages in noisy environments. Our analysis reveals critical noise tolerance thresholds and quantifies the contribution of different architectural components to overall model resilience.

Keywords:
Model robustness, data corruption, Latent Posterior Factors (LPF), tax compliance classification, uncertainty quantification, probabilistic aggregation, sum-product networks (SPN), expected calibration error (ECE), negative log-likelihood (NLL), Brier score, interpretable AI, noise tolerance, neural-symbolic reasoning, multi-metric evaluation, machine learning resilience, evidence-based AI

Files

main.pdf

Files (2.1 MB)

Name Size Download all
md5:463625b6c1df42c8ff2eb7826bd1591d
2.1 MB Preview Download

Additional details

Related works

Is supplemented by
Preprint: 10.5281/zenodo.19183861 (DOI)
Preprint: 10.5281/zenodo.19184458 (DOI)
Preprint: arXiv:2603.15670 (arXiv)
Preprint: arXiv:2603.15674 (arXiv)

Software

Repository URL
https://github.com/aaaEpalea/epalea.git
Programming language
Python
Development Status
Active