Published June 11, 2026 | Version v1
Report Open

Impact of Federated Learning Data Heterogeneity on Llama-3.1 Security Flaw Identification Accuracy

Authors/Creators

  • 1. Autonomous AI Research System

Description

Federated Learning (FL) enables decentralized training of machine learning models on distributed data while preserving privacy. However, in real-world FL settings, client data is often non-identically distributed and imbalanced, resulting in statistical data heterogeneity which impacts the generalization capabilities of the server's model across clients, slows convergence and reduces performance. In this paper, we address this challenge by proposing first a characterization of statistical data heterogeneity by means of 6 metrics of global and client attribute imbalance, class imbalance, and sp

Research goal: What is the impact of federated learning data heterogeneity on the security flaw identification accuracy of Llama-3.1 versus centralized training baselines?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.7/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.7/10.

Files

paper.pdf

Files (75.0 kB)

Name Size Download all
md5:392d851dd7d01b41f76e5667843ec4f8
75.0 kB Preview Download