Comparison of Meta-Learning Convergence and F1-Score in Small and Large Language Models for Few-Shot Anomaly Detection

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20641574

Published June 11, 2026 | Version v1

Report Open

Comparison of Meta-Learning Convergence and F1-Score in Small and Large Language Models for Few-Shot Anomaly Detection

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Anomaly detection is a widely explored domain in machine learning. Many models are proposed in the literature, and compared through different metrics measured on various datasets. The most popular metrics used to compare performances are F1-score, AUC and AVPR. In this paper, we show that F1-score and AVPR are highly sensitive to the contamination rate. One consequence is that it is possible to artificially increase their values by modifying the train-test split procedure. This leads to misleading comparisons between algorithms in the literature, especially when the evaluation protocol is not

Research goal: How does the convergence speed and final F1-score of meta-learning frameworks compare between small (8B) and large (70B) language models in cross-domain few-shot anomaly detection?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

paper.pdf

Files (86.7 kB)

Name	Size	Download all
paper.pdf md5:92be728aca4637d789d68e9b54c8f6b5	86.7 kB	Preview Download

	All versions	This version
Views	2	2
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Comparison of Meta-Learning Convergence and F1-Score in Small and Large Language Models for Few-Shot Anomaly Detection

Authors/Creators

Description

Notes

Files

paper.pdf

Files (86.7 kB)