A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness

Pecher, Branislav; Srba, Ivan; Bielikova, Maria

doi:10.1145/3691339

Published September 2, 2024 | Version v1

Journal article Open

A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness

1. Kempelen Institute of Intelligent Technologies
2. Brno University of Technology
3. Slovak.AI

Learning with limited labelled data, such as prompting, in-context learning, fine-tuning, meta-learning or few-shot learning, aims to effectively train a model using only a small amount of labelled samples. However, these approaches have been observed to be excessively sensitive to the effects of uncontrolled randomness caused by non-determinism in the training process. The randomness negatively affects the stability of the models, leading to large variances in results across training runs. When such sensitivity is disregarded, it can unintentionally, but unfortunately also intentionally, create an imaginary perception of research progress. Recently, this area started to attract research attention and the number of relevant studies is continuously growing. In this survey, we provide a comprehensive overview of 415 papers addressing the effects of randomness on the stability of learning with limited labelled data. We distinguish between four main tasks addressed in the papers (investigate/evaluate; determine; mitigate; benchmark/compare/report randomness effects), providing findings for each one. Furthermore, we identify and discuss seven challenges and open problems together with possible directions to facilitate further research. The ultimate goal of this survey is to emphasise the importance of this growing research area, which so far has not received an appropriate level of attention, and reveal impactful directions for future research.

Files

Pecher-at-al_ACM-CSUR_A_Survey_of_Stability_of_Learning_with_Limited_Labelled_Data.pdf

Files (1.5 MB)

Name	Size	Download all
Pecher-at-al_ACM-CSUR_A_Survey_of_Stability_of_Learning_with_Limited_Labelled_Data.pdf md5:b4ab2656014320f6d5c6a53583a11e1e	1.5 MB	Preview Download

Additional details

DOI: 10.48550/arXiv.2312.01082
arXiv: arXiv:2312.01082

TAILOR – Foundations of Trustworthy AI - Integrating Reasoning, Learning and Optimization 952215: European Commission
DisAI – Improving scientific excellence and creativity in combating disinformation with artificial intelligence and language technologies 101079164: European Commission
vera.ai – vera.ai: VERification Assisted by Artificial Intelligence 101070093: European Commission

	All versions	This version
Views	23	23
Downloads	24	24
Data volume	39.3 MB	39.3 MB

A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness

Files

Pecher-at-al_ACM-CSUR_A_Survey_of_Stability_of_Learning_with_Limited_Labelled_Data.pdf

Files (1.5 MB)

Additional details

Identifiers

Funding

A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness

Creators

Description

Files

Pecher-at-al_ACM-CSUR_A_Survey_of_Stability_of_Learning_with_Limited_Labelled_Data.pdf

Files (1.5 MB)

Additional details

Identifiers

Funding