CLIP-based Few-Shot Multi-Label Classification Methods: A Comparative Study

Aktas, Yağmur Çiğdem; Castaño García, Jorge

doi:10.5281/zenodo.17966109

Published December 17, 2025 | Version v1

Conference proceeding Open

CLIP-based Few-Shot Multi-Label Classification Methods: A Comparative Study

1. Vicomtech

Categorizing darkweb image content is critical for identifying and averting potential threats. However, this remains a challenge due to the nature of the data, which includes
multiple co-existing domains and intra-class variations. While many methods have been proposed to classify this image content, multi-label multi-class classification remains underexplored. The complexity of darkweb imagery, combined with the need for efficient classification systems, demands innovative approaches that can handle both the technical challenges and the sensitive nature of the content. In this paper, we present a comparative study of few-shot multi-label classification methods using the
multimodal model CLIP. Our research addresses the growing need for robust classification systems that can effectively categorize diverse and complex image content while maintaining high accuracy and computational efficiency. We particularly focus on the challenges of handling multiple labels simultaneously and the scalability of these systems in real-world applications. We analyze and compare four different approaches: CLIP+Label Empower Adapter, CLIP Sigmoid, SIGLIP, and CLIP+ML-Decoder. Our study evaluates these methods based on their precision, recall, and ability to handle increasing class numbers efficiently. Finally, our research contributes to the field by providing detailed insights into the strengths and limitations of each method.

Files

CLIP_based_Few_Shot_Multi_Label_Classifi.pdf

Files (686.2 kB)

Name	Size	Download all
CLIP_based_Few_Shot_Multi_Label_Classifi.pdf md5:d693f79a2b3f6a4b576c766cccaa8dfc	686.2 kB	Preview Download

Additional details

European Commission
STARLIGHT - Sustainable Autonomy and Resilience for LEAs using AI against High priority Threats 101021797

Views

Downloads

Show more details

	All versions	This version
Views	33	33
Downloads	46	46
Data volume	32.9 MB	32.9 MB

More info on how stats are collected....

DOI

Resource type

Conference proceeding

Publisher

Zenodo

Imprint

1-6.

Conference

13th International Symposium on Digital Forensics and Security (ISDFS), Boston, MA, USA, 2025

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more; MIT License

A short and simple permissive license with conditions only requiring preservation of copyright and license notices. Licensed works, modifications, and larger works may be distributed under different terms and without source code. Read more

Technical metadata

Created: December 17, 2025
Modified: December 17, 2025

CLIP-based Few-Shot Multi-Label Classification Methods: A Comparative Study

Authors/Creators

Description

Files

CLIP_based_Few_Shot_Multi_Label_Classifi.pdf

Files (686.2 kB)

Additional details

Funding