Published October 24, 2025 | Version v1
Conference paper Open

Malware Detection in Docker Containers: An Image is Worth a Thousand Logs

Description

Malware detection is increasingly challenged by evolving techniques like obfuscation and polymorphism, limiting the effectiveness of traditional methods. Meanwhile, the widespread adoption of software containers has introduced new security challenges, including the growing threat of malicious software injection, where a container, once compromised, can serve as entry point for further cyberattacks. In this work, we address these security issues by introducing a method to identify compromised containers through machine learning analysis of their file systems. We cast the entire software containers into large RGB images via their tarball representations, and propose to use established Convolutional Neural Network architectures on a streaming, patchbased manner. To support our experiments, we release the COSOCO dataset-the first of its kind-containing 3364 largescale RGB images of benign and compromised software containers at https://huggingface.co/datasets/k3ylabs/cosoco-imagedataset. Our method detects more malware and achieves higher F1 and Recall scores than all individual and ensembles of VirusTotal engines, demonstrating its effectiveness and setting a new standard for identifying malware-compromised software containers.

Files

2504.03238v1.pdf

Files (18.6 MB)

Name Size Download all
md5:4df9405816a480fb1505c24014f78596
18.6 MB Preview Download

Additional details

Funding

European Commission
DYNABIC - Dynamic business continuity of critical infrastructures on top of adaptive multi-level cybersecurity 101070455

Dates

Available
2026-09-26