Published December 31, 2024 | Version v1
Dataset Open

DataSet for ICSE SEIP 25: Detecting Python Malware in the Software Supply Chain with Program Analysis

  • 1. ROR icon National University of Singapore

Description

* MalOSS: subset of malicious packages from MalOSS dataset [RQ1, RQ2, RQ4]
* BackStabber: subset of malicious packages from BackStabber Knife's Collection [RQ1, RQ2, RQ4]
* MalRegistry: subset of malicious packages from Python MalRegistry dataset [RQ1, RQ2, RQ4]
* Popular: a collection of top-100 most popular python packages from PyPI [RQ1, RQ2, RQ3, RQ4]
* Trusted: a collection of packages from trusted organizations hosted in PyPI [RQ1, RQ2, RQ3, RQ4]
* DataKund: a collection of newly identified malicious packages from PyPI [Case Study]
* Recent: a collection of packages that were recently (2024 Oct) added to PyPI [Macaron Case Study]

Files

Files (3.6 GB)

Name Size Download all
md5:47f9f277e44c7ff39f00c776e873452d
3.6 GB Download