Published December 31, 2024
| Version v1
Dataset
Open
DataSet for ICSE SEIP 25: Detecting Python Malware in the Software Supply Chain with Program Analysis
Description
* MalOSS: subset of malicious packages from MalOSS dataset [RQ1, RQ2, RQ4]
* BackStabber: subset of malicious packages from the Backstabber's Knife Collection [RQ1, RQ2, RQ4]
* MalRegistry: subset of malicious packages from Python MalRegistry dataset [RQ1, RQ2, RQ4]
* Popular: a collection of the top 100 most popular Python packages from PyPI [RQ1, RQ2, RQ3, RQ4]
* Trusted: a collection of packages from trusted organizations hosted on PyPI [RQ1, RQ2, RQ3, RQ4]
* DataKund: a collection of newly identified malicious packages from PyPI [Case Study]
* Recent: a collection of packages that were added to PyPI recently (October 2024) [Macaron Case Study]
Files
(3.6 GB)
Checksum | Size |
---|---|
md5:47f9f277e44c7ff39f00c776e873452d | 3.6 GB |
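Because the archive is large (3.6 GB), it may be worth checking the download against the MD5 checksum listed above before unpacking it. The following is a minimal sketch of such a check, not part of the dataset itself. The file path passed to `verify_download` is hypothetical, since the record does not name the archive here, and the helper names `md5_of_file` and `verify_download` are illustrative.

```python
import hashlib

# MD5 checksum as listed in the Files table of this record.
EXPECTED_MD5 = "47f9f277e44c7ff39f00c776e873452d"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a multi-gigabyte archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected MD5 digest."""
    return md5_of_file(path) == expected.lower()
```

A call such as `verify_download("path/to/downloaded-archive")` (placeholder path) returns `True` only when the local file matches the published checksum. MD5 is used here solely because it is the checksum the record provides; it detects corrupted or truncated downloads but is not a strong guarantee against deliberate tampering.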