Published January 10, 2026 | Version v1.0
Software Open

Data and Code for Anticancer Peptide Prediction Using Machine Learning

  • 1. University of Electronic Science and Technology of China

Description

Aegis is an end-to-end computational pipeline for identifying anticancer peptides (ACPs) from peptide sequences and visualising model performance. This record contains the source code and associated datasets used in the study, supporting reproducible machine learning–based ACP prediction.

The pipeline covers data preprocessing, feature extraction, model training, probability export, and performance visualisation. Raw peptide files are cleaned and merged into unified feature matrices, followed by model training and evaluation. The provided scripts generate publication-ready figures, including ROC and precision–recall curves, incremental feature selection (IFS) curves, and UMAP visualisations.
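As an illustration of the preprocessing and feature-extraction stages, the sketch below cleans peptide strings and encodes each one as an amino acid composition (AAC) vector. This record does not state which descriptors Aegis computes, so AAC is used here only as a common baseline, and the names `clean_sequence`, `aac_features`, and `build_matrix` are hypothetical rather than taken from the repository.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def clean_sequence(seq: str) -> str:
    """Remove all whitespace and upper-case the peptide string."""
    return "".join(seq.split()).upper()


def aac_features(seq: str) -> list[float]:
    """Return the fraction of each standard residue in the peptide.

    Assumption: sequences with non-standard residues are rejected;
    the actual pipeline may handle them differently.
    """
    seq = clean_sequence(seq)
    if not seq or any(ch not in AMINO_ACIDS for ch in seq):
        raise ValueError(f"not a standard peptide sequence: {seq!r}")
    counts = Counter(seq)
    return [counts[aa] / len(seq) for aa in AMINO_ACIDS]


def build_matrix(peptides: list[str]) -> list[list[float]]:
    """Stack per-peptide AAC vectors into one feature matrix (rows = peptides)."""
    return [aac_features(p) for p in peptides]
```

Each row of the resulting matrix has 20 entries that sum to 1, so peptides of different lengths become directly comparable inputs for a classifier.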

The workflow is implemented in Python (≥3.9) and is compatible with probability outputs generated by iLearnPlus (≥v0.1.4). Users can reproduce all analyses and figures reported in the associated manuscript, or apply the pipeline to new peptide datasets for ACP prediction and benchmarking.
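To show how exported probabilities can feed the ROC analysis, here is a minimal standard-library sketch that turns true labels and predicted positive-class probabilities into ROC points and an AUC value. It assumes the labels (1 = ACP, 0 = non-ACP) and probabilities have already been read into two parallel lists; the file layout of the iLearnPlus export is not described in this record, so parsing is left out, and the function names are hypothetical.

```python
def roc_curve_points(labels, probs):
    """Return (fpr, tpr) lists from sweeping the decision threshold high to low."""
    pos = sum(1 for y in labels if y == 1)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative example")

    # Sort examples by predicted probability, highest first.
    pairs = sorted(zip(probs, labels), key=lambda t: t[0], reverse=True)
    fpr, tpr = [0.0], [0.0]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Consume every example tied at this score before emitting a point,
        # so ties produce a single diagonal segment.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        fpr.append(fp / neg)
        tpr.append(tp / pos)
    return fpr, tpr


def auc(fpr, tpr):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum(
        (fpr[k] - fpr[k - 1]) * (tpr[k] + tpr[k - 1]) / 2
        for k in range(1, len(fpr))
    )
```

The returned `fpr` and `tpr` lists can be passed to any plotting library to draw the curve; a precision-recall curve can be built from the same sorted sweep by tracking `tp / (tp + fp)` instead of the false positive rate.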

Files

README.md

Files (1.1 MB)

Name Size
md5:1df5ee6ad22973ee87c5cc575c008b73 1.1 MB
md5:b8c2306a74316fdd5ff5b5be1f068618 2.2 kB

Additional details

Software

Repository URL
https://github.com/xiao-zhu-pei-mei/Aegis
Programming language
Python