Data and Code for Anticancer Peptide Prediction Using Machine Learning
Authors/Creators
Description
Aegis is an end-to-end computational pipeline for identifying anticancer peptides (ACPs) from peptide sequences and visualising model performance. This record contains the source code and associated datasets used in the study, supporting reproducible machine learning–based ACP prediction.
The pipeline covers data preprocessing, feature extraction, model training, probability export, and performance visualisation. Raw peptide files are cleaned and merged into unified feature matrices, followed by model training and evaluation. The provided scripts generate publication-ready figures, including ROC and precision–recall curves, incremental feature selection (IFS) curves, and UMAP visualisations.
The workflow is implemented in Python (≥3.9) and is compatible with probability outputs generated by iLearnPlus (≥v0.1.4). Users can reproduce all analyses and figures reported in the associated manuscript, or apply the pipeline to new peptide datasets for ACP prediction and benchmarking.
Files
README.md
Files
(1.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:1df5ee6ad22973ee87c5cc575c008b73
|
1.1 MB | Preview Download |
|
md5:b8c2306a74316fdd5ff5b5be1f068618
|
2.2 kB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/xiao-zhu-pei-mei/Aegis
- Programming language
- Python