Amino Acid Composition drives Peptide Aggregation: Predicting Aggregation for Improved Synthesis
Authors/Creators
Description
Overview
This repository contains the raw experimental data associated with the manuscript entitled:
“Amino Acid Composition Drives Peptide Aggregation: Predicting Aggregation for Improved Synthesis.”
The data were generated as part of a study investigating the relationship between amino acid composition and aggregation propensity during solid-phase peptide synthesis (SPPS).
Contents
The repository contains the following files and folders:
Zip files with peptide names: Contain all the raw experimental data, uHPLC and LCMS done to characterize the syntheses. Labbook.xlsx helps navigating these.
Synthesis_data.zip contains all raw data which serve as the foundation for the machine learning models and analyses presented in the manuscript.
UZH_data_clean.csv is the processed synthesis data from our lab used to train the ML algorithms
How to Use the Data
This dataset is released under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and adaptation of the data, provided appropriate credit is given.
Researchers may, for example:
Reproduce or extend the machine learning models described in the associated publication
Conduct independent analyses of peptide sequence–aggregation relationships
Benchmark novel models or computational approaches using this dataset as a reference
We encourage the reuse of this dataset for both academic and commercial purposes, in line with the principles of open and reproducible science.
This dataset is released under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
Citation
Please cite this dataset as:
Bálint Tamás, Marvin Alberts, Teodoro Laino, and Nina Hartrampf, (2025). Raw Experimental Data for "Amino Acid Composition Drives Peptide Aggregation: Predicting Aggregation for Improved Synthesis." Zenodo. https://doi.org/10.26434/chemrxiv-2025-wjbmv
Files
Barstar.zip
Files
(12.7 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:9a13a1f942f5021b8cff4888946537ce
|
1.6 GB | Preview Download |
|
md5:0ad9761041c134ac107ae28dc2513560
|
1.9 GB | Preview Download |
|
md5:6f220a68c6d6b6ca057b88bdc572064a
|
1.6 GB | Preview Download |
|
md5:44db11db2918d718bbcc197fc0b5a528
|
1.6 GB | Preview Download |
|
md5:9f9ba88f5c6cb1930096078ed1c513bc
|
15.6 kB | Download |
|
md5:7ae30d46337e74eaa4c670447ffb4e87
|
1.7 GB | Preview Download |
|
md5:c59afdd08608bbfa2bd4ff874ecd84db
|
248.8 MB | Preview Download |
|
md5:f02ba37c2cb5f58a14f5ba45ca8c891d
|
1.5 GB | Preview Download |
|
md5:8501969930821f6e414b64e2321f9774
|
1.9 GB | Preview Download |
|
md5:9984ce45620cc3ef2c87679f53f2363a
|
121.4 MB | Preview Download |
|
md5:89426eb49c0927e1315ed47999247def
|
2.3 kB | Preview Download |
|
md5:58d1babb5d90afbeb237e018986566ed
|
571.9 MB | Preview Download |
|
md5:7d82c41b7be2adf53c2d0815a8d8ef9c
|
116.8 kB | Preview Download |
Additional details
Funding
- Swiss National Science Foundation
- NCCR Catalysis (phase II) 225147
- Swiss National Science Foundation
- Flow-Based Methods for Chemical Peptide and Protein Synthesis 200865
Software
- Repository URL
- https://github.com/rxn4chemistry/AI4Aggregation