Published April 11, 2025 | Version v2
Dataset Open

Amino Acid Composition drives Peptide Aggregation: Predicting Aggregation for Improved Synthesis

  • 1. ROR icon University of Zurich
  • 2. ROR icon IBM Research - Zurich
  • 3. ROR icon NCCR Catalysis

Description

Overview

This repository contains the raw experimental data associated with the manuscript entitled:
“Amino Acid Composition Drives Peptide Aggregation: Predicting Aggregation for Improved Synthesis.”

The data were generated as part of a study investigating the relationship between amino acid composition and aggregation propensity during solid-phase peptide synthesis (SPPS).
Contents

The repository contains the following files and folders:

    Zip files with peptide names: Contain all the raw experimental data, uHPLC and LCMS done to characterize the syntheses. Labbook.xlsx helps navigating these.

    Synthesis_data.zip contains all raw data which serve as the foundation for the machine learning models and analyses presented in the manuscript.

    UZH_data_clean.csv is the processed synthesis data from our lab used to train the ML algorithms
   

How to Use the Data

This dataset is released under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and adaptation of the data, provided appropriate credit is given.

Researchers may, for example:

    Reproduce or extend the machine learning models described in the associated publication

    Conduct independent analyses of peptide sequence–aggregation relationships

    Benchmark novel models or computational approaches using this dataset as a reference

We encourage the reuse of this dataset for both academic and commercial purposes, in line with the principles of open and reproducible science.

This dataset is released under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
Citation

Please cite this dataset as:

Bálint Tamás, Marvin Alberts, Teodoro Laino, and Nina Hartrampf, (2025). Raw Experimental Data for "Amino Acid Composition Drives Peptide Aggregation: Predicting Aggregation for Improved Synthesis." Zenodo. https://doi.org/10.26434/chemrxiv-2025-wjbmv

Files

Barstar.zip

Files (12.7 GB)

Name Size Download all
md5:9a13a1f942f5021b8cff4888946537ce
1.6 GB Preview Download
md5:0ad9761041c134ac107ae28dc2513560
1.9 GB Preview Download
md5:6f220a68c6d6b6ca057b88bdc572064a
1.6 GB Preview Download
md5:44db11db2918d718bbcc197fc0b5a528
1.6 GB Preview Download
md5:9f9ba88f5c6cb1930096078ed1c513bc
15.6 kB Download
md5:7ae30d46337e74eaa4c670447ffb4e87
1.7 GB Preview Download
md5:c59afdd08608bbfa2bd4ff874ecd84db
248.8 MB Preview Download
md5:f02ba37c2cb5f58a14f5ba45ca8c891d
1.5 GB Preview Download
md5:8501969930821f6e414b64e2321f9774
1.9 GB Preview Download
md5:9984ce45620cc3ef2c87679f53f2363a
121.4 MB Preview Download
md5:89426eb49c0927e1315ed47999247def
2.3 kB Preview Download
md5:58d1babb5d90afbeb237e018986566ed
571.9 MB Preview Download
md5:7d82c41b7be2adf53c2d0815a8d8ef9c
116.8 kB Preview Download

Additional details

Funding

Swiss National Science Foundation
NCCR Catalysis (phase II) 225147
Swiss National Science Foundation
Flow-Based Methods for Chemical Peptide and Protein Synthesis 200865