Published November 8, 2025 | Version v1
Dataset Open

Simulating Tandem Mass Spectra for Small Molecules using a General-Purpose Large-Language Model

  • 1. Icahn School of Medicine at Mount Sinai

Description

Files for testing the hypothesis that a Google Gemini model can predict mass spectra for small molecules.

Gemini_Prediction_Positive_Mass_Spectra_11.04.2025.msp -- Gemini-predicted spectra (M+H) for 1436 compounds
Gemini_prediction_Negative_Mass_Spectra_11.04.2025.msp -- Gemini-predicted spectra (M-H) for 835 compounds
IDSL.CSA_parameters_for_negative_mode.xlsx -- spectra similarity search parameters for negative mode
IDSL.CSA_parameters_for_positive_mode.xlsx -- spectra similarity search parameters for positive mode
Negative_mode_spectra_similarity_results.csv  --- results of spectra similarity search (negative mode)
Positive_mode_spectra_similarity_results.csv --- results of spectra similarity search (positive mode)
gemini_prompt_negative_mode.txt ------- chain-of-thought prompt for predicting M-H spectra
gemini_prompt_positive_mode.txt ------- chain-of-thought prompt for predicting M+H spectra
gemini_spectra_generate.py ------- python script to call the Gemini API and populate the spectral library
negative_mode_nodes_cytoscape.txt --- Cytoscape network visualization node details (M-H)
positive_mode_nodes_cytoscape.txt --- Cytoscape network visualization node details (M+H)
sample-gemini-key.txt  --- file to place the Gemini API key
test_candidate_pubchem.json --- Chemical details obtained from the PubChem database                       
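
The two .msp files are plain-text spectral libraries. As a rough sketch of how they might be read, the Python function below parses NIST-style MSP text into records. It assumes the common layout of "Key: value" metadata lines followed by one "m/z intensity" pair per line, with a blank line between records. The field names in the sample (Name, PrecursorMZ, Num Peaks) and all values are illustrative assumptions, not taken from the deposited files, and parse_msp is a hypothetical helper, not part of gemini_spectra_generate.py.

```python
from typing import Dict, List, Tuple


def parse_msp(text: str) -> List[Dict[str, object]]:
    """Parse NIST-style MSP text into a list of {'meta', 'peaks'} records.

    Assumes 'Key: value' metadata lines, one 'mz intensity' pair per
    peak line, and blank lines between records.
    """
    records: List[Dict[str, object]] = []
    meta: Dict[str, str] = {}
    peaks: List[Tuple[float, float]] = []
    # The extra empty string flushes the final record if the text
    # does not end with a blank line.
    for raw in text.splitlines() + [""]:
        line = raw.strip()
        if not line:
            if meta or peaks:
                records.append({"meta": meta, "peaks": peaks})
                meta, peaks = {}, []
            continue
        if ":" in line and not line[0].isdigit():
            # Metadata line; keys are lower-cased for uniform lookup.
            key, _, value = line.partition(":")
            meta[key.strip().lower()] = value.strip()
        else:
            # Peak line; only the first two tokens are used.
            tokens = line.split()
            peaks.append((float(tokens[0]), float(tokens[1])))
    return records


# Made-up sample in the assumed layout (not real data from this record).
sample = """Name: Example compound
PrecursorMZ: 181.0707
Num Peaks: 2
85.0284 100
163.0601 45

Name: Another compound
Num Peaks: 1
59.0133 999
"""

records = parse_msp(sample)
```

To read one of the deposited libraries, the file contents could be passed in the same way, for example parse_msp(open(path).read()). If the files use a different peak layout (several peaks per line, or peak annotations), the peak branch would need adjusting.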

Files


Files (9.0 MB)

Checksum Size
md5:05daaa23e3dd2af858a4f57fa41a3d79 1.2 MB
md5:97b678a453146da2ff34b8482cc5a36e 2.2 MB
md5:664aa197b676f8021fd56189a1b16ec8 6.2 kB
md5:e3ce543974076e60ef1ad964fc1d1ebd 6.1 kB
md5:485cbfb132a3377dd4bdad1222bb2cac 14.6 kB
md5:c63a2b7a68b2aaf29bfdc21035ea41fc 40.1 kB
md5:39d905ffb90668fff5b6a66248deda09 40.1 kB
md5:0b8d73977a756cc5ebb65a6d4adc4dd9 95.1 kB
md5:0059911f26e22817f4f545532a9047e6 1.5 MB
md5:436a031a8f09e032b4294e4ef7672e42 200.1 kB
md5:756b6399992aba7442fe48cdf298a4ee 3.7 MB
md5:af3bc98a1cbcce2c1383b15d4b810004 33 Bytes
md5:8a599a764dbe24d75c40d29da295ce1c 2.9 kB
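
The md5 values above can be used to check that a downloaded file arrived intact. The sketch below uses Python's standard hashlib module. The helper names md5_of_file and matches_listed_md5 are hypothetical. This listing does not show which checksum belongs to which file name, so that pairing has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path: str, listed: str) -> bool:
    """Compare a file against a listed value such as 'md5:05da...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as matches_listed_md5("sample-gemini-key.txt", "md5:...") returns True only when the local file is byte-identical to the deposited one. The checksum string here is left elided on purpose and should be copied from the entry for that file.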