Published April 9, 2025 | Version v1.0
Dataset Open

Dataset for: Deep learning prediction of noise-driven nonlinear instabilities in fibre optics

  • 1. ROR icon XLIM
  • 2. ROR icon Centre National de la Recherche Scientifique
  • 3. ROR icon Université de Limoges
  • 4. ROR icon Franche-Comté Électronique Mécanique Thermique et Optique - Sciences et Technologies
  • 5. ROR icon Leibniz University Hannover

Description

This dataset accompanies the study Y. Boussafa et al. "Deep learning prediction of noise-driven nonlinear instabilities in fibre optics", Nature Communications, 16, 7800 (2025) and includes four curated datasets used to train and evaluate artificial neural networks (ANNs) for predicting spectral features resulting from modulation instability (MI) in nonlinear fibre propagation.

The datasets are:

  1. Numerical – 2 seeds
    GNLSE-based simulations with two coherent input seeds. Each seeding scenario (90 000 in total) includes:

    • Input seed parameters (wavelengths and spectral phases)

    • Output average spectra

    • Output spectral correlation maps computed from 500 Monte Carlo realizations

  2. Numerical – 4 seeds
    Same as above, with four coherent seeds per scenario (105 000 in total). Spectral correlation maps also computed from 500 GNLSE simulations per configuration.

  3. Experimental – 2 seeds
    Real-time DFT measurements of MI with two coherent input seeds. Each case includes:

    • Input seed parameters (defined via programmable filtering)

    • Output average spectra

    • Output spectral correlation maps computed from 1000 sequential DFT traces

  4. Experimental – 4 seeds
    Same as above, using four coherent seeds. Spectral correlation maps derived from 1000 DFT measurements per seeding configuration.

Notes:

All data are provided in physical units prior to ANN standardization, ensuring transparency and compatibility with custom preprocessing pipelines.

Data provided are the ones used to train the networks provided in Figs. 3, 5, 6, 7 of the main manuscript (https://doi.org/10.1038/s41467-025-62713-x). Traces windowing and sampling were however performed, in line with the described "Methods" section of the manuscript, to keep the datasize reasonable, and compatible with ANN processing. Full raw data (including all GNLSE realizations and unprocessed DFT traces) are available upon request due to their large size.

 

Please cite the corresponding paper (Y. Boussafa et al. Deep learning prediction of noise-driven nonlinear instabilities in fibre optics, Nature Communications, 16, 7800  2025) and dataset DOI (10.5281/zenodo.15179897) when using this data in publications

Files

README.md

Files (33.7 GB)

Name Size Download all
md5:cb6698be96d44d598ba701adfec09cde
3.3 GB Download
md5:f58ad4c34656491b496de9d24e9189b2
3.3 GB Download
md5:747bad3b85b8b5c3e1ea2d5ad28e5f9a
12.5 GB Download
md5:a95114ea403816cca85c6a00ca3dedc2
14.6 GB Download
md5:bce9c75164500ccc65d6d00abbe15477
4.1 kB Preview Download
md5:bce9c75164500ccc65d6d00abbe15477
4.1 kB Preview Download

Additional details

Additional titles

Subtitle
Numerical and experimental dataset for ANN training

Related works

Is supplement to
Publication: 10.1038/s41467-025-62713-x (DOI)

Funding

European Commission
STREAMLINE - Smart phoTonic souRces harnEssing Advanced Multidimensional Light optimization towards machIne-learNing-Enhanced imaging 950618
European Commission
QFreC - Smart protonic quantum frequency circuits 947603
Agence Nationale de la Recherche
OPTIMAL ANR-20-CE30-0004
Conseil Régional de Nouvelle-Aquitaine
SPINAL
Conseil Régional de Nouvelle-Aquitaine
PILIM

Dates

Created
2025-04-10
Initial submission of ANN training datasets