Published June 3, 2026 | Version v3
Dataset Open

Energies, structures descriptors of two-polaron configurations in defected Li4Ti5O12

  • 1. University of bayreuth
  • 2. ROR icon Fritz Haber Institute of the Max Planck Society
  • 3. ROR icon Institut Charles Gerhardt Montpellier
  • 4. Chair for Theoretical Physics VII and Bavarian Center for Battery Technologies, University of Bayreuth, Universitätsstr. 30, 95447 Germany

Description

This dataset contains structures, distance descriptors and relative energies of two-polaron configurations in two different defected supercells of Li4Ti5O12. Additionally, it contains jupyter notebooks with XGBoost models used to predict the energies of polarons in each supercell from the given descriptors.

Data Structure

The main directory contains the following two subdirectories:

  • lto_cell_A: This directory contains two subdirectories, opt_pol_structures and Notebook_ML_Model.
    • opt_pol_structures: It contains directories for the DFT+U optimized geometry files, including occupation matrices for different polaronic configurations for LTO_Cell_A, used for training and testing the machine learning model.
    • Notebook_ML_Model: It contains the following files: 
      • all_calculated_3.dat: This file contains the DFT calculated relative energies and features corresponding to different polaronic configurations for LTO_Cell_A.
      • polaron_data: This file contains features for all possible 780 polaronic configurations for the LTO simulation cell A.
      • XGB_reg.ipynb: This Jupyter notebook is used for training and evaluating the XGBoost regression model. It includes data preprocessing, model training, and evaluation steps, as well as visualizations of the results.
      • lto_model.json: This file contains the saved XGBoost model in JSON format.
      • training.dat and test.dat: These files contain the training and testing datasets used for the model. 
  • lto_cell_B: This directory also contains two subdirectories, opt_pol_structures and Notebook_ML_Model
    • opt_pol_structures: Similar to the one in lto_cell_A, it contains directories for the DFT+U optimized geometry files for LTO_Cell_B.
    • Notebook_ML_Model: It contains the following files: 
      • all_calculated.dat: This file contains the DFT+U calculated relative energies and features corresponding to different 196 polaronic configurations for LTO_Cell_B used for training and testing purposes.
      • all_feature.dat: This file contains features for all possible 74880 polaronic configurations for the LTO simulation cell B.
      • XGB_reg.ipynb: This Jupyter notebook is used for training and evaluating the XGBoost regression model for LTO_Cell_B. It includes data preprocessing, model training, and evaluation steps, as well as visualizations of the results.
      • lto_model.json: This file contains the saved XGBoost model in JSON format for LTO_Cell_B.
      • training.dat and test.dat: These files contain the training and testing datasets used for the model in LTO_Cell_B.
  • lto_cell_C: This directory also contains two subdirectories, opt_pol_structures and Notebook_ML_Model.
    • opt_pol_structures: Similar to the one in lto_cell_A, it contains directories for the DFT+U optimized geometry files for LTO_Cell_C.
    • Notebook_ML_Model: It contains the following files: 
      • all_calculated.dat: This file contains the DFT+U calculated relative energies and features corresponding to different 188 polaronic configurations for LTO_Cell_C used for training and testing purposes. 
      • XGB_reg.ipynb: This Jupyter notebook is used for training and evaluating the XGBoost regression model for LTO_Cell_C. It includes data preprocessing, model training, and evaluation steps, as well as visualizations of the results.
      • training.dat and test.dat: These files contain the training and testing datasets used for the model in LTO_Cell_C. 




Notes

H.O. recognises support from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) within the CRC 1585 MultiTrans. Furthermore, H.O. and T.S. gratefully acknowledge DFG funding from project PolarBatt (OB 425/13-1) and support from the DFG IRTG 2818 OPTEXC. Computations were carried out on the HPC systems festus (DFG grant number 523317330) of the University of Bayreuth, Fritz (NHR@FAU grant number b243cb, DFG grant number 440719683) of the Erlangen National High Performance Computing Center (NHR@FAU) of the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), and Raven at the Max Planck Computing and Data Facility.

Files

lto_bulk_project_1.1.zip

Files (14.3 MB)

Name Size Download all
md5:c4e102508bc00c5f8cb8171b7227f071
14.3 MB Preview Download

Additional details

Funding

Deutsche Forschungsgemeinschaft
MultiTrans CRC 1585
Deutsche Forschungsgemeinschaft
PolarBatt OB 425/13-1
Deutsche Forschungsgemeinschaft
OPTEXC IRTG 2818
Deutsche Forschungsgemeinschaft
Festus 523317330
Deutsche Forschungsgemeinschaft
NHR@FAU Fritz 440719683