Published March 17, 2026 | Version 03-17-2026
Software Open

BERTology of Molecular Property Prediction (Data Size Effect: Base-BERT, Part 3)

  • 1. ROR icon Molecular Sciences Software Institute
  • 2. ROR icon Virginia Tech

Description

Data Size Effects on Pre-Training 

Base-BERT (Part 3)

This record pertains to Data Size Effects on Pre-Training Experiments.
It involves the following files:
  • base-binidx-3-ms-1234-ds-1234.tar.gz
  • base-binidx-3-ms-1234-ds-2345.tar.gz
  • base-binidx-3-ms-2345-ds-1234.tar.gz
Each tar file contains all model artifacts (checkpoints, random-number generator states, optimizer states etc.), training logs (Tensorboard, MLFlow and Weights & Biases), and evaluation results, configuration files, run scripts, SLURM sbatch driver scripts, and any additional artifacts generated during the experiments.
 
Preprint: https://doi.org/10.48550/arXiv.2603.13627

Files

Files (39.3 GB)

Name Size Download all
md5:d177c03718dc8f3bc639f5b9c40061a5
13.1 GB Download
md5:dfe30e0a5dc12c15468eeaf5b0fa9093
13.1 GB Download
md5:953ca538746e0ee2bb9889e505292d4b
13.1 GB Download

Additional details

Software

Repository URL
https://github.com/molssi-ai/bertology
Programming language
Python
Development Status
Active