Published November 20, 2025 | Version v1
Dataset Open

Database for multi-tier machine learning optimization of organic photovoltaic photoactive layer fabrication

  • 1. ROR icon Henan Normal University
  • 2. ROR icon Shandong University

Description

This dataset provides a standardized, multi-tiered resource for optimizing organic photovoltaic devices through data-driven methods. It integrates donor/acceptor molecular structures (with SMILES, CDK fingerprints, and electronic descriptors) with nine key photoactive layer processing parameters and device efficiency.
The data is uniquely structured in three tiers to guide optimization: from single-parameter analysis, through stage-combined synergies (solution preparation, spin-coating, post-processing), to global optimization with all parameters. This structure facilitates the development and validation of both interpretable and high-performance machine learning models.
For complete details on the dataset structure, file descriptions, and exact data dictionary, please refer to the README.txt file included in the download.

Files

opv-multi-tier-ml-database.zip

Files (2.2 MB)

Name Size Download all
md5:858c3fc3f6ab171496f39883464e5ab8
2.2 MB Preview Download

Additional details

Funding

National Natural Science Foundation of China
22403026
National Natural Science Foundation of China
22325302