Published September 30, 2022 | Version v1
Dataset Open

OQM9HK: A Large-scale Graph Dataset for Machine Learning in Materials Science

Authors/Creators

  • 1. RIMCS LLC

Description

This is a large-scale graph dataset of materials science based on the Open Quantum Materials Database (OQMD) v1.5 .

Technical Report

RIMCS Website

Data Loading

A Python code example:

import sys
sys.path.append('/your/path/to/data/OQM9HK_BEL')
import OQM9HK

bel_path='/your/path/to/data/OQM9HK_BEL'

config = OQM9HK.load_config(path=bel_path)
print(config['atomic_numbers'])

split = OQM9HK.load_split(path=bel_path)
print(len(split['train']), len(split['val']), len(split['test']))

graph_data = OQM9HK.load_graph_data(path=bel_path)
name = next(iter(graph_data)) # Frist entry's name
graph = graph_data[name] # Graph object
print(graph.nodes)
print(graph.edge_sources)
print(graph.edge_targets)

dataset = OQM9HK.load_targets(path=bel_path) # Pandas dataframe
print(dataset)
train_set = dataset.iloc[split['train']]
val_set = dataset.iloc[split['val']]
test_set = dataset.iloc[split['test']]

 

Files

Files (391.8 MB)

Name Size Download all
md5:943ea3fb6f758b62b39751569346682a
120.2 MB Download
md5:a5ef5f2c8bd1bd1f1d20d630ffdfc8e7
132.2 MB Download
md5:e10d0dc93ded49ce10e9d4c0604d0945
139.5 MB Download