Published August 2, 2025 | Version v3
Dataset Open

Atomic Layer Deposition and Etching Process Schemas Extracted with Schema-Miner

  • 1. ROR icon Technische Informationsbibliothek (TIB)
  • 2. ROR icon Eindhoven University of Technology
  • 3. ROR icon University of Warwick
  • 4. TIB - Technische Informationsbibliothek
  • 5. ROR icon University of Brescia
  • 6. ROR icon Leibniz University Hannover
  • 7. Technische Informationsbibliothek Universitätsbibliothek Hannover

Description

This data repository contains the extracted JSON schemas for Atomic Layer Deposition (ALD) and Atomic Layer Etching (ALE) processes, extracted using the Schema-Miner tool. The schemas are categorized under two distinct use cases: experimental and simulation, to reflect the differing perspectives in ALD/E process modeling.

Each schema captures essential process properties, along with their respective constraints, data types, and other structural details necessary for standardized representation and interoperability. The schema extraction was conducted from a curated set of scientific publications related to ALD/E, leveraging large language models (LLMs) in combination with domain-expert insights to ensure both accuracy and relevance.

The complete methodology is described in our paper “LLMs4SchemaDiscovery: A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models,” presented at ESWC Conference 2025. Read the paper here.

The Schema-Miner tool is publicly available on GitHub: Access the repository.

The extracted schemas have also been uploaded as templates to the Open Research Knowledge Graph (ORKG), and can be accessed via the following links:

Atomic Layer Deposition: 

Atomic Layer Etching

What is New in Version 3?

This version introduces significant improvements only to the ALD Experimental schema, while other schemas remain unchanged from Version 2. The following updates have been made:

  1. The schema now models ALD processes involving binary, ternary, or quaternary compounds, enhancing its applicability to complex material systems.
  2. Processes utilizing supercycle-based ALD, such as those for IGZO (Indium Gallium Zinc Oxide), are now explicitly supported through extended schema structures.
  3. The growthPerCycle property now includes growth values for each constituent compound within a super cycle, number of cycles per compound, total film thickness.
  4. Flow rates of individual reactants are now captured as explicit process parameters.
  5. Each property under material properties is now annotated with its corresponding characterization method, which shows the technique used to compute the corresponding property value.
  6. A new device properties section has been added to capture Thin-Film Transistor (TFT) performance metrics, including field effect mobility, threshold voltage, subthreshold swing and onOffRatio.

Files

ALD-experimental-schema.json

Files (907.1 kB)

Name Size Download all
md5:8229906296fe51a409cda46838f0ab27
55.7 kB Preview Download
md5:b63b8b6e5a4fb1fc99d1c23cc03b55a0
39.1 kB Preview Download
md5:ae5aa77c690704ee247b645edc382aea
29.5 kB Preview Download
md5:cf69ccbeca115c93a7ccc23cabed5714
52.0 kB Preview Download
md5:1bbfbcb84543215c82e32667ebf1b017
730.8 kB Preview Download

Additional details

Software

Repository URL
https://github.com/sciknoworg/schema-miner
Development Status
Active