Published March 30, 2022 | Version v1
Dataset Open

OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis

  • 1. New York University
  • 2. University of Calgary

Description

Logic synthesis is a challenging and widely-researched combinatorial optimization problem during integrated circuit (IC) design. It transforms a high-level description of hardware in a programming language like Verilog into an optimized digital circuit netlist, a network of interconnected Boolean logic gates, that implements the function. Spurred by the success of ML in solving combinatorial and graph problems in other domains, there is growing interest in the design of ML-guided logic synthesis tools. Yet, there are no standard datasets or prototypical learning tasks defined for this problem domain. Here, we describe OpenABC-D,a large-scale, labeled dataset produced by synthesizing open source designs with a leading open-source logic synthesis tool and illustrate its use in developing, evaluating and benchmarking ML-guided logic synthesis. OpenABC-D has intermediate and final outputs in the form of 870,000 And-Inverter-Graphs (AIGs) produced from 1500 synthesis runs plus labels such as the optimized node counts, and de-lay. We define a generic learning problem on this dataset and benchmark existing solutions for it. The codes related to dataset creation and benchmark models are available athttps://github.com/NYU-MLDA/OpenABC.git.

Files

OPENABC2_DATASET.zip

Files (18.6 GB)

Name Size Download all
md5:7df8caf5bf830fc3c1e56ad6a1e79cd0
18.6 GB Preview Download
md5:1331a44c8d1fc59e6c527e164ed7339a
13.0 kB Preview Download

Additional details

Related works

Is documented by
Dataset: 10.48550/ARXIV.2110.11292 (DOI)