There is a newer version of the record available.

Published November 19, 2023 | Version v0.0
Dataset Open

NDC-substances

  • 1. CENTAI Institute

Description

Overview

This is a temporal higher-order network dataset, which here means a sequence of timestamped simplices where each simplex is a set of nodes. Under the Drug Listing Act of 1972, the U.S. Food and Drug Administration releases information on all commercial drugs going through the regulation of the agency, forming the National Drug Code (NDC) Directory. In this dataset, each hyperedge corresponds to an NDC code for a drug, and the nodes are substances that make up the drug. Timestamps are in days and represent when the drug was first marketed. We restricted to hyperedges containing at most 25 nodes.

Statistics

  • Number of nodes: 5,311
  • Number of timestamped hyperedges: 112,405
  • Number of unique hyperedges: 10,025

Source of original data

Source: NDC-substances.

References

If you use this data, please cite the following paper: 

Files

NDC-substances.json

Files (7.4 MB)

Name Size Download all
md5:724d5af1476e9f4647302134221b5067
7.4 MB Preview Download