Published June 9, 2026 | Version v0.0

physics-cocitations

  • 1. Tokyo Metropolitan University

Description

Summary:
The physics-cocitations dataset is a co-citation hypergraph of highly cited Physics and Astronomy papers, constructed from the OpenAlex Snapshot (2024-09-27; https://developers.openalex.org/download/snapshot-format).

Nodes and Hyperedges:
Nodes represent highly cited Physics and Astronomy papers. For each Physics and Astronomy subfield, papers were ranked by citation count, and those accounting for the top 10% of cumulative citations were selected as nodes (minimum 100 papers per subfield), yielding 5,347 nodes in total.
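The selection rule above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes that "top 10% of cumulative citations" means taking papers in descending citation order until their running citation total reaches 10% of the subfield total, and that the 100-paper minimum is applied as a floor afterwards. The function name `select_top_papers` and the `(paper_id, subfield, citation_count)` input layout are hypothetical.

```python
from collections import defaultdict


def select_top_papers(papers, share=0.10, min_papers=100):
    """Pick highly cited papers per subfield.

    papers: iterable of (paper_id, subfield, citation_count) tuples
            (a hypothetical layout, not the OpenAlex schema).
    Returns a dict mapping each subfield to its selected paper IDs.
    """
    by_subfield = defaultdict(list)
    for paper_id, subfield, cites in papers:
        by_subfield[subfield].append((paper_id, cites))

    selected = {}
    for subfield, items in by_subfield.items():
        # Rank by citation count, most cited first.
        items.sort(key=lambda item: item[1], reverse=True)
        total = sum(cites for _, cites in items)

        chosen, running = [], 0
        for paper_id, cites in items:
            # Stop once the citation share is covered AND the floor is met
            # (assumed interpretation of the rule in the description).
            if running >= share * total and len(chosen) >= min_papers:
                break
            chosen.append(paper_id)
            running += cites
        selected[subfield] = chosen
    return selected
```

Under this reading, a subfield with fewer papers than the floor simply contributes all of its papers.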

Hyperedges represent co-citation relationships. For each paper in the OpenAlex corpus (excluding the selected top papers), the subset of top papers it cites was identified. If the same subset appeared in at least 3 citing papers, it was included as a hyperedge, yielding 42,535 hyperedges in total.
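The hyperedge construction above can be sketched as a counting pass over citing papers. This is an illustrative reading rather than the authors' pipeline: the function name `build_hyperedges`, the `citations` mapping, and the `min_size` parameter are hypothetical. The description does not state a minimum hyperedge size, so the sketch only drops empty subsets by default.

```python
from collections import Counter


def build_hyperedges(citations, top_papers, min_support=3, min_size=1):
    """Build co-citation hyperedges.

    citations:  dict mapping a citing paper ID to the IDs it cites
                (hypothetical input layout).
    top_papers: iterable of selected node IDs.
    A subset of top papers becomes a hyperedge when at least
    `min_support` citing papers cite exactly that subset.
    """
    top = set(top_papers)
    counts = Counter()
    for citing, refs in citations.items():
        if citing in top:
            continue  # citing papers that are themselves nodes are excluded
        subset = frozenset(refs) & top  # top papers cited by this paper
        if len(subset) >= min_size:
            counts[subset] += 1
    return [subset for subset, n in counts.items() if n >= min_support]
```

Using `frozenset` as the counter key makes the match exact: two citing papers support the same hyperedge only if they cite an identical set of top papers.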

Each node carries the following attributes: OpenAlex work ID, paper title, publication date, primary topic, subfield, field, domain, and citation count.
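For readers who want a typed container for these attributes, one possible in-memory record is sketched below. The class name `PaperNode` and all field names are assumptions made for illustration. The key names actually used in the JSON file are not documented here and should be checked against the file itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PaperNode:
    """One node of the hypergraph (field names are hypothetical)."""

    openalex_id: str       # OpenAlex work ID
    title: str             # paper title
    publication_date: str  # assumed to be kept as a date string
    primary_topic: str
    subfield: str
    field: str
    domain: str
    citation_count: int
```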

Basic statistics:
- Nodes: 5,347
- Hyperedges: 42,535

Source:
OpenAlex Snapshot (2024-09-27), https://developers.openalex.org/download/snapshot-format

Reference:
Kazuki Nakajima, Yuya Sasaki, Takeaki Uno, and Masaki Aida. (2025). Learning Multi-Order Block Structure in Higher-Order Networks. arXiv preprint arXiv:2511.21350.

Files

physics-cocitations.json
Size: 9.4 MB
md5:2f09f000319a6194b7da9bb4f8e8fbe2
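After downloading, the file can be checked against the md5 checksum listed above. The sketch below uses only Python's standard `hashlib` module. The helper names `md5_of_file` and `is_intact` are hypothetical, and the local path passed to them is whatever location the file was saved to.

```python
import hashlib

# Checksum as listed on this record for physics-cocitations.json.
EXPECTED_MD5 = "2f09f000319a6194b7da9bb4f8e8fbe2"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_intact(path, expected=EXPECTED_MD5):
    """True if the file's md5 digest matches the expected checksum."""
    return md5_of_file(path) == expected
```

For example, `is_intact("physics-cocitations.json")` should return `True` for an uncorrupted download saved under that name in the working directory.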