DBLP Article Similarities (DBLP-ArtSim) dataset
Creators
- 1. Athena Research Center
- 2. Univ. of the Peloponnese
Description
This dataset contains similarity scores among articles in AMiner's DBLP v10 dataset.
Similarities are calculated using the JoinSim [1] similarity measure on the derived citation network using the following metapaths:
- Paper - Author - Paper (PAP_similarities.csv)
- Paper - Topic - Paper (PTP_similarities.csv)
The file ids.csv contains a mapping from AMiner's ids to our internal numeric ids used in the similarities files.
[1] Xiong, Y., Zhu, Y., Yu, P.S.: Top-k similarity join in heterogeneous information networks. IEEE Transactions on Knowledge and Data Engineering 27(6), 1710– 1723 (2015)