Similarities are calculated using the JoinSim similarity measure (Xiong et al., 2015) on the derived citation network, using the following metapaths:
Paper - Author - Paper (PAP_similarities.csv)
Paper - Topic - Paper (PTP_similarities.csv)
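As a rough illustration of what a metapath-based score looks like, the sketch below computes a JoinSim-style value for a symmetric metapath such as Paper - Author - Paper. It assumes the usual normalized path-count form, i.e. the number of metapath instances between two papers divided by the square root of the product of their self-path counts. The function name joinsim, the toy paper/author names, and the dict-based input layout are hypothetical and are not taken from the code that produced these files.

```python
from math import sqrt


def joinsim(incidence):
    """Pairwise JoinSim-style scores for a symmetric metapath (e.g. PAP).

    `incidence` maps each paper to a dict {intermediate node: edge count},
    where the intermediate node is an author (PAP) or a topic (PTP).
    This input layout is an assumption made for the example.
    """
    def path_count(x, y):
        # Number of metapath instances x -> intermediate -> y.
        a, b = incidence[x], incidence[y]
        return sum(w * b[k] for k, w in a.items() if k in b)

    papers = sorted(incidence)
    self_paths = {p: path_count(p, p) for p in papers}

    sims = {}
    for i, x in enumerate(papers):
        for y in papers[i + 1:]:
            count = path_count(x, y)
            if count:  # keep only pairs connected by at least one path
                sims[(x, y)] = count / sqrt(self_paths[x] * self_paths[y])
    return sims


# Toy data (made up): p1 and p2 share one author, p3 shares none.
toy = {
    "p1": {"alice": 1, "bob": 1},
    "p2": {"alice": 1},
    "p3": {"carol": 1},
}
pap = joinsim(toy)
```

Pairs with no connecting path are simply omitted from the result, which keeps the output sparse; whether the published files follow the same convention is not stated here.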
The file ids.csv contains a mapping from AMiner's ids to our internal numeric ids used in the similarities files.
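The following sketch shows one way to translate the internal numeric ids in a similarities file back to AMiner ids. The column layout is an assumption: ids.csv is taken to hold header-less rows of (AMiner id, internal id), and a similarities file is taken to hold header-less rows of (internal id, internal id, score), all comma-separated. Check the actual files and adjust before use. The helper names and the demo values are hypothetical.

```python
import csv
import io


def load_id_map(handle):
    """Return {internal numeric id: AMiner id}.

    Assumes header-less rows of (aminer_id, internal_id); this layout
    is a guess and should be verified against ids.csv.
    """
    return {int(internal): aminer for aminer, internal in csv.reader(handle)}


def translate(sim_handle, id_map):
    """Yield (aminer_id_1, aminer_id_2, score) for each similarity row.

    Assumes header-less rows of (internal_id_1, internal_id_2, score).
    """
    for first, second, score in csv.reader(sim_handle):
        yield id_map[int(first)], id_map[int(second)], float(score)


# In-memory stand-ins for ids.csv and PAP_similarities.csv (made-up values).
ids_demo = io.StringIO("aminer-x,0\naminer-y,1\n")
sims_demo = io.StringIO("0,1,0.5\n")
rows = list(translate(sims_demo, load_id_map(ids_demo)))
```

With the real files, the StringIO objects would be replaced by open("ids.csv", newline="") and open("PAP_similarities.csv", newline="").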
Xiong, Y., Zhu, Y., Yu, P.S.: Top-k similarity join in heterogeneous information networks. IEEE Transactions on Knowledge and Data Engineering 27(6), 1710–1723 (2015)
We acknowledge support of this work by the project "Moving from Big Data Management to Data Science" (MIS 5002437/3) which is implemented under the Action "Reinforcement of the Research and Innovation Infrastructure", funded by the Operational Programme "Competitiveness, Entrepreneurship and Innovation" (NSRF 2014-2020) and co-financed by Greece and the European Union (European Regional Development Fund).