Dataset Open Access
Thanasis Vergoulis;
Ilias Kanellos;
Serafeim Chatzopoulos;
Danae Pla Karidi;
Theodore Dalamagas
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"> <identifier identifierType="DOI">10.5281/zenodo.3723282</identifier> <creators> <creator> <creatorName>Thanasis Vergoulis</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-0555-4128</nameIdentifier> <affiliation>Athena Research Center</affiliation> </creator> <creator> <creatorName>Ilias Kanellos</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-2146-3795</nameIdentifier> <affiliation>Athena Research Center</affiliation> </creator> <creator> <creatorName>Serafeim Chatzopoulos</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1714-5225</nameIdentifier> <affiliation>Athena Research Center</affiliation> </creator> <creator> <creatorName>Danae Pla Karidi</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-3154-6212</nameIdentifier> <affiliation>Athena Research Center</affiliation> </creator> <creator> <creatorName>Theodore Dalamagas</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-5002-7901</nameIdentifier> <affiliation>Athena Research Center</affiliation> </creator> </creators> <titles> <title>BIP4COVID19: Impact metrics and indicators for coronavirus related publications</title> </titles> <publisher>Zenodo</publisher> <publicationYear>2020</publicationYear> <subjects> <subject>COVID-19</subject> <subject>coronavirus</subject> <subject>scientometrics</subject> <subject>bibliometrics</subject> </subjects> <dates> <date dateType="Issued">2020-03-21</date> </dates> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/3723282</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.3723281</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/covid-19</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/zenodo</relatedIdentifier> </relatedIdentifiers> <version>0.1</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p>This dataset contains impact metrics and indicators for a set of publications that are related to the <a href="https://en.wikipedia.org/wiki/Coronavirus_disease_2019">COVID-19 infectious disease</a> and the coronavirus that causes it. It is based on the <a href="https://pages.semanticscholar.org/coronavirus-research">CORD-19 dataset</a> released by the team of <a href="https://www.semanticscholar.org/">Semantic Scholar</a> in response to the relevant, ongoing pandemic:</p> <blockquote> <p>COVID-19 Open Research Dataset (CORD-19). 2020. Version 2020-03-13. Retrieved from https://pages.semanticscholar.org/coronavirus-research. Accessed 2020-03-18. doi:10.5281/zenodo.3715506</p> </blockquote> <p>These data have been cleaned and integrated with data from other sources (e.g., PMC). The result was a subset of the COVID-19 dataset (23,222 unique articles). We constructed the underlying citation network and utilized it to produce, for each article, the values of the following impact measures, using the <em>PaperRanking</em> (<a href="https://github.com/diwis/PaperRanking">https://github.com/diwis/PaperRanking</a>) library<sup>1</sup>:</p> <ul> <li><em><strong>Citation-based influence</strong></em>: This is based on the PageRank<sup>2</sup> network analysis method. In the context of citation networks, it estimates the importance of each article based on its centrality in the network. Since it considers the whole network, it is an indicator of the impact in the <em>long term</em>.&nbsp;</li> <li><em><strong>Citation-based popularity</strong></em>: This is based on the RAM<sup>3</sup> citation network analysis method. Methods like PageRank are biased against recently published articles (new articles need time to receive their first citations). RAM alleviates this problem using an approach known as &quot;time-awareness&quot;. This is why it is more suitable to capture the impact of a publication in the <em>short term</em>.</li> </ul> <p>The work is based on the following publications:</p> <blockquote> <ol> <li>I. Kanellos, T. Vergoulis, D. Sacharidis, T. Dalamagas, Y. Vassiliou: Impact-Based Ranking of Scientific Publications: A Survey and Experimental Evaluation. TKDE 2019</li> <li>Rumi Ghosh, Tsung-Ting Kuo, Chun-Nan Hsu, Shou-De Lin, and Kristina Lerman. 2011. Time-Aware Ranking in Dynamic Citation Networks. In Data Mining Workshops (ICDMW). 373&ndash;380</li> <li>R. Motwani L. Page, S. Brin and T. Winograd. 1999. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report. Stanford InfoLab.</li> </ol> </blockquote> <p><em><strong>Terms of use:</strong></em> These data are provided &quot;as is&quot;, without any warranties of any kind. The data are provided under the Creative Commons Attribution 4.0 International license.</p></description> <description descriptionType="Other">We acknowledge support of this work by the project "Moving from Big Data Management to Data Science" (MIS 5002437/3) which is implemented under the Action "Reinforcement of the Research and Innovation Infrastructure", funded by the Operational Programme "Competitiveness, Entrepreneurship and Innovation" (NSRF 2014-2020) and co-financed by Greece and the European Union (European Regional Development Fund).</description> <description descriptionType="Other">{"references": ["COVID-19 Open Research Dataset (CORD-19). 2020. Version 2020-03-13. Retrieved from https://pages.semanticscholar.org/coronavirus-research. Accessed 2020-03-18. doi:10.5281/zenodo.3715506", "I. Kanellos, T. Vergoulis, D. Sacharidis, T. Dalamagas, Y. Vassiliou: Impact-Based Ranking of Scientific Publications: A Survey and Experimental Evaluation. TKDE 2019", "Rumi Ghosh, Tsung-Ting Kuo, Chun-Nan Hsu, Shou-De Lin, and Kristina Lerman. 2011. Time-Aware Ranking in Dynamic Citation Networks. In Data Mining Workshops (ICDMW). 373\u2013380", "R. Motwani L. Page, S. Brin and T. Winograd. 1999. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report. Stanford InfoLab."]}</description> </descriptions> </resource>
All versions | This version | |
---|---|---|
Views | 323,745 | 1,443 |
Downloads | 45,005 | 535 |
Data volume | 1.4 TB | 729.0 MB |
Unique views | 304,719 | 1,325 |
Unique downloads | 28,942 | 480 |