Representing COVID-19 information in collaborative knowledge graphs: the case of Wikidata

doi:10.5281/zenodo.5544840

Published September 14, 2020 | Version v5

Preprint | Open access

1. Faculty of Medicine of Sfax, University of Sfax, Sfax, Tunisia
2. Faculty of Sciences of Sfax, University of Sfax, Sfax, Tunisia
3. La Trobe University, Melbourne, Victoria, Australia
4. Computational Systems Biology Laboratory, University of São Paulo, São Paulo, Brazil
5. Department of Management in Networked and Digital Societies, Kozminski University, Warsaw, Poland
6. Web Semantics Oviedo (WESO) Research Group, University of Oviedo, Spain
7. Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill, CB #3270, Davie Hall, Chapel Hill, NC 27599-3270, United States of America
8. Faculty of Medicine, Hashemite University, Zarqa, Jordan
9. Institute of Child Health (ICH), Kolkata, India
10. School of Data Science, University of Virginia, Charlottesville, Virginia, United States of America

Information related to the COVID-19 pandemic ranges from biological to bibliographic, from geographical to genetic and beyond. The structure of the raw data is highly complex, so converting it into meaningful insight requires data curation, integration, extraction and visualization; crowdsourcing these tasks globally brings both additional challenges and opportunities. Wikidata is an interdisciplinary, multilingual, open collaborative knowledge base of more than 90 million entities connected by well over a billion relationships. It acts as a web-scale platform for broader computer-supported cooperative work and linked open data, since it can be written to and queried in multiple ways in near real time by specialists, automated tools and the public. Its main query language, SPARQL, is a semantic language for retrieving and processing information from databases stored in Resource Description Framework (RDF) format.
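To make the SPARQL and RDF description above more concrete, here is a minimal Python sketch of how a query against Wikidata could be prepared and how a result could be read. It is an illustration under stated assumptions, not the authors' method: the helper names (build_query, build_url, parse_bindings) are hypothetical, the item identifier Q84263196 is assumed to be the Wikidata item for COVID-19, and the sample response is hand-written placeholder data rather than real query output. No network request is made.

```python
import json
from urllib.parse import urlencode

# Public endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def build_query(item_id: str, limit: int = 10) -> str:
    """Return a SPARQL query listing RDF triples whose subject is the item."""
    return (
        "SELECT ?property ?value WHERE {\n"
        f"  wd:{item_id} ?property ?value .\n"
        "}\n"
        f"LIMIT {limit}"
    )


def build_url(query: str) -> str:
    """Encode the query as a GET URL that asks for JSON results."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


def parse_bindings(response_text: str) -> list:
    """Flatten a SPARQL JSON result into a list of {variable: value} rows."""
    data = json.loads(response_text)
    return [
        {name: cell["value"] for name, cell in row.items()}
        for row in data["results"]["bindings"]
    ]


# Hand-written placeholder in the SPARQL JSON results layout (not real data).
SAMPLE_RESPONSE = json.dumps({
    "head": {"vars": ["property", "value"]},
    "results": {"bindings": [
        {
            "property": {"type": "uri", "value": "http://example.org/prop"},
            "value": {"type": "uri", "value": "http://example.org/value"},
        }
    ]},
})

# Q84263196 is assumed here to be the Wikidata item for COVID-19.
query = build_query("Q84263196", limit=5)
print(build_url(query))
print(parse_bindings(SAMPLE_RESPONSE))
```

The query treats each Wikidata statement as an RDF triple (subject, property, value), which is why a single triple pattern is enough to list what is recorded about an item. Fetching the printed URL with any HTTP client should return JSON in the same layout as the placeholder, which parse_bindings can then flatten.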

Here, we introduce four aspects of Wikidata that enable it to serve as a knowledge base for general information on the COVID-19 pandemic: its flexible data model, its multilingual features, its alignment to multiple external databases, and its multidisciplinary organization. The rich knowledge graph created for COVID-19 in Wikidata can be visualized, explored and analyzed for purposes such as decision support as well as educational and scholarly research.

Notes

To cite the work: Turki, H., Hadj Taieb, M. A., Shafee, T., Lubiana, T., Jemielniak, D., Ben Aouicha, M., Labra Gayo, J. E., Youngstrom, E. A., Banat, M., Das, D., & Mietchen, D. (2022). Representing COVID-19 information in collaborative knowledge graphs: The case of Wikidata. Semantic Web. doi:10.3233/SW-210444.

Files (15.1 MB)

Name	MD5	Size
Turki-et-al-2021-Representing-COVID-19-information-in-collaborative-knowledge-graphs-the-case-of-Wikidata.docx	5f81afdf95b32bffc7df03c63f9f9dad	5.0 MB
Turki-et-al-2021-Representing-COVID-19-information-in-collaborative-knowledge-graphs-the-case-of-Wikidata.odt	976aac69f7e1bc9a3a0d84ff09229411	5.0 MB
Turki-et-al-2021-Representing-COVID-19-information-in-collaborative-knowledge-graphs-the-case-of-Wikidata.pdf	e514f00c896438a28166e629fae73a15	5.1 MB

Additional details

Is identical to: Journal article: 10.3233/SW-210444 (DOI)

	All versions	This version
Views	4,507	742
Downloads	3,948	181
Data volume	17.9 GB	956.9 MB
