There is a newer version of this record available.

Journal article Open Access

Representing COVID-19 information in collaborative knowledge graphs: a study of Wikidata

Houcemeddine Turki; Mohamed Ali Hadj Taieb; Thomas Shafee; Tiago Lubiana; Dariusz Jemielniak; Mohamed Ben Aouicha; Jose Emilio Labra Gayo; Mus'ab Banat; Diptanshu Das; Daniel Mietchen

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Houcemeddine Turki</dc:creator>
  <dc:creator>Mohamed Ali Hadj Taieb</dc:creator>
  <dc:creator>Thomas Shafee</dc:creator>
  <dc:creator>Tiago Lubiana</dc:creator>
  <dc:creator>Dariusz Jemielniak</dc:creator>
  <dc:creator>Mohamed Ben Aouicha</dc:creator>
  <dc:creator>Jose Emilio Labra Gayo</dc:creator>
  <dc:creator>Mus'ab Banat</dc:creator>
  <dc:creator>Diptanshu Das</dc:creator>
  <dc:creator>Daniel Mietchen</dc:creator>
  <dc:description>Information related to the COVID-19 pandemic ranges from biological to bibliographic and from geographical to genetic. Wikidata is a vast interdisciplinary, multilingual, open collaborative knowledge base of more than 88 million entities connected by well over a billion relationships and is consequently a web-scale platform for broader computer-supported cooperative work and linked open data. Here, we introduce four aspects of Wikidata that make it an ideal knowledge base for information on the COVID-19 pandemic: its flexible data model, its multilingual features, its alignment to multiple external databases, and its multidisciplinary organization. The structure of the raw data is highly complex, so converting it to meaningful insight requires extraction and visualization, the global crowdsourcing of which adds both additional challenges and opportunities. The created knowledge graph for COVID-19 in Wikidata can be visualized, explored and analyzed in near real time by specialists, automated tools and the public, for decision support as well as educational and scholarly research purposes via SPARQL, a semantic query language used to retrieve and process information from databases saved in Resource Description Framework (RDF) format.</dc:description>
  <dc:subject>Public health surveillance</dc:subject>
  <dc:subject>Knowledge graph</dc:subject>
  <dc:subject>Community curation</dc:subject>
  <dc:subject>FAIR data</dc:subject>
  <dc:subject>Linked Open Data</dc:subject>
  <dc:title>Representing COVID-19 information in collaborative knowledge graphs: a study of Wikidata</dc:title>
All versions This version
Views 3,576437
Downloads 4,6293,384
Data volume 15.3 GB10.2 GB
Unique views 3,212418
Unique downloads 3,4232,543


Cite as