Preprint Open Access

Data Quality Issues in Current Nanopublications

Imran Asif; Jessica Chen-Burger; Alasdair J G Gray

Nanopublications are a granular way of publishing scientific claims together with their associated provenance and publication information. More than 10 million nanopublications have been published by a handful of researchers covering a wide range of topics within the life sciences. We were motivated to replicate an existing analysis of these nanopublications, but then went deeper into the structure of the existing nanopublications. In this paper, we analyse the usage of nanopublications by investigating the distribution of triples in each part and discuss the data quality issues that were subsequently revealed. We argue that there is a need for the community to develop a set of guidelines for the modelling of nanopublications.

Files (268.7 kB)
Name Size
268.7 kB Download
All versions This version
Views 14585
Downloads 11564
Data volume 30.4 MB17.2 MB
Unique views 13180
Unique downloads 10562


Cite as