Distributed Research Data Knowledge Graphs
Description
The emerging ecosystem of distributed research data knowledge graphs (KGs) within the NFDI presents new opportunities—but also profound challenges—for semantic interoperability and federated querying. This contribution explores these challenges through the lens of the Wikiverse (Wikibase, Wikidata) and OpenStreetMap (OSM) as community-driven infrastructures for managing Linked Open Data (LOD), with a focus on their potential as hubs for semantic integration across heterogeneous data domains. NFDI consortia representing diverse research fields, e.g. NFDI4Objects, NFDI4Culture, BERD@NFDI, NFDI4Society, MaRDI, and the cross-cutting Base4NFDI—are increasingly investing in distributed KG solutions tailored to their respective research communities. However, diverse modelling paradigms (e.g. RDF vs. LPG), ontological frameworks (e.g. CIDOC CRM vs. BFO or schema.org), and terminological conventions remain significant barriers to integration. The Basic Services TS4NFDI and KGI4NFDI address these gaps by offering infrastructure components for terminology alignment and KG federation. This paper contributes to this discourse by evaluating how community-maintained infrastructures—especially Wikibase instances and Wikidata—can support harmonisation efforts and serve as semantic reference points across consortia while raising new issues and challenges in federating the queries. Drawing on current collaborative experiences from the Knowledge Graph Ecosystem within the NFDI, we examine how federated SPARQL queries can be (and cannot be) executed across distributed instances. Federated querying remains technically complex due to inconsistent endpoint configurations, naming conventions, and ontology alignments. These issues are especially pressing in projects using custom Wikibase installations, such as FactGrid, fuzzy-sl, and Semantic Kompakkt. Wikidata is an important semantic hub for external identifiers, cross-linking with OSM, terminologies, and LOD cloud resources. Its SPARQL endpoint, collaborative curation model, and breadth of scope make it an invaluable resource for interoperable queries across domains. The contribution highlights how these concepts apply in real-world settings through use cases from NFDI4Objects, NFDI4Culture and MaRDI: (1) Linked Open Ogham, cf. CIIC 81; (2) Linked Open Samian Ware integrating findspots across LOD repositories; (3) Babylonian mathematics on archaeological objects, e.g., Si.427, K08538 or MS 3042; (4) the Stolpersteine Project explores crowd-sourced volunteered geographic information (VGI) integration; (5) cuneiform tablets made available as 3D scan, LOD in Wikidata and FactGrid. Each example illustrates technical and conceptual hurdles for data federation, such as property mappings, namespace collisions, and endpoint restrictions. The contribution reflects the sustainability and scalability of current approaches, outlining key lessons learned from recent federation workshops and NFDI-wide coordination efforts. The most promising developments are infrastructure synergies, including services from Base4NFDI, improved query tooling using FAIRification Tools—e.g., SPARQLing Unicorn— and emerging AI-supported ontology alignment and SPARQL query generation methods. By examining the structural complexities and design patterns of distributed semantic infrastructures, we argue that Wikidata and the broader Wikibase ecosystem can play a vital role in bridging disciplinary silos—if adequately supported through community-driven service development and harmonisation efforts.
Files
CoRDI_2025_Aachen_DistributedKnowledgeGraphs.pdf
Files
(9.8 MB)
Name | Size | Download all |
---|---|---|
md5:a24171466e6bcafcbb4d7aa1db120d6f
|
9.8 MB | Preview Download |
Additional details
Additional titles
- Subtitle (En)
- Challenges of federated queries using the Wikiverse and OpenStreetMap within the NFDI Knowledge Graph Ecosystem
Related works
- Is supplement to
- Conference proceeding: 10.5281/zenodo.16736047 (DOI)