Wikibase as an environment for harmonisation of data about past: the example of WikiHum
Description
Adam Zapała, Tomasz Królik
Wikibase as an environment for harmonisation of data about past: the example of WikiHum
The increasing number of digital projects in the field of history and cultural heritage over the last decade has led to a situation where very extensive information about the past became accessible online. Unfortunately, most projects use their own data model and do not refer to any reference databases. For this reason, researchers who intend to utilize combined resources are often forced to merge the data by themselves. Hence, data produced by projects often do not enter into general use and are not used optimally. The solution of this problem is the creation of an infrastructure that harmonizes and combines different data sets. An environment perfectly suited for this task is Wikibase.
As part of the DARIAH.Lab project, an instance of Wikibase (called WikiHum) has been developed. First of all, it serves as an interface for the automatic delivery of permanent identifiers (each item will automatically receive a Handle.net identifier). Secondly, it enables data from different projects to be added, stored and harmonized with tools compatible with Wikibase (e.g. Open Refine). Thirdly, the infrastructure also makes data available, both through the SPARQL- endpoint, and also through plug-ins to other software (e.g. TEI Publisher). Furthermore, the database will contain not only various external identifiers, but also the most important data to capture the relations between entities and its change in time.
As part of the project, the most important Polish historical resources for places and people in the past will be added to the WikiHum database. These include resources already available online (Historical Atlas of Poland, The Historical-Geographical dictionary of the Polish Lands in the Middle Ages), as well as resources that were previously available only in printed form (Polish Biographical Dictionary, Lists of Officials of the Polish-Lithuanian Commonwealth). Ultimately, the database in question will harmonize a much larger scale of resources. The database will be expanded both with new resources related to people and places, but also with new types of entities (e.g. intellectual or artistic works as well as seals). The aim of the proposed presentation is to share the experiences gathered during the creation of WikiHum.
Diefenbach, D., Wilde, M.D., Alipio, S. (2021), Wikibase as an infrastructure for knowledge graphs: The EU knowledge graph, in: International Semantic Web Conference, Springer, pp. 631–647
Hyvönen, E. (2012), Publishing and using cultural heritage linked data on the semantic web. Synthesis lectures on the semantic web: theory and technology, 2(1), pp. 1-159
Hyvönen, E. (2022), Digital humanities on the Semantic Web: Sampo model and portal series. Semantic Web Preprint, pp. 1-16
Scholz, M., & Goerz, G. (2012). WissKI: a virtual research environment for cultural heritage, in: ECAI 2012, IOS Press, pp. Spp. 1017-1018
Simons O. (2018), GNDCon 2018. Ein Nachklapp aus Forscherperspektive, in: info 7, 33/3, pp. 43-44
Smith-Yoshimura K., Washburn B., et al. (2019), Creating library linked data with Wikibase: Lessons learned from project passage, DOI: 10.25333/faq3-ax08.
C. Shimizu, A. Elles, S. Gonzales, et al., Ontology Design Facilitating Wikibase Integration – and a Worked Example for Historical Data, arXiv:2205.14032 , DOI: https://doi.org/10.48550/arXiv.2205.14032
Files
Zapała_Królik.pdf
Files
(489.0 kB)
Name | Size | Download all |
---|---|---|
md5:ce96962875f65c531d2a04601f729f07
|
489.0 kB | Preview Download |