Other Open Access

S21. Hic sunt dracones – Improving knowledge exchange in the Semantic Web with Linked Open and FAIR data (Standard)

Florian Thiery; Martina Trognitz; Ethan Gruber

In historical maps, the phrase `Hic sunt dracones’ (‘here be dragons’) is used to describe areas which were unknown to the map creator [UW19]. Today the WWW gives researchers the possibility of sharing their research (data) and enables the community to participate in the scientific discourse to create previously unknown knowledge. But much of these shared data are not findable or accessible, thus resulting in modern ‘unknown data dragons’. Often these ‘data dragons’ lack connections to other datasets, i.e. they are not interoperable and in some cases even lack usefulness or usability. To overcome these shortcomings, a set of techniques, standards and recommendations can be used: Semantic Web and Linked Open Data, the FAIR principles and LOUD data.

Tim Berners-Lee introduced the concept of `Semantic Web’, wherein he suggested using the ideas of Open Data, semantically described resources and links, as well as usable (machine readable) interfaces and applications for creating a `Giant Global Graph’. In 2016 the FAIR principles were introduced [MW16]: research data and their metadata have to be Findable, Accessible, Interoperable, and Reusable. Linked Data is an essential part of the FAIR principles: “The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. [TBL06].” Publishing research data as HTTP URIs with RDF content containing links to other resources makes data FAIR.

On top of that, these data should be `open’ for access, re-use and universal participation [ODH19]. A five star rating system of openness [MH12] was introduced to rate Linked Data, i.e. “Linked Open Data (LOD) is Linked Data which is released under an open licence. [TBL06].” Furthermore, LOD have to be `usable’ for scientists and programmers to take advantage of all the LOD power. Following the LOUD principles[RS18] will make LOD even more FAIR.

Merging all these principles to create FAIR and LOUD research data results in the `Sphere 7 Data Model’ [FT19], which enables a wide array of digital humanities and archaeological (web-)applications using LOUD and FAIR data.

The Linked Data Cloud already offers research data repositories for certain archaeological and humanities domains. Popular examples of FAIR LOUD providers are: Nomisma.org [EG18], Kerameikos.org [GS15], Pelagios [ISBS14], OpenContext [EC07], Portable Antiquities Scheme [EH18], ARIADNE [AN17] and there are more to come, e.g. NAVIS [TM18b], ARS3D [TKR19] and ARIADNEplus [AP19].

The development of more and more repositories poses challenges in handling the complex facets of data quality and completeness. This is especially valid for archaeological data, which are based on a complicated network of concepts from different knowledge domains. Moreover, it is necessary to include means of conveying knowledge about uncertainty in the data models to produce and publish transparent FAIR and LOUD data that can also describe specific stratigraphies or the (archaeological) context of objects. In order to be able to connect different data resources, exchange standards also have to be developed,  published, and applied.

To enable non-experts to engage with FAIR, and especially LOUD data, small tools – `minions’ – were created for different purposes, such as modelling a relative chronology (Alligator [DS18]), modelling and reasoning on vague edges in graph data (Academic Meta Tool [TM18]), creating annotated texts and images (Recogito [SBIS17]), and creating controlled vocabularies (Labeling System [TE16]). Furthermore, Wikidata [EGKMV14] not only offers community-driven data, but also provides a vast set of tools for using and interacting with it.

The goal of our session is to bring together experts on LOD and FAIR data, as well as anybody interested in learning about FAIR and LOUD data-driven publishing, applications and research projects based on this kind of data. We would like to discuss ideas for FAIR and LOUD data models as a basis for reproducible research and exchange in the Semantic Web.

This session is intended as a starting point for a CAA SIG on `Semantics and LOUD in Archaeology’. The core aim of this SIG is to use the CAA’s SIG format to raise awareness for Linked Data in archaeology by creating a friendly and open platform to discuss the role of LOUD and FAIR Data in archaeology, and to enable the CAA community to learn about  LOD basics. If you wish to join the SIG, feel free to contact us to be an active part of the discussion [SIG19] and help us to navigate archaeology away from the data dragons.

The success of the sessions on data quality in Linked Data at CAA 2017 and 2018 has raised awareness of the many challenges related to FAIR and LOUD data, and encourages pursuing the debate.

For this session we invite contributions that address part or all of the following issues:

application of semantic web technologies, such as ontologies or RDF, to the archaeological domain
modelling archaeological artefacts as FAIR and LOUD data
modelling archaeological context, including the specificity of stratigraphy, uncertainty, and vagueness as FAIR and LOUD data
proposals for FAIR and LOUD data exchange standards
development of research tools producing or using FAIR and LOUD data
identifying sources and dangers of incorrect or ambiguous LOD
identifying duplicates across different LOD sources
keeping track of the provenance of data as a means of solving errors and identifying their source
setting up methodologies and tools in order to label or assess datasets based on their quality
dealing with ambiguities resulting from multiple links in the LOD cloud
computer vision or machine learning applications built upon controlled, semantic data
We encourage presenters to derive the problems from real-world datasets and to formulate proposals for solutions, preferably demonstrating (prototypes of) realised data-driven web applications. Since we target a broad and diverse audience because of the thematic relevance, the challenges described should also be integrated into their archaeological context (excavation, museum, archive, etc.).

References
[AN17] Aloia, N, Binding, C, Cuy, S, Doerr, M, Fanini, B, Felicetti, A, Fihn, J, Gavrilis, D, Geser, G, Hollander, H, Meghini, C, Niccolucci, F, Nurra, F, Papatheodorou, C, Richards, J, Ronzino, P, Scopigno, R, Theodoridou, M, Tudhope, D, Vlachidis, A, and Wright, H, 2017 Enabling European Archaeological Research: The ARIADNE E-Infrastructure, Internet Archaeology 43. DOI: https://doi.org/10.11141/ia.43.11

[AP19] ARIADNEplus 2019 ARIADNEplus: Advanced Research Infrastructure for Archaeological Data Networking in Europe Available at https://ariadne-infrastructure.eu/wp-content/uploads/2019/02/ariadne-plus_A4_researcher_web.pdf [Last accessed 18 July 2019]

[EC07] Kansa, E C 2007 Publishing Primary Data on the World Wide Web: Opencontext.org and an Open Future for the Past. Technical Briefs in Historical Archaeology, 2(1):1-11. Availalbe at http://www.sha.org/publications/technical_briefs/volume02/article_01.pdf [Last accessed 18 July 2019]

[EG18] Gruber, E 2018 Linked Open Data for Numismatic Library, Archive and Museum Integration. In: Matsumoto, M. and Uleberg, E. (eds.) CAA2016. Oceans of Data. Proceedings of the 44th Conference on Computer Applications and Quantitative Methods in Archaeology. 2018 Oxford: Archaeopress. pp. 55–62.

[EGKMV14] Erxleben, F, Günther, M, Krötzsch, M, Mendez, J and Vrandečić, D 2014 Introducing Wikidata to the Linked Data Web. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., and Goble, C. (eds.) The Semantic Web – ISWC 2014. Cham: Springer International Publishing. pp. 50–65. DOI: https://doi.org/10.1007/978-3-319-11964-9_4.

[EH18] Harper, E 2018 Toys and the Portable Antiquities Scheme: A Source for Exploring Later Medieval Childhood in England and Wales, Childhood in the Past, 11(2): 85–99. DOI: https://doi.org/10.1080/17585716.2017.1348647.

[FT19] Thiery, F 2019 Sphere 7 Data: LOUD and FAIR Data for the Research Community DOI: https://doi.org/10.5281/zenodo.2643469.

[GS15] Gruber, E and Smith, T J 2015 Linked Open Greek Pottery. In: Giligny, F., Costa, L., Moscati, P., and Robert, S. (eds.) CAA2014. 21st Century Archaeology. Concepts, methods and tools. Proceedings of the 42nd Annual Conference on Computer Applications and Quantitative Methods in Archaeology. 2015 Oxford: Archaeopress. pp. 205–214.

[ISBS14] Isaksen, L, Simon, R, Barker, E T E and de Soto Cañamares, P 2014 Pelagios and the emerging graph of ancient world data. In: Proceedings of the 2014 ACM conference on Web science – WebSci ’14. 2014 Bloomington, Indiana, USA: ACM Press. pp. 197–201. DOI: https://doi.org/10.1145/2615569.2615693.

[MH12] Hausenblas, M 2012 5-star OPEN DATA. Available at https://5stardata.info/en/ [Last accessed 18 July 2019].

[MW16] Wilkinson, M D, Dumontier, M, Aalbersberg, Ij J, Appleton, G, Axton, M, Baak, A, Blomberg, N, Boiten, J-W, da Silva Santos, L B, Bourne, P E, Bouwman, J, Brookes, A J, Clark, T, Crosas, M, Dillo, I, Dumon, O, Edmunds, S, Evelo, C T, Finkers, R, Gonzalez-Beltran, A, Gray, A J G, Groth, P, Goble, C, Grethe, J S, Heringa, J, ’t Hoen, P A ., Hooft, R, Kuhn, T, Kok, R, Kok, J, Lusher, S J, Martone, M E, Mons, A, Packer, A L, Persson, B, Rocca-Serra, P, Roos, M, van Schaik, R, Sansone, S-A, Schultes, E, Sengstag, T, Slater, T, Strawn, G, Swertz, M A, Thompson, M, van der Lei, J, van Mulligen, E, Velterop, J, Waagmeester, A, Wittenburg, P, Wolstencroft, K, Zhao, J and Mons, B 2016 The FAIR Guiding Principles for scientific data management and stewardship, Scientific Data, 3: 160018.

[ODH19] Open Data Handbook 2019 What is Open Data?. Available at http://opendatahandbook.org/guide/en/what-is-open-data/ [Last accessed 18 July 2019].

[RS18] Sanderson, R 2019 LOUD: Linked Open Usable Data. 28 May 2019. Available at https://linked.art/loud/ [Last accessed 18 July 2019].

[SBIS17] Simon, R, Barker, E, Isaksen, L and De Soto Cañamares, P 2017 Linked Data Annotation Without the Pointy Brackets: Introducing Recogito 2, Journal of Map & Geography Libraries, 13(1): 111–132. DOI: https://doi.org/10.1080/15420353.2017.1307303.

[SIG19] Thiery, F, Trognitz, M, Gruber, E, Tolle, K, Wigg-Wolf, D 2019 CAA SIG on "Semantics and LOUD in Archaeology" DOI: https://doi.org/10.5281/zenodo.3337979.
[STMS18] Seidensticker, D, Thiery, F, Mees, A and Schmid, C 2018 Rdf Based Modelling Of Relative And Absolute Chronological Data: Examples From The Central African Rainforest And Roman Periodisation DOI: https://doi.org/10.5281/zenodo.1410516.

[TBL06] Berners-Lee, T 2006 Linked Data. 27 July 2006. Available at https://www.w3.org/DesignIssues/LinkedData.html [Last accessed 18 July 2019].

[TE16] Thiery, F and Engel, T 2016 The Labeling System: The Labelling System: A Bottom-up Approach for Enriched Vocabularies in the Humanities. In: Campana, S., Scopigno, R., Carpentiero, G., and Cirillo, M. (eds.) CAA2015. Keep the Revolution Going. Proceedings of the 43rd Annual Conference on Computer Applications and Quantitative Methods in Archaeology. 2016 Oxford: Archaeopress. pp. 259–268.

[TKR19] Thiery, F, Karmacharya, A and Rokohl, L 2019 ARS3D – Documenting facts and interpretations of African Red Slip Ware DOI: https://doi.org/10.5281/zenodo.2648210.

[TM18] Thiery, F and Mees, A 2018 Taming the chronology of South Gaulish Samian found at Hadrian’s Wall and the German Limes using Linked Open Data DOI: https://doi.org/10.5281/zenodo.1469298.

[TM18b] Thiery, F and Mees, A 2018 Taming Ambiguity – Dealing With Doubts In Archaeological Datasets Using Lod DOI: https://doi.org/10.5281/zenodo.1200111.

[UW19] Wuttke, U 2019 “Here be dragons”: Open Access to Research Data in the Humanities. ulrikewuttke. Available at https://ulrikewuttke.wordpress.com/2019/04/09/open-data-humanities/ [Last accessed 17 April 2019].

Files (2.9 MB)
Name Size
CfP_CAA2019_S21.pdf
md5:bac856818e3ecc6cd32ad07139e5b722
2.9 MB Download
26
7
views
downloads
All versions This version
Views 2626
Downloads 77
Data volume 20.4 MB20.4 MB
Unique views 2323
Unique downloads 77

Share

Cite as