Reusability of data with complex semantic structure
- 1. MARUM Center for Marine Environmental Science | Bremen University
- 2. PANGAEA
Description
Data on the occurrence and abundance of fossils provide invaluable insights into past climate and biodiversity change. However, lack of common taxonomic standards and associated vocabularies, limit reusability of fossil data and thus global assessments. Inconsistent and variable taxonomy are a common challenge faced in biodiversity research using species occurrence data. This pilot aimed to resolve those semantic barriers for the example of planktonic foraminifera. We designed and developed an R workflow that applies the resolved semantics on legacy data stored in PANGAEA while making use of WoRMS (World Register of Marine Species). Furthermore, we provide community guidelines for new data submissions of species abundance data to generate sustainable ways of combining legacy and new data. As the pilot is closely linked to PANGAEA, we expect that many users will benefit from our workflows and best practice solutions. Since heterogeneous data structures and inadequate ontology support are a common problem for many other geoscientific and biodiversity research communities, we hope that our approach can be transferred on different types of long-tail data.
Notes
Files
NFDI4Earth_Pilot_Roadmap_ComplexSymantic_2023_final.pdf
Files
(1.6 MB)
Name | Size | Download all |
---|---|---|
md5:20ce0b5d2b4924114c155ead76513e2c
|
1.6 MB | Preview Download |
Additional details
Related works
- Is supplemented by
- Poster: 10.5281/zenodo.8123959 (DOI)
- Poster: 10.5281/zenodo.8123935 (DOI)
- Software: 10.5281/zenodo.8124240 (DOI)