From Observations to Interactions: iNaturalist and the Expanding Landscape of Biodiversity Data Sharing with DwC-DP
Authors/Creators
- 1. iNaturalist, San Rafael, United States of America
- 2. Rauthiflor LLC, Bariloche, Argentina
Description
iNaturalist hosts over 250 million biodiversity occurrence records representing more than 500,000 unique taxa—making it one of the largest platforms for community-contributed biodiversity data. Since 2012, iNaturalist has used the Darwin Core Archive (DwC-A, GBIF 2021) format to share its research-grade observations (occurrence records) with the Global Biodiversity Information Facility (GBIF), as well as more specialized information such as interactions with other partners such as Global Biotic Interactions (GloBI). However, there are some limitations to this format. For example, only one person is credited for the identification (there are always at least 2 people involved); interactions are captured in resourceRelationship, which is not specific to interactions; and many additional iNaturalist annotations are mapped to dynamicProperies (e.g., leaf phenology).
The presentation from which this extended abstract derives (Suppl. material 1) demonstrates how the Darwin Core Data Package (DwC-DP) offers an opportunity for more specialized and extensive data sharing than the DwC-A. This flexible and extensible format allows iNaturalist to represent not just Occurrences and their associated Media (images, sounds), but also relationships between those Occurrences, e.g., OrganismInteractions such as predation, pollination, parasitism, and other vital ecological connections.
As a demonstration*1, a subset of iNaturalist data focused on plant-pollinator interactions was mapped to the DwC-DP. This example incorporates interactions via community-generated Observation Fields*2 even though these are not currently controlled fields in the core iNaturalist data model. In Fig. 1, the relevant tables (e.g., occurrence, agent, event, media) of the DwC-DP are connected by the schemas (prepositions, e.g., conducted by, about, based on).
While Occurrence remains a foundational element, DwC-DP brings structure to a richer ecosystem of data that supports both incremental improvements and novel capabilities for data providers and consumers alike. When it comes to implementing this in production for iNaturalist, many decisions will need to be made to balance the tradeoffs of use cases unmet by DwC-A, complexity, processing time, file sizes, and more.
Files
BISS_article_182954.pdf
Files
(169.4 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:d53e482c77d88c5263a8fe03d1e12d8d
|
157.3 kB | Preview Download |
|
md5:9e2593cfc1217ce0d3d804a6a7ec3cbd
|
12.1 kB | Preview Download |
Linked records
Additional details
Related works
- Has part
- Other: 10.3897/biss.10.182954.suppl1 (DOI)
- Other: https://zenodo.org/record/19029415 (URL)
References
- GBIF (2021) Darwin Core Archives – How-to Guide, version 2.2. GBIF Secretariat, Copenhagen. URL: https://ipt.gbif.org/manual/en/ipt/3.0/dwca-guide