Published August 24, 2023 | Version v1
Journal article Open

Publishing Australian marine data to OBIS: twenty years of lessons learnt

  • 1. CSIRO, Hobart, Australia

Description

In 2003, the Australian Antarctic Data Centre published the first Australian dataset of seabirds from the Southern Ocean to OBIS (Ocean Biodiversity Information System) via DiGIR (Distributed Generic Information Retrieval). The dataset initially had 17 fields with an emphasis on counts of individuals. Standards evolved and with the development of the IPT (Integrated Publishing Toolkit) by GBIF (Global Biodiversity Information Facility) around 2008, large datasets could be published. OBIS subsequently adopted the IPT as the preferred publishing tool for providers to use. In 2016, the Darwin Core Event core with the  OBIS Extended Measurements and Facts extension was released (De Pooter et al. 2017), meaning that richer and more comprehensive datasets could be published via the IPT. It is only recently that the biological aggregators (e.g.,  OBIS, GBIF) are looking at enhancing functionality to report this data.

The Australian OBIS Node (OBIS-AU), hosted by CSIRO NCMI (the Commonwealth Science and Industrial Research Organisation National Collections and Marine Infrastructure Business Unit) now manages an Australian region marine biodiversity IPT with 30 million records from over 450 datasets. In the last 12 months, using the GBIF DNA Derived Data Extention, the OBIS-AU Node has published extensive eDNA datasets to OBIS with sequences and DNA related metadata.

OBIS-AU has developed tools and procedures to ensure that data is of the best possible quality before it is published. Issues covered include preventing the duplication of data, preserving context, enhancing data once published with improvements in publication schemas, matching taxa, and identification of temporal or spatial errors.

Files

BISS_article_111565.pdf

Files (64.2 kB)

Name Size Download all
md5:c9596b1853a16055475eb4bd5b38e920
64.2 kB Preview Download

System files (12.9 kB)

Name Size Download all
md5:103d564b5dae8b8ea9ddb23bd8ab9a0c
12.9 kB Download

Linked records