Published June 13, 2018 | Version v1
Journal article Open

Challenges For Implementing Collections Data Quality Feedback: synthesizing the community experience

  • 1. Florida State University, Tallahassee, United States of America|Florida State University, Tallahassee, United States of America
  • 2. CSIRO National Research Collections of Australia, Canberra, Australia|CSIRO , Canberra, Australia

Description

Much data quality (DQ) feedback is now available to data providers from aggregators of collections specimen and related data. Similarly, transcription centres and crowdsourcing platforms also provide data that must be assessed and often manipulated before uploading to a local database and subsequently published with aggregators. In order to facilitate broader DQ information use aggregators (GBIF, ALA, iDigBio, VertNet) and others, through the TDWG BDQ Interest Group, are harmonizing the DQ information provided - transforming part of the DQ feedback process. But, collections sharing data face challenges when trying to evaluate and integrate the information changes offered (by aggregators) for given records in local collection management systems and collection databases. Sharing DQ integration experiences can help reveal risks and opportunities. Discovering others have the same conundrums helps develop a community of belonging and may assist in removing duplication of effort. It is important to leverage the knowledge and experience of those who are currently validating data to improve the efficiency and effectiveness of the process. Documenting and classifying these challenges also facilitates motivation and community building by informing those who would tackle these challenges. In this case, talks from aggregators and data providers give all of us a chance to learn from their stories about implementing and integrating DQ feedback.

Following the symposium, a special interest group (SIG at SPNHC) meeting offers everyone an opportunity to add their experiences with aggregator DQ feedback. See the SIG meeting: "Add Your Input to Challenges for Implementing Collections Data Quality Feedback: synthesizing the community experience", for details. For tractable issues, we plan to assemble and note the expected ways in which these barriers can be overcome. Where possible, we can tap into existing community resources (SPNHC, TDWG, biology.stackexchange.com, iDigBio, etc.) to help our data providers implement future data updates and track changes. At the same time, we plan to analyze the intractable issues - documenting why they remain challenging - and what, if any potential solutions are available or likely to be available in the future. This information provides future projects like DiSSCo (Distributed System of Scientific Collections) and BCoN (Biodiversity Collections Network) and others worldwide the information required to plan more effectively for cyber/human infrastructure. Synthesizing this input helps visionaries better understand, anticipate and support DQ management and data mobilization efforts going forward by informing design of future proposals and global projects structured with these outcomes in mind. At the end of the workshop, we intend to publish our findings, and merge them with the results of a global survey on the same topic.

Files

BISS_article_26003.pdf

Files (55.6 kB)

Name Size Download all
md5:88a3d02bb1579c4b9890d8813e4ef45c
45.7 kB Preview Download
md5:51cc1a2806239106bcf6de9d1469bac9
9.9 kB Preview Download

Linked records