Published December 6, 2022 | Version 0.3
Dataset Open

A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)

Description

Publication date: 2022-12-06T07:37:19-06:00


A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)
---

The Global Biodiversity Information Facility (GBIF) facilitates access to billions of biodiversity data records. These records include detailed accounts of life on Earth.

To help organize records of specific life forms, GBIF provides a taxonomic backbone [1,2]. This backbone contains a long list of names used to describe species, along with their associated hierarchies and taxonomic publications. These lists are sourced from datasets around the world.

At the time of writing (6 Dec 2022), GBIF publishes a simplified version of its taxonomic backbone at [https://hosted-datasets.gbif.org/datasets/backbone/](https://hosted-datasets.gbif.org/datasets/backbone/) [1].

This repository provides a script to pre-process https://hosted-datasets.gbif.org/datasets/backbone/current/simple.txt.gz, both to facilitate access and to speed up the creation of search indexes.

Pre-processing steps currently include:
1. reducing the number of columns
2. reverse sorting by id
3. reverse sorting by name
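
The three steps above can be sketched in a few lines of Python. This is only an illustration on a tiny in-memory sample, not the logic of repackage-gbif-backbone.sh: the column positions (`ID_COLUMN`, `NAME_COLUMN`), the helper names, and the use of plain string comparison for sorting are all assumptions made for the example.

```python
import csv
import io

# Hypothetical column positions, chosen for this sample only.
# The real layout of simple.txt may differ.
ID_COLUMN = 0
NAME_COLUMN = 2


def read_tsv(text):
    """Parse tab-separated text into a list of rows, without quote handling."""
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


def by_id(rows, keep=20):
    """Keep the first `keep` columns, then reverse sort by the id column.

    Ids are compared as strings here, which is an assumption.
    """
    trimmed = [row[:keep] for row in rows]
    return sorted(trimmed, key=lambda row: row[ID_COLUMN], reverse=True)


def by_name(rows):
    """Reduce each row to (name, id), then reverse sort by name."""
    pairs = [[row[NAME_COLUMN], row[ID_COLUMN]] for row in rows]
    return sorted(pairs, key=lambda pair: pair[0], reverse=True)


# Made-up sample rows: id, a filler column, and a name.
sample = "1\tx\tAnimalia\n2\tx\tPlantae\n3\tx\tFungi\n"
rows = read_tsv(sample)
id_table = by_id(rows)
name_table = by_name(rows)
```

Sorting a file of this size in memory may not be practical; the sketch is meant to show the shape of the transformation rather than a way to reproduce the published files.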


Contents
---

README:
    this file

repackage-gbif-backbone.sh:
    script used to repackage GBIF Simple Backbone.

repackage-gbif-backbone.log:
    log of repackaging of GBIF Simple Backbone.

backbone-current-simple.txt.gz:
    original GBIF backbone archive

gbif-backbone-by-name.tsv.gz:
    gzipped, tab-separated text file with two columns: name and id
    reverse sorted by name

gbif-backbone-by-name.tsv.sha256:
    sha256 hash of the uncompressed content of gbif-backbone-by-name.tsv.gz

gbif-backbone-by-id.tsv.gz:
    gzipped, tab-separated text file containing the first 20 columns of the repackaged GBIF backbone file
    reverse sorted by id

gbif-backbone-by-id.tsv.sha256:
    sha256 hash of the uncompressed content of gbif-backbone-by-id.tsv.gz
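
Because the .sha256 files describe the uncompressed content, a check has to decompress the archive before hashing. The following is a minimal sketch of one way to do that in Python; the function name `sha256_of_uncompressed` and the chunk size are assumptions for illustration, and the demonstration uses a small in-memory archive rather than the published files.

```python
import gzip
import hashlib
import io


def sha256_of_uncompressed(gz_stream, chunk_size=1 << 20):
    """Return the hex sha256 of the decompressed bytes of a gzip stream.

    The stream is read in chunks so that large archives do not need
    to fit in memory.
    """
    digest = hashlib.sha256()
    with gzip.GzipFile(fileobj=gz_stream) as plain:
        for chunk in iter(lambda: plain.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Demonstration on made-up data: the hash of the decompressed stream
# equals the hash of the original payload.
payload = b"Plantae\t6\nAnimalia\t1\n"
archive = io.BytesIO(gzip.compress(payload))
computed = sha256_of_uncompressed(archive)
expected = hashlib.sha256(payload).hexdigest()
```

For a downloaded archive, the same function could be called on a file opened in binary mode, for example `open("gbif-backbone-by-name.tsv.gz", "rb")`, and the result compared with the value recorded in the matching .sha256 file.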

References
---

[1] Simplified GBIF Backbone Taxonomy. Accessed at https://hosted-datasets.gbif.org/datasets/backbone/ on 2022-12-06.
[2] GBIF Secretariat (2021). GBIF Backbone Taxonomy. Checklist dataset https://doi.org/10.15468/39omei accessed via GBIF.org on 2021-08-18.


Hash URIs
---
This publication includes the following content URIs:

hash://sha256/82d5f2153b4533322692d95eeb18b0f103e1b2297e38bd9ea935b07ba86cd7d5
hash://sha256/50c155f66efb2efba0b8b624f8541e81cbe16a701d420a5073791fb993f72919
hash://sha256/9cd7d4c91292d86c726210446cd6fe45602505a7c0ea3b7c4f4f481f85f193ad (uncompressed)
hash://sha256/f950dde25cce9ba9cce67caa1c68ce0c99cb31fe2dc9658fec85a987d9f31654
hash://sha256/f21c6b90f17c6083fcfb4853f3c581dcc2aadd291691fa128392a205321f420b (uncompressed)
hash://sha256/5e0a4d1d2d1cccbdcc6b2c9831fafe61c54eb055f2d13ec40d9ac161889b9f89
hash://sha256/f6e477133d0585706ee5522963b204200cb3cd198f011cbf62be0fa8519763b5 (uncompressed)
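
The URIs listed above follow the pattern `hash://sha256/` followed by 64 hexadecimal characters. As a rough sketch, such a URI can be derived from content and checked against it as shown below; the helper names (`to_hash_uri`, `parse_hash_uri`, `matches`) are hypothetical, and the handling of trailing annotations such as "(uncompressed)" is an assumption based on how the list above is written.

```python
import hashlib
import re

# Pattern assumed from the URIs listed in this README.
HASH_URI = re.compile(r"^hash://sha256/([0-9a-f]{64})$")


def to_hash_uri(content: bytes) -> str:
    """Build a sha256 hash URI for the given bytes."""
    return "hash://sha256/" + hashlib.sha256(content).hexdigest()


def parse_hash_uri(line: str) -> str:
    """Extract the hex digest from a line that starts with a hash URI.

    Anything after the first whitespace, such as "(uncompressed)", is ignored.
    """
    token = line.split()[0]
    match = HASH_URI.match(token)
    if match is None:
        raise ValueError(f"not a sha256 hash URI: {token!r}")
    return match.group(1)


def matches(content: bytes, line: str) -> bool:
    """Check whether the content hashes to the digest named by the URI."""
    return hashlib.sha256(content).hexdigest() == parse_hash_uri(line)


# Example with empty content, whose sha256 digest is well known.
empty_uri = to_hash_uri(b"")
```

Identifying content by its hash rather than by its location means a copy obtained from any source can be checked against this list.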
 

Files (1.3 GB)

md5:976e79fe97546caaa52faf91e9c5907e  691.2 MB
md5:903791e394dd47fa68b61d7ed9d647a6  65 Bytes
md5:1985fe908ff3d73d8e51eb90b2451018  65 Bytes
md5:3b9945241ed2ddd8fffc9a6cf09f675b  566.0 MB
md5:5188beb8fbaa6e123c0277695a9934de  65 Bytes
md5:af7f9fcbce92675b102dbbd63640ee7b  65 Bytes
md5:572432ec35d2111292f6705ad1c0a9b7  61.3 MB
md5:cd480ae21ebd1c78ca5ed32d05405224  65 Bytes
md5:9b06af645280a3044356ec76e2b70240  65 Bytes
md5:e524b70099aa620dd978f544b09867d3  65 Bytes
md5:e524b70099aa620dd978f544b09867d3  65 Bytes
md5:cd0536469425d96e9889050027e3ff3f  2.8 kB
md5:76e9f300c4f13ae3aa280397a8238690  3.7 kB
md5:ade69b379fa65c590aad38dbcd74679d  2.0 kB
md5:bb6200457f936b386a3e5d7c584e6833  65 Bytes


Additional details

Related works

Is supplement to
Dataset: 10.15468/39omei (DOI)