Published March 12, 2024 | Version v1
Dataset Open

A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)

  • 1. Universidade Estadual de Campinas

Description

Publication date:
2024-03-12T14:26:33-03:00


A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)
---

The Global Biodiversity Information Facility (GBIF) facilitates access to billions of biodiversity data records. These records include detailed accounts of life on Earth.

To help organize records of specific life forms, GBIF provides a taxonomic backbone [1,2]. This backbone contains a long list of names used to describe species, along with their associated hierarchies and taxonomic publications. These lists are sourced from datasets around the world.

At the time of writing (18 Aug 2021), GBIF publishes a simplified version of its taxonomic backbone at [https://hosted-datasets.gbif.org/datasets/backbone/](https://hosted-datasets.gbif.org/datasets/backbone/) [1].

This repository provides a script to pre-process https://hosted-datasets.gbif.org/datasets/backbone/backbone-current-simple.txt.gz, both to facilitate access and to speed up the creation of search indexes.

Pre-processing steps currently include:
1. reducing the number of columns
2. reverse sorting by id
3. reverse sorting by name
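
The steps listed above can be sketched with standard Unix tools. This is a minimal illustration and not the contents of repackage-gbif-backbone.sh: the function names are hypothetical, the sort is assumed to be a byte-order (not numeric) reverse sort, the id is assumed to be the first column, and the position of the name column is left for the caller to supply.

```shell
#!/bin/sh
# Sketch only: function names, the byte-order sort (LC_ALL=C), and the
# column positions are assumptions, not taken from
# repackage-gbif-backbone.sh.

TAB=$(printf '\t')

# Steps 1 and 2: keep the first 20 tab-separated columns, then reverse
# sort on the first column (assumed to hold the id).
reverse_sort_by_id() {
  cut -f1-20 | LC_ALL=C sort -t "$TAB" -r -k1,1
}

# Steps 1 and 3: emit "name<TAB>id" pairs, reverse sorted by name.
# $1 is the 1-based position of the name column, supplied by the caller.
reverse_sort_by_name() {
  awk -F '\t' -v OFS='\t' -v col="$1" '{ print $col, $1 }' \
    | LC_ALL=C sort -t "$TAB" -r -k1,1
}
```

Under these assumptions, a pipeline such as `gzip -dc backbone-current-simple.txt.gz | reverse_sort_by_id | gzip > gbif-backbone-by-id.tsv.gz` would produce a file of the same shape as the by-id file described in this README. Fixing the locale with `LC_ALL=C` keeps the sort order reproducible across machines.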


Contents
---

README:
this file

repackage-gbif-backbone.sh:
script used to repackage GBIF Simple Backbone.

backbone-current-simple.txt.gz:
original GBIF backbone archive

gbif-backbone-by-name.tsv.gz:
two-column, gzipped, tab-separated text file with columns name and id,
reverse sorted by name

gbif-backbone-by-name.tsv.sha256:
sha256 hash of the uncompressed content of gbif-backbone-by-name.tsv.gz

gbif-backbone-by-id.tsv.gz:
20-column, gzipped, tab-separated text file with the first 20 columns of the repackaged GBIF backbone file,
reverse sorted by id

gbif-backbone-by-id.tsv.sha256:
sha256 hash of the uncompressed content of gbif-backbone-by-id.tsv.gz
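
A downloaded archive can be checked against its .sha256 file along the following lines. This is a hedged sketch: it assumes GNU coreutils `sha256sum` is available and that each .sha256 file holds a single 64-character hex digest, and the helper names are hypothetical.

```shell
#!/bin/sh
# Sketch only: assumes GNU coreutils sha256sum and that the .sha256 file
# holds one 64-character hex digest; helper names are hypothetical.

# Print the sha256 hex digest of the uncompressed content of a gzip file.
sha256_uncompressed() {
  gzip -dc "$1" | sha256sum | cut -d ' ' -f 1
}

# Succeed if the digest recorded in $2 matches the uncompressed
# content of the gzip file $1.
verify_uncompressed() {
  [ "$(sha256_uncompressed "$1")" = "$(cut -c 1-64 "$2")" ]
}
```

For example, `verify_uncompressed gbif-backbone-by-id.tsv.gz gbif-backbone-by-id.tsv.sha256 && echo ok` would print `ok` only when the two digests agree.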

References
---

[1] Simplified GBIF Backbone Taxonomy. Accessed at https://hosted-datasets.gbif.org/datasets/backbone/ on 2023-08-28.
[2] GBIF Secretariat (2021). GBIF Backbone Taxonomy. Checklist dataset https://doi.org/10.15468/39omei accessed via GBIF.org on 2023-08-28.


Hash URIs
---
This publication includes the following content URIs:

hash://sha256/82d5f2153b4533322692d95eeb18b0f103e1b2297e38bd9ea935b07ba86cd7d5
hash://sha256/fde017e1315b4ae6fc1e1bae79f9cfd234b8ba40f6f4fb5ac031084a3b1763f0
hash://sha256/1804594be92a0e9a7b60c245925a1a488d4d98a4a38028cb0c8a420faefa36c2 (uncompressed)
hash://sha256/480926c8a1f218f8d5d76db7b4687c09ea11ab1c7ee9b0788238f2ff1eab7298
hash://sha256/8184f1e96d306ba5355e3e229d8e93eacd3fee4ab19107ae99b71bc4b9d523b6 (uncompressed)
hash://sha256/6241ffc32d0e1dbd826b36e45dfa469046ce69e671224bbb43f89996ca577955
hash://sha256/6df36a48615d9a3d7541995c6fc01da7f3ee34679d560ec619212a2c5f037679 (uncompressed)
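
A URI of the form shown above can be derived from any byte stream by prefixing its sha256 hex digest. The sketch below assumes GNU coreutils `sha256sum`; `hash_uri` is a hypothetical helper name, and which URI belongs to which file is not stated in this README.

```shell
#!/bin/sh
# Sketch only: assumes GNU coreutils sha256sum; hash_uri is a
# hypothetical helper name.

# Read bytes from stdin and print the matching hash://sha256/... URI.
hash_uri() {
  printf 'hash://sha256/%s\n' "$(sha256sum | cut -d ' ' -f 1)"
}
```

Running `hash_uri < gbif-backbone-by-name.tsv.gz` gives the URI of the compressed file, while `gzip -dc gbif-backbone-by-name.tsv.gz | hash_uri` gives the URI of its uncompressed content, which can then be compared with the entries marked "(uncompressed)".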

Files (1.1 GB)

Checksum                               Size
md5:d8ae3f9d0c49c76f83e6067f95bdabb0   488.6 MB
md5:12b464a565f5e7231a2bd70764ee9742   65 Bytes
md5:df959c6fad42fea82ccdae6ed98f0f31   65 Bytes
md5:6263d0814566790a4da5926ff10f783f   578.9 MB
md5:516db236d105d985402399af77cfb2a2   65 Bytes
md5:27fb80ac44d3b39cbd969660424047da   65 Bytes
md5:88fe59a1ddae8c75d22b51b5c9c2838d   62.1 MB
md5:4e71de3e59ed1f131856cf09fc95d6ad   65 Bytes
md5:9f44af13b050b2c50d966442662d6c9d   65 Bytes
md5:d06045bf7c9b53faca3913aeae135fc5   2.7 kB
md5:ade69b379fa65c590aad38dbcd74679d   2.0 kB
md5:bb6200457f936b386a3e5d7c584e6833   65 Bytes

Additional details

Related works

Is supplement to
Dataset: 10.15468/39omei (DOI)

Software

Repository URL
https://github.com/jhpoelen/repackage-gbif-backbone
Programming language
Shell
Development Status
Active