A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)
Creators
Description
Publication date:
2022-12-06T07:37:19-06:00
A Repackaged Taxonomic Backbone of Global Biodiversity Information Facility (GBIF)
---
Global Biodiversity Information Facility (GBIF) facilitates access to billions of biodiversity data records. These records include detailed accounts of life on earth.
To help records of specific life forms, GBIF provides a taxonomic backbone [1,2]. This backbone contains a long list of names used to describe species and associated hierarchies and taxonomic publications. These lists are sourced from datasets around the world.
At time of writing (6 Dec 2022), GBIF publishes a simplified version of their taxonomic backbone at [https://hosted-datasets.gbif.org/datasets/backbone/](https://hosted-datasets.gbif.org/datasets/backbone/) [1].
This repository provides script to pre-process https://hosted-datasets.gbif.org/datasets/backbone/current/simple.txt.gz to help facilitate access and improve performance of the creation of search indexes.
Pre-process steps currently include:
1. reducing amount of columns
2. reverse sort by id
3. reverse sort by name
Contents
---
README:
this file
repackage-gbif-backbone.sh:
script used to repackage GBIF Simple Backbone.
repackage-gbif-backbone.log:
log of repackaging of GBIF Simple Backbone.
backbone-current-simple.txt.gz:
original GBIF backbone archive
gbif-backbone-by-name.tsv.gz:
two columns, gzipped, tab-separated text file with columns name, and id
reverse sorted by name
gbif-backbone-by-name.tsv.sha256:
sha256 hash of the uncompressed gbif-backbone-by-name.tsv.gz
gbif-backbone-by-id.tsv.gz:
20 columns, gzipped, tab-separated text file with first 20 columns of repackaged GBIF backbone file
reverse sorted by id
gbif-backbone-by-id.tsv.sha256:
sha256 hash of the uncompressed gbif-backbone-by-id.tsv.gz
References
---
[1] Simplied GBIF Backbone Taxonomy. Accessed at https://hosted-datasets.gbif.org/datasets/backbone/ on 2022-12-06.
[2] GBIF Secretariat (2021). GBIF Backbone Taxonomy. Checklist dataset https://doi.org/10.15468/39omei accessed via GBIF.org on 2021-08-18.
Hash URIs
---
This publication includes the following content uris:
hash://sha256/82d5f2153b4533322692d95eeb18b0f103e1b2297e38bd9ea935b07ba86cd7d5
hash://sha256/50c155f66efb2efba0b8b624f8541e81cbe16a701d420a5073791fb993f72919
hash://sha256/9cd7d4c91292d86c726210446cd6fe45602505a7c0ea3b7c4f4f481f85f193ad (uncompressed)
hash://sha256/f950dde25cce9ba9cce67caa1c68ce0c99cb31fe2dc9658fec85a987d9f31654
hash://sha256/f21c6b90f17c6083fcfb4853f3c581dcc2aadd291691fa128392a205321f420b (uncompressed)
hash://sha256/5e0a4d1d2d1cccbdcc6b2c9831fafe61c54eb055f2d13ec40d9ac161889b9f89
hash://sha256/f6e477133d0585706ee5522963b204200cb3cd198f011cbf62be0fa8519763b5 (uncompressed)
Files
Files
(1.3 GB)
Name | Size | Download all |
---|---|---|
md5:976e79fe97546caaa52faf91e9c5907e
|
691.2 MB | Download |
md5:903791e394dd47fa68b61d7ed9d647a6
|
65 Bytes | Download |
md5:1985fe908ff3d73d8e51eb90b2451018
|
65 Bytes | Download |
md5:3b9945241ed2ddd8fffc9a6cf09f675b
|
566.0 MB | Download |
md5:5188beb8fbaa6e123c0277695a9934de
|
65 Bytes | Download |
md5:af7f9fcbce92675b102dbbd63640ee7b
|
65 Bytes | Download |
md5:572432ec35d2111292f6705ad1c0a9b7
|
61.3 MB | Download |
md5:cd480ae21ebd1c78ca5ed32d05405224
|
65 Bytes | Download |
md5:9b06af645280a3044356ec76e2b70240
|
65 Bytes | Download |
md5:e524b70099aa620dd978f544b09867d3
|
65 Bytes | Download |
md5:e524b70099aa620dd978f544b09867d3
|
65 Bytes | Download |
md5:cd0536469425d96e9889050027e3ff3f
|
2.8 kB | Download |
md5:76e9f300c4f13ae3aa280397a8238690
|
3.7 kB | Download |
md5:ade69b379fa65c590aad38dbcd74679d
|
2.0 kB | Download |
md5:bb6200457f936b386a3e5d7c584e6833
|
65 Bytes | Download |
Additional details
Related works
- Is supplement to
- Dataset: 10.15468/39omei (DOI)