There is a newer version of the record available.

Published March 5, 2018 | Version 0.4.2
Dataset Open

Global Biotic Interactions: Taxon Graph

Authors/Creators

  • 1. 400 Perkins St Apt 104, Oakland, CA 94610

Description

Global Biotic Interactions: Taxon Cache and Taxon Map

Global Biotic Interactions (GloBI) provides access to existing species interaction datasets (Poelen et al. 2014, http://globalbioticinteractions.org). As part of the dataset integration and aggregation, a best effort is made to resolve, match and link taxonomic names and associated vernacular/common names, hierarchies and thumbnails. 

The data archives included in this publication contain established taxonomic links (taxonMap.tsv.gz) and taxonomic information (taxonCache.tsv.gz) that GloBI retrieved and integrated from taxonomic name sources and web services associated with http://itis.gov, http://globalnames.org, http://eol.org and others open data services. 

While GloBI is not a naming authority and the primary goal of the name matching process is to detect incorrect or outdates names, the archives may serve as an example of how to publish denormalized taxonomic records and their interrelatioships in a pragmatic way.

For related discussion threads, see https://github.com/jhpoelen/eol-globi-data/issues/145 , https://github.com/jhpoelen/eol-globi-data/issues/274 , https://github.com/jhpoelen/eol-globi-data/issues/70 , https://github.com/EOL/tramea/issues/10 and https://github.com/jhpoelen/eol-globi-data/issues/274 .

Files
  
  README 
      this file
  
  taxonCache.tsv.gz 
      Taxonomic name, ids, hierarchies, common names and thumbnail associated to taxa known to GloBI. Accessed at https://depot.globalbioticinteractions.org/datasets/org/globalbioticinteractions/taxon/0.4.2/taxon-0.4.2.zip on 5 March 2018.
 
  taxonCacheFirst10.tsv
      Header and 10 following lines from taxonCache.tsv.gz.
 
  taxonMap.tsv.gz 
      Links between taxon name and ids across various taxon providers. Accessed at https://depot.globalbioticinteractions.org/datasets/org/globalbioticinteractions/taxon/0.4.2/taxon-0.4.2.zip on 5 March 2018. 

  taxonMapFirst10.tsv
      Header and 10 following lines from taxonMap.tsv.gz.
 
  prefixes.tsv
      Term prefixes and their associated uri schemes. 

Column Descriptions

  taxonCache.tsv.gz 

    1 | id
    2 | name
    3 | rank
    4 | commonNames
    5 | path
    6 | pathIds 
    7 | pathNames
    8 | externalUrl
    9 | thumbnailUrl
 
  taxonMap.tsv.gz

    1 | providedTaxonId
    2 | providedTaxonName
    3 | resolvedTaxonId
    4 | resolvedTaxonName

References

Jorrit H. Poelen, James D. Simons and Chris J. Mungall. (2014). Global Biotic Interactions: An open infrastructure to share and analyze species-interaction datasets. Ecological Informatics. http://dx.doi.org/10.1016/j.ecoinf.2014.08.005.

Updates

2018-03-02
version: 0.3.0

This taxon archive version was created by taking GloBI taxon v0.2 (Jan 2018) and appending a semi-automatically created WikiData taxon mapping and taxon cache.

2018-03-05
version: 0.4.1

Taxon Cache/Map created by merging WikiData map/cache with globalbioticinteractions.org:taxon:0.1 .

Same as v0.4 with corrected taxonCache header that includes commonNames. 

2018-03-05
version: 0.4.2

was created by:

taking globi taxon 0.4.1 
extracting all interaction names using elton
matching all taxon 0.4.1 using nomer
matching all unresolved taxa using nomer with "globi-globalnames" matcher

Then, to create taxonMap and taxonCache,
resulting SAME_AS and SYNONYM_OF external ids were mapped to wikidata ids using taxon-wikidata 0.4.1
split columns to produce taxonMap and taxonCache
These taxonMap and taxonCache files were then appended to the 0.4.1 cache.

Files

Files (125.6 MB)

Name Size Download all
md5:0b155b813eda80632364d1b46f3101c8
1.4 kB Download
md5:d39422d01af5e0b777c9be1da4aa8a92
3.6 kB Download
md5:dff88fe72eb37f511f18beb16318666d
105.1 MB Download
md5:598c5c835ed321e08dcf124f77217020
9.1 kB Download
md5:85f74b289fee5e61688dc0c708ad32f4
20.4 MB Download
md5:e1d333f415db567cd7a62f64f6260f63
950 Bytes Download

Additional details

Related works