Dataset Open Access

20 GB in 10 minutes: Data linking across major biodiversity databases: Data supplements

Poelen, Jorrit

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.1213477", 
  "title": "20 GB in 10 minutes: Data linking across major biodiversity databases: Data supplements", 
  "issued": {
    "date-parts": [
  "abstract": "<p>This supplementary data publication contains:</p>\n\n<p><strong>links-globi-wd-ott.tsv.gz:</strong>&nbsp;aggregate list of taxon graphs from Open Tree of Life Taxonomy (OTT), GloBI and Wikidata. This tab-separated, two-column table describes the taxonomic identifiers&nbsp;(e.g., NCBI:9606) that map into OTT, GloBI and Wikidata. For instance, the line &quot;NCBI:9689{tab}WD:Q140&quot; indicates that Wikidata links its lion (<em>Panthera leo</em>, Q140) to NCBI&#39;s lion (<em>Panthera leo</em>, id=9689).</p>\n\n<p><strong>wikidata-taxon-info20171227.tsv.gz:&nbsp;</strong>a terse, tab-separated, 5-column file of taxon objects extracted from&nbsp;WikiData. (2018). Wikidata dump 2017-12-27 [Data set]. Zenodo. The columns contain the following:</p>\n\n<ol>\n\t<li>Wikidata taxon item id (e.g., Q140)</li>\n\t<li>scientific name of taxon item id (e.g., Panthera leo, Mammalia)</li>\n\t<li>rank id of the taxon item id (e.g., Q7432 species). To retrieve a full list of Wikidata taxon rank ids and their common names, you can use SPARQL to query Wikidata (e.g.,&nbsp;<a href=\"\">Nomer&#39;s WikidataTaxonRankLoader</a>).&nbsp;</li>\n\t<li>parent ids of the taxon item id, using pipes &quot;|&quot; as separators if there are multiple parents.&nbsp;Please note that some taxon items have multiple parents.</li>\n\t<li>external taxonomic identifiers that the taxon item links to (e.g., &quot;ITIS:162532|EOL:8266|GBIF:2960|WORMS:125440&quot;). If multiple are present, pipes &quot;|&quot; are used to separate the links. Only a selection of taxonomic schemes was used, namely: NCBI, GBIF, ITIS, WORMS, FISHBASE, IF (Index Fungorum) and EOL.</li>\n</ol>\n\n<p>The datasets can be recreated by scripts in&nbsp; or <a href=\"\"></a>&nbsp;.</p>", 
  "author": [
      "family": "Poelen, Jorrit"
  "version": "0.1", 
  "type": "dataset", 
  "id": "1213477"
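
The two file layouts described in the abstract (a two-column links table, and a five-column taxon table whose last two columns are pipe-separated) could be read along the following lines. This is a minimal sketch and not the author's tooling: the names `Taxon`, `split_pipes`, `read_links`, `read_taxa` and `open_tsv_gz` are hypothetical, and the code assumes the files are UTF-8, have no header row, and that rows with an unexpected column count can be skipped. The parent ids in the sample row are placeholders, not real Wikidata items.

```python
import gzip
from typing import Iterable, Iterator, List, NamedTuple, Tuple


class Taxon(NamedTuple):
    """One row of the five-column taxon table described in the abstract."""
    item_id: str             # column 1: Wikidata taxon item id, e.g. "Q140"
    name: str                # column 2: scientific name, e.g. "Panthera leo"
    rank_id: str             # column 3: rank item id, e.g. "Q7432"
    parent_ids: List[str]    # column 4: parent ids, split on "|"
    external_ids: List[str]  # column 5: external identifiers, split on "|"


def split_pipes(field: str) -> List[str]:
    """Split a pipe-separated field; an empty field yields an empty list."""
    return [part for part in field.split("|") if part]


def read_links(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (identifier, identifier) pairs from the two-column links table."""
    for line in lines:
        columns = line.rstrip("\r\n").split("\t")
        if len(columns) == 2:  # assumption: skip malformed rows silently
            yield columns[0], columns[1]


def read_taxa(lines: Iterable[str]) -> Iterator[Taxon]:
    """Yield Taxon records from the five-column taxon table."""
    for line in lines:
        columns = line.rstrip("\r\n").split("\t")
        if len(columns) != 5:  # assumption: skip malformed rows silently
            continue
        item_id, name, rank_id, parents, externals = columns
        yield Taxon(item_id, name, rank_id,
                    split_pipes(parents), split_pipes(externals))


def open_tsv_gz(path: str):
    """Open a gzip-compressed TSV file as text (assumes UTF-8 encoding)."""
    return gzip.open(path, mode="rt", encoding="utf-8")


if __name__ == "__main__":
    # In-memory samples; the first row is the example given in the abstract.
    sample_links = ["NCBI:9689\tWD:Q140\n"]
    # Q_PARENT_A and Q_PARENT_B are placeholder parent ids for illustration.
    sample_taxa = [
        "Q140\tPanthera leo\tQ7432\tQ_PARENT_A|Q_PARENT_B\tNCBI:9689\n"
    ]
    for pair in read_links(sample_links):
        print(pair)
    for taxon in read_taxa(sample_taxa):
        print(taxon)
```

With the downloaded files, the same readers can be fed from `open_tsv_gz("links-globi-wd-ott.tsv.gz")` or `open_tsv_gz("wikidata-taxon-info20171227.tsv.gz")` inside a `with` block, which streams the rows instead of loading the whole file into memory.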
                  All versions   This version
Views             236            236
Downloads         57             57
Data volume       3.3 GB         3.3 GB
Unique views      223            223
Unique downloads  50             50

