Dataset Open Access
Poelen, Jorrit
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <controlfield tag="005">20200124192457.0</controlfield> <controlfield tag="001">1213477</controlfield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">77571089</subfield> <subfield code="z">md5:b9ef1826b8994a135226511f3442f1ee</subfield> <subfield code="u">https://zenodo.org/record/1213477/files/links-globi-wd-ott.tsv.gz</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">53115231</subfield> <subfield code="z">md5:5c71baf1a0f96146731e4e1bcf00fc72</subfield> <subfield code="u">https://zenodo.org/record/1213477/files/wikidata-taxon-info20171227.tsv.gz</subfield> </datafield> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2018-04-06</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="p">openaire_data</subfield> <subfield code="o">oai:zenodo.org:1213477</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="a">Poelen, Jorrit</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">20 GB in 10 minutes: Data linking across major biodiversity databases: Data supplements</subfield> </datafield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution 4.0 International</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p>This supplementary data publication contains:</p> <p><strong>links-globi-wd-ott.tsv.gz:</strong>&nbsp;aggregate list of taxon graphs from Open Tree of Life Taxonomy (OTT), GloBI and Wikidata. This tab separated two column table, describe the taxonomic identifiers&nbsp;(e.g., NCBI:9606) that map into OTT, GloBI and Wikidata. For instance, the line &quot;NCBI:9689{tab}WD:Q140&quot; indicates that wikidata links their lion (<em>Panthera leo</em>,&nbsp;https://www.wikidata.org/wiki/Q140)&nbsp;to NCBI&#39;s lion (<em>Panthera leo</em>, https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Info&amp;id=9689).</p> <p><strong>wikidata-taxon-info20171227.tsv.gz:&nbsp;</strong>a terse 5 column file in tab-separated format of taxon objects extracted from&nbsp;WikiData. (2018). Wikidata dump 2017-12-27 [Data set]. Zenodo. http://doi.org/10.5281/zenodo.1211767 . The columns contain the following:</p> <ol> <li>wikidata taxon item id (e.g., Q140 or https://www.wikidata.org/wiki/Q140)</li> <li>scientific name of taxon item id (e.g., Panthera leo, Mammalia)</li> <li>rank id of the taxon item id (e.g., Q7432 species or https://www.wikidata.org/wiki/Q7432). To retrieve a full list of wikidata taxon rank ids and their common names, you can use sparql to query wikidata (e.g.,&nbsp;<a href="https://github.com/globalbioticinteractions/nomer/blob/c3a1f5a2ebfb87ffc67e3bace19b82d96c0d25e8/nomer/src/main/java/org/globalbioticinteractions/nomer/util/WikidataTaxonRankLoader.java">Nomer&#39;s WikidataTaxonRankLoader</a>&nbsp;).&nbsp;</li> <li>parent ids if taxon item id using pipes &quot;|&quot; as separators if there&#39;s multiple parents.&nbsp;&nbsp;Please note that some taxon items have multiple parents (e.g.,&nbsp;https://www.wikidata.org/wiki/Q774014).</li> <li>external taxonomic identifiers that taxon item link to (e.g. &quot;ITIS:162532|EOL:8266|GBIF:2960|WORMS:125440&quot;) . If muliple are present, pipes &quot;|&quot; are used to separate the links. Only a selection of taxonomic schemes was used, namely: NCBI, GBIF, ITIS, WORMS, FISHBASE, IF (index fungorum) and EOL.</li> </ol> <p>The datasets can be recreated by scripts in&nbsp;https://github.com/bio-guoda/guoda-datasets/tree/master/wikidata or <a href="https://doi.org/10.5281/zenodo.1428949">https://doi.org/10.5281/zenodo.1428949</a>&nbsp;.</p></subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">doi</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="a">10.5281/zenodo.1213476</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.5281/zenodo.1213477</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> </record>
All versions | This version | |
---|---|---|
Views | 205 | 205 |
Downloads | 48 | 48 |
Data volume | 2.8 GB | 2.8 GB |
Unique views | 195 | 195 |
Unique downloads | 41 | 41 |