Global Biotic Interactions: Interpreted Data Products hash://md5/89797a5a325ac5c50990581689718edf hash://sha256/946178b36c3ea2f2daa105ad244cf5d6cd236ec8c99956616557cf4e6666545b
Creators
Description
Global Biotic Interactions: Interpreted Data Products
Global Biotic Interactions (GloBI, https://globalbioticinteractions.org, [1]) aims to facilitate access to existing species interaction records (e.g., predator-prey, plant-pollinator, virus-host). This data publication provides interpreted species interaction data products. These products are the result of a process in which versioned, existing species interaction datasets ([2]) are linked to the so-called GloBI Taxon Graph ([3]) and transformed into various aggregate formats (e.g., tsv, csv, neo4j, rdf/nquad, darwin core-ish archives). In addition, the applied name maps are included to make the applied taxonomic linking explicit.
Citation
--------
GloBI is made possible by researchers, collections, projects and institutions openly sharing their datasets. When using this data, please make sure to attribute these *original data contributors*, including citing the specific datasets in derivative work. Each species interaction record indexed by GloBI contains a reference and dataset citation. Also, a full lists of all references can be found in citations.csv/citations.tsv files in this publication. If you have ideas on how to make it easier to cite original datasets, please open/join a discussion via https://globalbioticinteractions.org or related projects.
To credit GloBI for more easily finding interaction data, please use the following citation to reference GloBI:
Jorrit H. Poelen, James D. Simons and Chris J. Mungall. (2014). Global Biotic Interactions: An open infrastructure to share and analyze species-interaction datasets. Ecological Informatics. https://doi.org/10.1016/j.ecoinf.2014.08.005.
Bias and Errors
--------
As with any analysis and processing workflow, care should be taken to understand the bias and error propagation of data sources and related data transformation processes. The datasets indexed by GloBI are biased geospatially, temporally and taxonomically ([5], [6]). Also, mapping of verbatim names from datasets to known name concept may contains errors due to synonym mismatches, outdated names lists, typos or conflicting name authorities. Finally, bugs may introduce bias and errors in the resulting integrated data product.
To help better understand where bias and errors are introduced, only versioned data and code are used as an input: the datasets ([2]), name maps ([3]) and integration software ([6]) are versioned so that the integration processes can be reproduced if needed. This way, steps take to compile an integrated data record can be traced and the sources of bias and errors can be more easily found.
Contents
--------
README:
this file
citations.csv.gz:
contains data citations in a in a gzipped comma-separated values format.
citations.tsv.gz:
contains data citations in a gzipped tab-separated values format.
datasets.csv.gz:
contains list of indexed datasets in a gzipped comma-separated values format.
datasets.tsv.gz:
contains list of indexed datasets in a gzipped tab-separated values format.
verbatim-interactions.csv.gz
contains species interactions tabulated as pair-wise interaction in a gzipped comma-separated values format. Included taxonomic name are *not* interpreted, but included as documented in their sources.
verbatim-interactions.tsv.gz
contains species interactions tabulated as pair-wise interaction in a gzipped tab-separated values format. Included taxonomic name are *not* interpreted, but included as documented in their sources.
interactions.csv.gz:
contains species interactions tabulated as pair-wise interactions in a gzipped comma-separated values format. Included taxonomic names are interpreted using taxonomic alignment workflows and may be different than those provided by the original sources.
interactions.tsv.gz:
contains species interactions tabulated as pair-wise interactions in a gzipped tab-separated values format. Included taxonomic names are interpreted using taxonomic alignment workflows and may be different than those provided by the original sources.
refuted-interactions.csv.gz:
contains refuted species interactions tabulated as pair-wise interactions in a gzipped comma-separated values format. Included taxonomic names are interpreted using taxonomic alignment workflows and may be different than those provided by the original sources.
refuted-interactions.tsv.gz:
contains refuted species interactions tabulated as pair-wise interactions in a gzipped tab-separated values format. Included taxonomic names are interpreted using taxonomic alignment workflows and may be different than those provided by the original sources.
refuted-verbatim-interactions.csv.gz:
contains refuted species interactions tabulated as pair-wise interactions in a gzipped comma-separated values format. Included taxonomic name are *not* interpreted, but included as documented in their sources.
refuted-verbatim-interactions.tsv.gz:
contains refuted species interactions tabulated as pair-wise interactions in a gzipped tab-separated values format. Included taxonomic name are *not* interpreted, but included as documented in their sources.
interactions.nq.gz:
contains species interactions expressed in the resource description framework in a gzipped rdf/quads format.
dwca-by-study.zip:
contains species interactions data as a Darwin Core Archive aggregated by study using a custom, occurrence level, association extension.
dwca.zip:
contains species interactions data as a Darwin Core Archive using a custom, occurrence level, association extension.
neo4j-graphdb.zip:
contains a neo4j v3.5.x graph database snapshot containing a graph representation of the species interaction data.
taxonCache.tsv.gz:
contains hierarchies and identifiers associated with names from naming schemes in a gzipped tab-separated values format.
taxonMap.tsv.gz:
describes how names in existing datasets were mapped into existing naming schemes in a gzipped tab-separated values format.
References
-----
[1] Jorrit H. Poelen, James D. Simons and Chris J. Mungall. (2014). Global Biotic Interactions: An open infrastructure to share and analyze species-interaction datasets. Ecological Informatics. doi: 10.1016/j.ecoinf.2014.08.005.
[2] Poelen, J. H. (2020) Global Biotic Interactions: Elton Dataset Cache. Zenodo. doi: 10.5281/ZENODO.3950557.
[3] Poelen, J. H. (2021). Global Biotic Interactions: Taxon Graph (Version 0.3.28) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.4451472
[4] Hortal, J. et al. (2015) Seven Shortfalls that Beset Large-Scale Knowledge of Biodiversity. Annual Review of Ecology, Evolution, and Systematics, 46(1), pp.523–549. doi: 10.1146/annurev-ecolsys-112414-054400.
[5] Cains, M. et al. (2017) Ivmooc 2017 - Gap Analysis Of Globi: Identifying Research And Data Sharing Opportunities For Species Interactions. Zenodo. Zenodo. doi: 10.5281/ZENODO.814978.
[6] Poelen, J. et al. (2022) globalbioticinteractions/globalbioticinteractions v0.24.6. Zenodo. doi: 10.5281/ZENODO.7327955.
Content References
-----
hash://sha256/2ed02ef8ab52cb51aef6fb42badeb495ba6a87dd6cf11be5f480c7bc1c902054 citations.csv.gz
hash://sha256/00195434368cec79f051ccb69238d2646b53530e4fd42936748428f055fdb0cc citations.tsv.gz
hash://sha256/b8898e7aea05121e7d15948dcc76d4dde6ed330db98f76ebcc4c03ba52622dcc datasets.csv.gz
hash://sha256/b8898e7aea05121e7d15948dcc76d4dde6ed330db98f76ebcc4c03ba52622dcc datasets.tsv.gz
hash://sha256/aa13e6fb98fd3aa4aaeaa89d6dccfd983e542fe010f0ffbb31fa17243f5735e3 dwca-by-study.zip
hash://sha256/4ae323bfc1255f3c6dd60b13a1be237cbfbd1c87aad595f7570e83fc9e84db08 dwca.zip
hash://sha256/0f1328b00c1b44aa19cf677790a0e649ddceb2a4e0babbe251a4af9e032f3dde interactions.csv.gz
hash://sha256/ad0297993328deee5178db4e5fe20135a21dde529f68adab63f8de9a02512514 interactions.nq.gz
hash://sha256/1c8de35d42fb298f1a27f4eb286309e39e6ab768d24d3c3bec1490f23d3594b6 interactions.tsv.gz
hash://sha256/f35ce82bf5c00882e4258edc883b41123f002c1fb9d64485abc101b00cb28e79 neo4j-graphdb.zip
hash://sha256/b002bcb378482a33847725fc52c8e26a42af5c5da9755449d8f0d10c9aa9f7f0 refuted-interactions.csv.gz
hash://sha256/7beb77546aad6e9de756d6161e35f55cfa725072ca77ba5c0b72a00e53146127 refuted-interactions.tsv.gz
hash://sha256/89fa5fc3bdc76451dd5d2a79c1473b437615e5c7e551ec5e57ff8b71e9a280ea refuted-verbatim-interactions.csv.gz
hash://sha256/ea83faba0aa0792cebe055832553197701025fdfe2f07ec34599075819916707 refuted-verbatim-interactions.tsv.gz
hash://sha256/4cf48959ea839e371a0344aab4b31f36242c84ac24e44f4db948524523b3563f taxonCache.tsv.gz
hash://sha256/bf38fe30df535f9e0b6b22fa726c10f35d391d616e6d107cc7582505141fd13d taxonMap.tsv.gz
hash://sha256/ce0d4f35b0970df3fe4e1623e473a5390b39297efae7f9e1474bfe2e8bc15d48 verbatim-interactions.csv.gz
hash://sha256/965718c7a9ec4ec1adc98413b52e31c090ad1ba5a04be088d579c5c9d59ffef0 verbatim-interactions.tsv.gz
hash://md5/ad99f71b8d3e0b67b7d4578a0a123c40 citations.csv.gz
hash://md5/2a27a963e745a12042c6c9886f87f842 citations.tsv.gz
hash://md5/580a4e1cfed5a6235f6c35277d0c7b10 datasets.csv.gz
hash://md5/580a4e1cfed5a6235f6c35277d0c7b10 datasets.tsv.gz
hash://md5/6c7294aa2b507143e10c390ae6008ed1 dwca-by-study.zip
hash://md5/a9694ecc6de81d9893998be05a8ef2de dwca.zip
hash://md5/0415cd469b8892fb3f5435048b6e85bf interactions.csv.gz
hash://md5/1b48bf7a344bdd3c706a94666607cd71 interactions.nq.gz
hash://md5/445b2c97e2e44d2dbc4aa93084ecacfc interactions.tsv.gz
hash://md5/ca3e4780032c8c58e90242bcdf1328d5 neo4j-graphdb.zip
hash://md5/03600b16405fc2a4ea60925d69b6e16f refuted-interactions.csv.gz
hash://md5/f412a50badaccc32b133574af1042d0e refuted-interactions.tsv.gz
hash://md5/e37f340ba251043f10076a933dbc25eb refuted-verbatim-interactions.csv.gz
hash://md5/73dc18c9ae6ff462f45cb11c98f13364 refuted-verbatim-interactions.tsv.gz
hash://md5/8bcde4b9e5610b02321235adea2fe251 taxonCache.tsv.gz
hash://md5/278f5225019ab935a65832b38fb9cc32 taxonMap.tsv.gz
hash://md5/801550fac1f58247e17e24372574edf2 verbatim-interactions.csv.gz
hash://md5/a18697d59e5f6756c22d8c4a1346685e verbatim-interactions.tsv.gz
Files
dwca-by-study.zip
Files
(15.1 GB)
Name | Size | Download all |
---|---|---|
md5:ad99f71b8d3e0b67b7d4578a0a123c40
|
71.9 MB | Download |
md5:2a27a963e745a12042c6c9886f87f842
|
71.7 MB | Download |
md5:580a4e1cfed5a6235f6c35277d0c7b10
|
3.8 kB | Download |
md5:580a4e1cfed5a6235f6c35277d0c7b10
|
3.8 kB | Download |
md5:6c7294aa2b507143e10c390ae6008ed1
|
471.5 MB | Preview Download |
md5:a9694ecc6de81d9893998be05a8ef2de
|
655.2 MB | Preview Download |
md5:0415cd469b8892fb3f5435048b6e85bf
|
2.1 GB | Download |
md5:1b48bf7a344bdd3c706a94666607cd71
|
99.8 MB | Download |
md5:445b2c97e2e44d2dbc4aa93084ecacfc
|
2.0 GB | Download |
md5:ca3e4780032c8c58e90242bcdf1328d5
|
8.1 GB | Preview Download |
md5:89797a5a325ac5c50990581689718edf
|
10.1 kB | Download |
md5:03600b16405fc2a4ea60925d69b6e16f
|
3.5 MB | Download |
md5:f412a50badaccc32b133574af1042d0e
|
3.5 MB | Download |
md5:e37f340ba251043f10076a933dbc25eb
|
474.6 kB | Download |
md5:73dc18c9ae6ff462f45cb11c98f13364
|
474.5 kB | Download |
md5:8bcde4b9e5610b02321235adea2fe251
|
122.9 MB | Download |
md5:278f5225019ab935a65832b38fb9cc32
|
69.7 MB | Download |
md5:801550fac1f58247e17e24372574edf2
|
631.9 MB | Download |
md5:a18697d59e5f6756c22d8c4a1346685e
|
629.4 MB | Download |