Published June 4, 2026 | Version v4

Versioned Archive and Review of Biotic Interactions and Taxon Names Found within globalbioticinteractions/dorey2023 hash://md5/dfc653904fa4d12bc44bf6f9f36a93d1

Authors/Creators

Description

Life on Earth is sustained by complex interactions between organisms and their environment. These biotic interactions can be captured in datasets and published digitally. We present a review and archiving process for such an openly accessible digital interactions dataset of known origin and discuss its outcome. The dataset under review, named globalbioticinteractions/dorey2023, has fingerprint hash://md5/dfc653904fa4d12bc44bf6f9f36a93d1, is 666MiB in size and contains 25,696 interactions with 4 unique types of associations (e.g., hasHost) between 1,125 primary taxa (e.g., Leioproctus leai (Cockerell, 1913)) and 2,192 associated taxa (e.g., Myrtaceae Eucalyptus). This report includes detailed summaries of interaction data, a taxonomic review from multiple catalogs, and an archived version of the dataset from which the reviews are derived.

Technical info

Introduction

Data Review and Archive

Data review and archiving can be a time-consuming process, especially when done manually. This review report aims to help facilitate both activities. It automates the archiving of datasets, including Darwin Core archives, and is a citable backup of a version of the dataset. Additionally, an automatic review of species interaction claims made in the dataset is generated and registered with Global Biotic Interactions (J. H. Poelen, Simons, and Mungall 2014).

This review includes summary statistics and observations about the dataset under review:

Dorey, J.B., Fischer, E.E., Chesshire, P.R. et al. A globally synthesised and flagged bee occurrence dataset and cleaning workflow. Sci Data 10, 747 (2023). https://doi.org/10.1038/s41597-023-02626-w https://github.com/globalbioticinteractions/dorey2023/archive/ee2f9f138a5c4464fc8edc69c39507d20fe4a0f4.zip 2026-06-02T14:45:56.245Z hash://md5/dfc653904fa4d12bc44bf6f9f36a93d1

Methods

The review is performed through programmatic scripts that leverage tools like Preston (Elliott et al. 2025), Elton (Kuhn, Poelen, and Leinweber 2025), Nomer (Salim and Poelen 2025), and globinizer (J. Poelen, Seltmann, and Mietchen 2024), combined with third-party tools like grep, mlr, tail and head.

Tools used in this review process
tool name version
preston 0.11.1
elton 0.16.11
nomer 0.6.5
globinizer 0.4.0
mlr 6.0.0
jq 1.6
yq 4.25.3
pandoc 3.1.6.1
duckdb 1.3.1
mapserver 7.6.4

The review process can be described in the form of the script below 1.

# get versioned copy of the dataset (size approx.  666MiB) under review 
elton pull globalbioticinteractions/dorey2023

# generate review notes
elton review globalbioticinteractions/dorey2023 \
 > review.tsv

# export indexed interaction records
elton interactions globalbioticinteractions/dorey2023 \
 > interactions.tsv

# export names and align them with the Catalogue of Life using Nomer 
elton names globalbioticinteractions/dorey2023 \
 | nomer append col \
 > name-alignment.tsv

or visually, in a process diagram.

Review Process Overview

You can find a copy of the full review script at check-dataset.sh. See also GitHub and Codeberg.

Results

In the following sections, the results of the review are summarized 2. Then, links to the detailed review reports are provided.

Files

An extensive list of files produced as part of the review process can be found in Appendix A. Review Files.

Archived Dataset

Note that the data.zip file in this archive contains the complete, unmodified archived dataset under review.
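
As a minimal sketch, a `hash://md5/...` identifier of the kind quoted in this review can be formed for any local file from its MD5 digest. The helper name `content_id` is hypothetical and GNU `md5sum` is assumed to be available. This report does not state which exact artifact the quoted fingerprint was computed over, so a mismatch against data.zip would not by itself indicate a corrupted download.

```shell
#!/bin/sh
# content_id FILE: print a hash://md5/<hex digest> identifier for FILE.
# Assumes GNU coreutils md5sum; "content_id" is a hypothetical helper name.
content_id() {
  printf 'hash://md5/%s\n' "$(md5sum "$1" | cut -d ' ' -f 1)"
}

# Example: identifier of an empty input (the well-known MD5 of zero bytes).
content_id /dev/null
# → hash://md5/d41d8cd98f00b204e9800998ecf8427e
```

Running `content_id data.zip` would print the identifier of a local copy of the archive in the same notation.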

Biotic Interactions

Biotic Interaction Data Model

In this review, biotic interactions (or biotic associations) are modeled as a primary (aka subject, source) organism interacting with an associate (aka object, target) organism. The dataset under review classified the primary/associate organisms with specific taxa. The kind of interaction is documented as an interaction type.

The dataset under review, named globalbioticinteractions/dorey2023, has fingerprint hash://md5/dfc653904fa4d12bc44bf6f9f36a93d1, is 666MiB in size and contains 25,696 interactions with 4 unique types of associations (e.g., hasHost) between 1,125 primary taxa (e.g., Leioproctus leai (Cockerell, 1913)) and 2,192 associated taxa (e.g., Myrtaceae Eucalyptus).

An exhaustive list of indexed interaction claims can be found in gzipped csv, tsv, geopackage and parquet archives. To facilitate discovery, a preview of the claims, available in the gzipped html page at indexed-interactions.html.gz, is shown below.

The exhaustive list was used to create the data summaries below.

Sample of Indexed Interaction Claims
sourceTaxonName interactionTypeName targetTaxonName referenceCitation
Euryglossa allunga (Exley, 2001) hasHost Proteaceae Banksia prionotes 1996.
Euryglossa atra Exley, 1998 hasHost Myoporaceae Eremophila 1981.
Euryglossa atra Exley, 1998 hasHost Myoporaceae Eremophila 1981.
Euryglossa aureophila Houston, 1992 hasHost Myrtaceae Verticordia aurea 1990.

Most Frequently Mentioned Interaction Types (up to 20 most frequent)
interactionTypeName count
hasHost 16961
interactsWith 8451
adjacentTo 264
visitsFlowersOf 20

Most Frequently Mentioned Primary Taxa (up to 20 most frequent)
sourceTaxonName count
Leioproctus leai (Cockerell, 1913) 2542
Bombus bifarius Cresson, 1878 1303
Bombus flavifrons Cresson, 1863 623
Bombus mixtus Cresson, 1878 442
Bombus huntii Greene, 1860 406
Bombus occidentalis Greene, 1858 383
Hylaeus proximus (Smith, 1879) 362
Bombus centralis Cresson, 1864 355
Lasioglossum erythrurum (Cockerell, 1914) 353
Austronomia flavoviridis (Cockerell, 1905) 308
Lasioglossum urbanum (Smith, 1879) 307
Lasioglossum dotatum (Cockerell, 1912) 301
Bombus vosnesenskii Radoszkowski, 1862 290
Xanthesma furcifera (Cockerell, 1913) 283
Lasioglossum florale (Smith, 1853) 272
Exoneura pictifrons Alfken, 1907 268
Lasioglossum appositum (Rayment, 1939) 252
Lasioglossum vitripenne (Smith, 1879) 237
Bombus hypnorum (Linnaeus, 1758) 216

Most Frequently Mentioned Associate Taxa (up to 20 most frequent)
targetTaxonName count
Myrtaceae Eucalyptus 3394
Myrtaceae Melaleuca 636
Proteaceae Hakea 521
Cirsium sp. 470
Myoporaceae Eremophila 435
Lupinus sp. 386
Trifolium sp. 297
Sapindaceae Atalaya hemiglauca 270
Epilobium parviflorum 267
Epilobium angustifolium 261
Melilotus alba 254
Myrtaceae Kunzea ericoides 245
Campanulaceae Wahlenbergia 236
Symphoricarpos albus 232
Myrtaceae Melaleuca lanceolata 185
Vicia sp. 182
Aster sp. 181
Geranium sp. 170
Mimosaceae Acacia 164

Most Frequent Interactions between Primary and Associate Taxa (up to 20 most frequent)
sourceTaxonName interactionTypeName targetTaxonName count
Leioproctus leai (Cockerell, 1913) hasHost Proteaceae Hakea 306
Xanthesma furcifera (Cockerell, 1913) hasHost Myrtaceae Eucalyptus 283
Bombus bifarius Cresson, 1878 interactsWith Lupinus sp. 199
Bombus occidentalis Greene, 1858 interactsWith Melilotus alba 191
Bombus bifarius Cresson, 1878 interactsWith Trifolium sp. 148
Pachyprosopis xanthodonta (Cockerell, 1913) hasHost Myrtaceae Eucalyptus 134
Euryglossula fultoni (Cockerell, 1913) hasHost Myrtaceae Eucalyptus 133
Bombus huntii Greene, 1860 interactsWith Cirsium sp. 132
Hylaeus proximus (Smith, 1879) hasHost Myrtaceae Eucalyptus 127
Lasioglossum appositum (Rayment, 1939) hasHost Myrtaceae Eucalyptus 121
Hylaeus elegans (Smith, 1853) hasHost Myrtaceae Eucalyptus 119
Bombus bifarius Cresson, 1878 interactsWith Symphoricarpos albus 114
Lasioglossum erythrurum (Cockerell, 1914) hasHost Myrtaceae Eucalyptus 110
Bombus centralis Cresson, 1864 interactsWith Cirsium sp. 100
Leioproctus leai (Cockerell, 1913) hasHost Myrtaceae Eucalyptus 98
Austronomia flavoviridis (Cockerell, 1905) hasHost Myrtaceae Eucalyptus 97
Lasioglossum vitripenne (Smith, 1879) hasHost Myrtaceae Eucalyptus 95
Hylaeus chlorosoma (Cockerell, 1913) hasHost Myrtaceae Eucalyptus 92
Bombus bifarius Cresson, 1878 interactsWith Aster sp. 91
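
Frequency summaries like the ones above can be approximated from a tab-separated export with standard tools. The sketch below is not the review's own implementation: `count_by` is a hypothetical helper built on awk and sort, and the sample rows are a handful of claims copied from the tables above. It assumes the export has a header row with column names such as `interactionTypeName`.

```shell
#!/bin/sh
# count_by COLUMN FILE: tally the values of a named column in a
# tab-separated file with a header row, most frequent first.
# "count_by" is a hypothetical helper, not part of Elton or Nomer.
count_by() {
  awk -F '\t' -v col="$1" '
    NR == 1 { for (i = 1; i <= NF; i++) if ($i == col) idx = i; next }
    idx     { counts[$idx]++ }
    END     { for (k in counts) printf "%s\t%d\n", k, counts[k] }
  ' "$2" | sort -t "$(printf '\t')" -k2,2nr -k1,1
}

# Small sample: a header plus three claims taken from the tables above.
sample="$(mktemp)"
printf '%s\t%s\t%s\n' \
  'sourceTaxonName' 'interactionTypeName' 'targetTaxonName' \
  'Euryglossa atra Exley, 1998' 'hasHost' 'Myoporaceae Eremophila' \
  'Euryglossa aureophila Houston, 1992' 'hasHost' 'Myrtaceae Verticordia aurea' \
  'Bombus bifarius Cresson, 1878' 'interactsWith' 'Lupinus sp.' > "$sample"

count_by interactionTypeName "$sample"
# prints two tab-separated lines: "hasHost 2", then "interactsWith 1"
rm -f "$sample"
```

For the full export, something like `gzip -dc indexed-interactions.tsv.gz | count_by interactionTypeName -` should work, since awk reads standard input when the file argument is `-`.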

Interaction Networks

The figures below provide a graph view of the dataset under review. The first shows a summary network at the kingdom level, and the second shows interactions at the family level. It is important to note that the taxonomic names in both network graphs were first aligned using the Catalogue of Life. Please refer to the original (or verbatim) taxonomic names for a view closer to the source data.

Interactions on the taxonomic kingdom rank as interpreted by the Catalogue of Life (download svg).
Interactions on the taxonomic family rank as interpreted by the Catalogue of Life (download svg).

You can download the indexed dataset under review at indexed-interactions.csv.gz. A tab-separated file can be found at indexed-interactions.tsv.gz.

Geospatial Distribution

If geospatial information was extracted from the dataset under review, the map below shows its distribution. The map was generated using MapServer (McKenna et al. 2025) tools configured via the map configuration indexed-interactions.map:

MAP
  SIZE 1600 800
  EXTENT -180 -90 180 90
  PROJECTION
    "init=epsg:4326"
  END
  LAYER # MODIS WMS map from NASA
    NAME         "modis_nasa"
    TYPE         RASTER
    OFFSITE      0 0 0
    STATUS       ON
    CONNECTIONTYPE WMS
    CONNECTION "https://gibs.earthdata.nasa.gov/wms/epsg4326/best/wms.cgi?"

    METADATA
      "wms_srs" "EPSG:4326"
      "wms_name" "OSM_Land_Water_Map"
      "wms_server_version" "1.1.1"
      "wms_format" "image/jpeg"
    END
    CLASS
      STYLE
        COLOR        232 232 232
        OUTLINECOLOR 32 32 32
      END
    END
  END 
  LAYER
    NAME "indexed-interactions"
    TYPE POLYGON
    STATUS ON
    CONNECTIONTYPE OGR
    CONNECTION "indexed-interactions-h3.gpkg"
    DATA "indexed-interactions-h3"
    CLASS
      STYLE
        COLORRANGE 253.0 231.0 37.0 32.0 164.0 134.0
        DATARANGE 0.3010299956639812 3.1202447955463652
        RANGEITEM "log_number_of_records"
        OUTLINECOLOR 0 0 0
      END
    END
  END
END

Hexagonal grid cells indicate that interaction claims are available for the selected geospatial area: light yellow means relatively fewer claims, dark green relatively more claims.
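
The `DATARANGE` bounds in the map configuration are expressed on a logarithmic scale (`RANGEITEM "log_number_of_records"`). As a rough sketch, and assuming the field holds a plain base-10 logarithm of a record count (an assumption; the exact transformation is not specified in this report), the bounds can be converted back to counts as follows. The helper name `unlog10` is hypothetical.

```shell
#!/bin/sh
# unlog10 VALUE: invert a base-10 logarithm and round to the nearest integer.
# Assumes the mapped attribute is log10(count); "unlog10" is a hypothetical helper.
unlog10() {
  awk -v x="$1" 'BEGIN { printf "%.0f\n", 10 ^ x }'
}

unlog10 0.3010299956639812   # lower DATARANGE bound → 2
unlog10 3.1202447955463652   # upper DATARANGE bound → 1319
```

Under that assumption, the lightest hexagons would correspond to about 2 records per cell and the darkest to about 1319.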

Associated data can be found in the geopackage files at indexed-interactions.gpkg for point data and indexed-interactions-h3.gpkg for data clustered in geospatial H3 hexagons.

Learn more about the structure of this download at the GloBI website, by opening a GitHub issue, or by sending an email.

Another way to discover the dataset under review is by searching for it on the GloBI website.

Taxonomic Alignment

As part of the review, all names are aligned against various name catalogs (e.g., col, ncbi, discoverlife, gbif, itis, wfo, mdd, tpt, pbdb, and worms). These alignments can help review name usage or aid in selecting a suitable taxonomic name resource.

Sample of Name Alignments
providedName relationName resolvedCatalogName resolvedName
Abronia latifolia HAS_ACCEPTED_NAME col Abronia latifolia
Abronia maritima HAS_ACCEPTED_NAME col Abronia maritima
Abronia villosa HAS_ACCEPTED_NAME col Abronia villosa
Acacia berlandieri SYNONYM_OF col Senegalia berlandieri

Distribution of Taxonomic Ranks of Aligned Names by Catalog. Names that were not aligned with a catalog are counted as NAs, so the total number of unaligned names for a catalog is listed in its NA row.
resolvedCatalogName resolvedRank count
col NA 142
col family 108
col genus 208
col species 1570
col subgenus 4
col subspecies 27
col variety 13
discoverlife NA 929
discoverlife species 1121
gbif NA 131
gbif family 109
gbif genus 208
gbif species 1574
gbif subspecies 32
gbif variety 16
itis NA 201
itis family 105
itis genus 196
itis species 1522
itis subspecies 8
itis variety 19
mdd NA 2050
ncbi NA 844
ncbi family 88
ncbi genus 200
ncbi species 912
ncbi subgenus 3
ncbi subspecies 3
ncbi varietas 2
pbdb NA 1869
pbdb family 103
pbdb genus 72
pbdb species 5
pbdb subfamily 1
tpt NA 2049
tpt genus 1
wfo NA 1247
wfo family 97
wfo genus 201
wfo species 489
wfo subspecies 11
wfo variety 11
worms NA 1674
worms family 65
worms genus 132
worms species 167
worms subspecies 6
worms variety 8

Name relationship types per catalog. Name relationship type "NONE" means that a name was not recognized by the associated catalog. "SAME_AS" indicates either a "HAS_ACCEPTED_NAME" or "SYNONYM_OF" name relationship type. We recognize that "SYNONYM_OF" encompasses many types of nomenclatural synonymies.
resolvedCatalogName relationName count
col HAS_ACCEPTED_NAME 2726
col SYNONYM_OF 754
col NONE 154
discoverlife NONE 2196
discoverlife HAS_ACCEPTED_NAME 1142
discoverlife SYNONYM_OF 100
discoverlife HOMONYM_OF 16
gbif HAS_ACCEPTED_NAME 2955
gbif SYNONYM_OF 772
gbif NONE 141
itis HAS_ACCEPTED_NAME 2633
itis SYNONYM_OF 496
itis NONE 237
mdd NONE 3337
ncbi SAME_AS 2110
ncbi SYNONYM_OF 110
ncbi NONE 1132
pbdb NONE 1948
pbdb HAS_ACCEPTED_NAME 1370
pbdb SYNONYM_OF 97
tpt NONE 3336
tpt HAS_ACCEPTED_NAME 1
wfo HAS_ACCEPTED_NAME 1621
wfo SYNONYM_OF 327
wfo NONE 1456
wfo HAS_UNCHECKED_NAME 105
worms NONE 2004
worms HAS_ACCEPTED_NAME 1277
worms SYNONYM_OF 142

List of Available Name Alignment Reports
catalog name alignment results
col associated names alignments (report in gzipped html, csv, and tsv)
ncbi associated names alignments (report in gzipped html, csv, and tsv)
discoverlife associated names alignments (report in gzipped html, csv, and tsv)
gbif associated names alignments (report in gzipped html, csv, and tsv)
itis associated names alignments (report in gzipped html, csv, and tsv)
wfo associated names alignments (report in gzipped html, csv, and tsv)
mdd associated names alignments (report in gzipped html, csv, and tsv)
tpt associated names alignments (report in gzipped html, csv, and tsv)
pbdb associated names alignments (report in gzipped html, csv, and tsv)
worms associated names alignments (report in gzipped html, csv, and tsv)
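
To compare catalogs at a glance, the name relationship counts above can be reduced to the share of alignment rows marked "NONE" per catalog. The following is only a sketch: `none_share` is a hypothetical awk-based helper, and the input rows are the col and tpt counts copied from the table above. The percentages describe alignment rows, not necessarily distinct names.

```shell
#!/bin/sh
# none_share FILE: for each catalog, print the percentage of alignment rows
# whose relation is NONE. Input: catalog<TAB>relation<TAB>count, no header.
# "none_share" is a hypothetical helper name.
none_share() {
  awk -F '\t' '
    { total[$1] += $3; if ($2 == "NONE") none[$1] += $3 }
    END { for (c in total) printf "%s\t%.1f\n", c, 100 * none[c] / total[c] }
  ' "$1" | sort
}

# Counts copied from the col and tpt rows of the relationship table.
rows="$(mktemp)"
printf '%s\t%s\t%s\n' \
  col HAS_ACCEPTED_NAME 2726 \
  col SYNONYM_OF 754 \
  col NONE 154 \
  tpt NONE 3336 \
  tpt HAS_ACCEPTED_NAME 1 > "$rows"

none_share "$rows"
# prints (tab-separated): "col 4.2" and "tpt 100.0"
rm -f "$rows"
```

A low share, as for col here, suggests that most names were recognized by that catalog. A share near 100%, as for tpt, suggests the catalog covers a different taxonomic scope than the dataset.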

Additional Reviews

Elton, Nomer, and other tools may have difficulties interpreting existing species interaction datasets. Or, they may misbehave, or otherwise show unexpected behavior. As part of the review process, detailed review notes are kept that document possibly misbehaving, or confused, review bots. A sample of review notes associated with this review can be found below.

First few lines in the review notes.
reviewDate reviewCommentType reviewComment
2026-06-04T02:32:36Z note target taxon name missing
2026-06-04T02:32:36Z note target taxon name missing
2026-06-04T02:32:36Z note target taxon name missing
2026-06-04T02:32:36Z note target taxon name missing

In addition, you can find the most frequently occurring notes in the table below.

Most frequently occurring review notes, if any.
reviewComment count
target taxon name missing 472701
found unsupported interaction type with name: [foraging on] 1808
issue handling date range [2019/10/30 00:00:00]: The end instant must be greater than the start instant 694
issue handling date range [1985/08/23 00:00:00]: The end instant must be greater than the start instant 642
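
Several of the notes above differ only in the date they mention. As a sketch of one way to group them, the hypothetical helper `group_date_notes` below masks the bracketed date in "issue handling date range" comments and sums the counts per resulting comment. It assumes tab-separated `comment<TAB>count` input and uses the four rows from the table above.

```shell
#!/bin/sh
# group_date_notes: read "comment<TAB>count" lines on stdin, replace the
# bracketed date in "issue handling date range [...]" comments with [*],
# and sum the counts per resulting comment (largest first).
# "group_date_notes" is a hypothetical helper name.
group_date_notes() {
  sed -E 's/^(issue handling date range) \[[^]]*\]/\1 [*]/' |
    awk -F '\t' '
      { sums[$1] += $2 }
      END { for (k in sums) printf "%s\t%d\n", k, sums[k] }
    ' | sort -t "$(printf '\t')" -k2,2nr
}

printf '%s\t%s\n' \
  'target taxon name missing' 472701 \
  'found unsupported interaction type with name: [foraging on]' 1808 \
  'issue handling date range [2019/10/30 00:00:00]: The end instant must be greater than the start instant' 694 \
  'issue handling date range [1985/08/23 00:00:00]: The end instant must be greater than the start instant' 642 |
  group_date_notes
```

With these four rows, the two date range notes collapse into a single line with a count of 1336 (694 + 642), while the other two notes are left unchanged.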

For additional information on review notes, please have a look at the first 500 Review Notes in html format or download the full gzipped csv or tsv archives.

GloBI Review Badge

As part of the review, a review badge is generated. This review badge can be included in webpages to indicate the review status of the dataset under review.

Picture of a GloBI Review Badge 3

Note that if the badge is green, no review notes were generated. If the badge is yellow, the review bots may need some help with interpreting the species interaction data.

GloBI Index Badge

If the dataset under review has been registered with GloBI, and has been successfully indexed by GloBI, the GloBI Index Status Badge will turn green. This means that the dataset under review was indexed by GloBI and is available through GloBI services and derived data products.

Picture of a GloBI Index Badge 4

If you'd like to keep track of reviews or index status of the dataset under review, please visit GloBI's dataset index 5 for badge examples.

Discussion

This review and archive provides a means of creating citable versions of datasets that change frequently. This may be useful for dataset managers, including natural history collection data managers, as a backup archive of a shared Darwin Core archive. It also serves as a means of creating a trackable citation for the dataset in an automated way, while also including some information about the contents of the dataset.

This review aims to provide a perspective on the dataset to aid in understanding the species interaction claims discovered. However, it is important to note that this review does not assess the quality of the dataset. Instead, it serves as an indication of the open-ness 6 and FAIRness (Wilkinson et al. 2016; Trekels et al. 2023) of the dataset: to perform this review, the data was likely openly available, Findable, Accessible, Interoperable and Reusable. The current Open-FAIR assessment is qualitative, and a more quantitative approach can be implemented with specified measurement units.

This report also showcases the reuse of machine-actionable (meta)data, something highly recommended by the FAIR Data Principles (Wilkinson et al. 2016). Making (meta)data machine-actionable allows more precise processing by computers, enabling even naive review bots like Nomer and Elton to interpret the data effectively. This capability is crucial not just for automating the generation of reports, but also for facilitating seamless data exchange and promoting interoperability.

Acknowledgements

We thank the many humans that created us and those who created and maintained the data, software and other intellectual resources that were used for producing this review. In addition, we are grateful for the natural resources providing the basis for these human and bot activities. Also, thanks to https://github.com/zygoballus for helping improve the layout of the review tables.

Author contributions

Nomer was responsible for name alignments. Elton carried out dataset extraction and generated the review notes. Preston tracked, versioned, and packaged the dataset under review.

Appendix A. Review Files

The following files are produced in this review:

filename description
biblio.bib list of bibliographic references of this review
check-dataset.sh data review workflow/process as expressed in a bash script
data.zip a versioned archive of the data under review
HEAD the digital signature of the data under review
index.docx review in MS Word format
index.html review in HTML format
index.md review in Pandoc markdown format
index.pdf review in PDF format
indexed-citations.csv.gz list of distinct reference citations for reviewed species interaction claims in gzipped comma-separated values file format
indexed-citations.html.gz list of distinct reference citations for reviewed species interactions claims in gzipped html file format
indexed-citations.tsv.gz list of distinct reference citations for reviewed species interaction claims in gzipped tab-separated values format
indexed-interactions-col-family-col-family.svg network diagram showing the taxon family to taxon family interaction claims in the dataset under review as interpreted by the Catalogue of Life via Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024)
indexed-interactions-col-kingdom-col-kingdom.svg network diagram showing the taxon kingdom to taxon kingdom interaction claims in the dataset under review as interpreted by the Catalogue of Life via Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024)
indexed-interactions.csv.gz species interaction claims indexed from the dataset under review in gzipped comma-separated values format
indexed-interactions.html.gz species interaction claims indexed from the dataset under review in gzipped html format
indexed-interactions.tsv.gz species interaction claims indexed from the dataset under review in gzipped tab-separated values format
indexed-interactions.parquet species interaction claims indexed from the dataset under review in Apache Parquet format
indexed-interactions.png species interaction claims indexed from the dataset under review plotted on a map
indexed-interactions.map mapserver configuration to plot species interaction claims indexed from the dataset under review on a map
indexed-interactions.gpkg species interaction claims indexed from the dataset under review in GeoPackage format
indexed-interactions-h3.gpkg geospatially clustered h3 species interaction claims indexed from the dataset under review in GeoPackage format
indexed-interactions-sample.csv first 500 species interaction claims indexed from the dataset under review in comma-separated values format
indexed-interactions-sample.html first 500 species interaction claims indexed from the dataset under review in html format
indexed-interactions-sample.tsv first 500 species interaction claims indexed from the dataset under review in tab-separated values format
indexed-names.csv.gz taxonomic names indexed from the dataset under review in gzipped comma-separated values format
indexed-names.html.gz taxonomic names found in the dataset under review in gzipped html format
indexed-names.tsv.gz taxonomic names found in the dataset under review in gzipped tab-separated values format
indexed-names.parquet taxonomic names found in the dataset under review in Apache Parquet format
indexed-names-resolved-col.csv.gz taxonomic names found in the dataset under review aligned with the Catalogue of Life as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-col.html.gz taxonomic names found in the dataset under review aligned with the Catalogue of Life as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-col.tsv.gz taxonomic names found in the dataset under review aligned with the Catalogue of Life as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-col.parquet taxonomic names found in the dataset under review aligned with the Catalogue of Life as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-discoverlife.csv.gz taxonomic names found in the dataset under review aligned with Discover Life bee species checklist as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-discoverlife.html.gz taxonomic names found in the dataset under review aligned with Discover Life bee species checklist as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-discoverlife.tsv.gz taxonomic names found in the dataset under review aligned with Discover Life bee species checklist as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-discoverlife.parquet taxonomic names found in the dataset under review aligned with Discover Life bee species checklist as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-gbif.csv.gz taxonomic names found in the dataset under review aligned with GBIF Backbone Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-gbif.html.gz taxonomic names found in the dataset under review aligned with GBIF Backbone Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-gbif.tsv.gz taxonomic names found in the dataset under review aligned with GBIF Backbone Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-gbif.parquet taxonomic names found in the dataset under review aligned with GBIF Backbone Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-itis.csv.gz taxonomic names found in the dataset under review aligned with Integrated Taxonomic Information System (ITIS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-itis.html.gz taxonomic names found in the dataset under review aligned with Integrated Taxonomic Information System (ITIS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-itis.tsv.gz taxonomic names found in the dataset under review aligned with Integrated Taxonomic Information System (ITIS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-itis.parquet taxonomic names found in the dataset under review aligned with Integrated Taxonomic Information System (ITIS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-mdd.csv.gz taxonomic names found in the dataset under review aligned with the Mammal Diversity Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-mdd.html.gz taxonomic names found in the dataset under review aligned with the Mammal Diversity Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-mdd.tsv.gz taxonomic names found in the dataset under review aligned with the Mammal Diversity Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-mdd.parquet taxonomic names found in the dataset under review aligned with the Mammal Diversity Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-ncbi.csv.gz taxonomic names found in the dataset under review aligned with the NCBI Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-ncbi.html.gz taxonomic names found in the dataset under review aligned with the NCBI Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-ncbi.tsv.gz taxonomic names found in the dataset under review aligned with the NCBI Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-ncbi.parquet taxonomic names found in the dataset under review aligned with the NCBI Taxonomy as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-pbdb.csv.gz taxonomic names found in the dataset under review aligned with the Paleobiology Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-pbdb.html.gz taxonomic names found in the dataset under review aligned with the Paleobiology Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-pbdb.tsv.gz taxonomic names found in the dataset under review aligned with the Paleobiology Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-pbdb.parquet taxonomic names found in the dataset under review aligned with the Paleobiology Database as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-tpt.csv.gz taxonomic names found in the dataset under review aligned with the Terrestrial Parasite Tracker (TPT) Taxonomic Resource as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-tpt.html.gz taxonomic names found in the dataset under review aligned with the Terrestrial Parasite Tracker (TPT) Taxonomic Resource as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-tpt.tsv.gz taxonomic names found in the dataset under review aligned with the Terrestrial Parasite Tracker (TPT) Taxonomic Resource as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-tpt.parquet taxonomic names found in the dataset under review aligned with the Terrestrial Parasite Tracker (TPT) Taxonomic Resource as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-wfo.csv.gz taxonomic names found in the dataset under review aligned with the World Flora Online as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-wfo.html.gz taxonomic names found in the dataset under review aligned with the World Flora Online as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-wfo.tsv.gz taxonomic names found in the dataset under review aligned with the World Flora Online as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-wfo.parquet taxonomic names found in the dataset under review aligned with the World Flora Online as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-resolved-worms.csv.gz taxonomic names found in the dataset under review aligned with the World Register of Marine Species (WoRMS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped comma-separated values format
indexed-names-resolved-worms.html.gz taxonomic names found in the dataset under review aligned with the World Register of Marine Species (WoRMS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped html format
indexed-names-resolved-worms.tsv.gz taxonomic names found in the dataset under review aligned with the World Register of Marine Species (WoRMS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in gzipped tab-separated values format
indexed-names-resolved-worms.parquet taxonomic names found in the dataset under review aligned with the World Register of Marine Species (WoRMS) as accessed through the Nomer Corpus of Taxonomic Resources (J. H. (ed. ). Poelen 2024) in Apache Parquet format
indexed-names-sample.csv first 500 taxonomic names found in the dataset under review in comma-separated values format
indexed-names-sample.html first 500 taxonomic names found in the dataset under review in html format
indexed-names-sample.tsv first 500 taxonomic names found in the dataset under review in tab-separated values format
interaction.svg diagram summarizing the data model used to index species interaction claims
nanopub-sample.trig first 500 species interaction claims as expressed in the nanopub format (Kuhn and Dumontier 2014)
nanopub.trig.gz species interaction claims as expressed in the nanopub format (Kuhn and Dumontier 2014)
process.svg diagram summarizing the data review processing workflow
prov.nq origin of the dataset under review as expressed in rdf/nquads
review.csv.gz review notes associated with the dataset under review in gzipped comma-separated values format
review.html.gz review notes associated with the dataset under review in gzipped html format
review.tsv.gz review notes associated with the dataset under review in gzipped tab-separated values format
review-sample.csv first 500 review notes associated with the dataset under review in comma-separated values format
review-sample.html first 500 review notes associated with the dataset under review in html format
review-sample.tsv first 500 review notes associated with the dataset under review in tab-separated values format
review.svg a review badge generated as part of the dataset review process
zenodo.json metadata of this review expressed in the Zenodo record metadata format
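
The gzipped tab-separated files listed above can be inspected with standard tooling. The following is a minimal sketch in Python, assuming that a file such as indexed-names-resolved-worms.tsv.gz has been downloaded locally and that its first line is a header row (an assumption; the column names are not documented here, so none are hard-coded). The helper name read_gzipped_tsv is hypothetical and not part of Elton, Nomer, or Preston.

```python
import csv
import gzip


def read_gzipped_tsv(path, limit=None):
    """Return (header, rows) from a gzipped tab-separated file.

    Rows are dicts keyed by the header found in the file itself.
    Assumption: the first line of the file is a header row.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as handle:
        # QUOTE_NONE keeps quote characters inside values untouched,
        # which suits plain tab-separated data.
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        rows = []
        for index, row in enumerate(reader):
            if limit is not None and index >= limit:
                break
            rows.append(row)
        return list(reader.fieldnames or []), rows
```

For example, `header, rows = read_gzipped_tsv("indexed-names-resolved-worms.tsv.gz", limit=5)` would return the column names and the first five records, provided the file is present in the working directory. Reading lazily with a limit avoids loading a large file fully into memory.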

References

Elliott, Michael, Jorrit Poelen, Icaro Alzuru, Emilio Berti, and partha04patel. 2025. "Bio-Guoda/Preston: 0.10.5." Zenodo. https://doi.org/10.5281/zenodo.14662206.
ICZN. 1999. "International Code of Zoological Nomenclature." The International Trust for Zoological Nomenclature, London, UK. https://www.iczn.org/the-code/the-code-online/.
Kuhn, Tobias, and Michel Dumontier. 2014. "Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data." In The Semantic Web: Trends and Challenges, edited by Valentina Presutti, Claudia d'Amato, Fabien Gandon, Mathieu d'Aquin, Steffen Staab, and Anna Tordai, 395–410. Cham: Springer International Publishing.
Kuhn, Tobias, Jorrit Poelen, and Katrin Leinweber. 2025. "Globalbioticinteractions/Elton: 0.15.1." Zenodo. https://doi.org/10.5281/zenodo.14927734.
McKenna, Jeff, Steve Lime, Thomas Bonfort, Jérome Boué, Howard Butler, Seth Girvin, Tom Kralidis, et al. 2025. "MapServer." Zenodo. https://doi.org/10.5281/zenodo.17807263.
Poelen, Jorrit H. (ed.). 2024. "Nomer Corpus of Taxonomic Resources hash://sha256/b60c0d25a16ae77b24305782017b1a270b79b5d1746f832650f2027ba536e276 hash://md5/17f1363a277ee0e4ecaf1b91c665e47e." Zenodo. https://doi.org/10.5281/zenodo.12695629.
Poelen, Jorrit H., James D. Simons, and Chris J. Mungall. 2014. "Global Biotic Interactions: An Open Infrastructure to Share and Analyze Species-Interaction Datasets." Ecological Informatics 24 (November): 148–59. https://doi.org/10.1016/j.ecoinf.2014.08.005.
Poelen, Jorrit, Katja Seltmann, and Daniel Mietchen. 2024. "Globalbioticinteractions/Globinizer: 0.4.0." Zenodo. https://doi.org/10.5281/zenodo.10647565.
Salim, José Augusto, and Jorrit Poelen. 2025. "Globalbioticinteractions/Nomer: 0.5.15." Zenodo. https://doi.org/10.5281/zenodo.14893840.
Trekels, Maarten, Debora Pignatari Drucker, José Augusto Salim, Jeff Ollerton, Jorrit Poelen, Filipi Miranda Soares, Max Rünzel, Muo Kasina, Quentin Groom, and Mariano Devoto. 2023. "WorldFAIR Project (D10.1) Agriculture-related pollinator data standards use cases report." Zenodo. https://doi.org/10.5281/zenodo.8176978.
Wilkinson, Mark D., Michel Dumontier, IJsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton, Arie Baak, Niklas Blomberg, et al. 2016. "The FAIR Guiding Principles for Scientific Data Management and Stewardship." Scientific Data 3 (1). https://doi.org/10.1038/sdata.2016.18.
  1. Note that you first have to get the data (e.g., via elton pull globalbioticinteractions/dorey2023) before you can generate reviews (e.g., elton review globalbioticinteractions/dorey2023), extract interaction claims (e.g., elton interactions globalbioticinteractions/dorey2023), or list taxonomic names (e.g., elton names globalbioticinteractions/dorey2023).

  2. Disclaimer: The results in this review should be considered friendly, yet naive, notes from an unsophisticated robot. Please keep that in mind when considering the review results.

  3. The up-to-date status of the GloBI Review Badge can be retrieved from the GloBI Review Depot.

  4. The up-to-date status of the GloBI Index Badge can be retrieved from GloBI's API.

  5. At the time of writing (2026-06-04), the version of the GloBI dataset index was available at https://globalbioticinteractions.org/datasets.

  6. According to http://opendefinition.org/: "Open data is data that can be freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike."

Files

data.zip

Files (368.9 MB)

Checksum Size
md5:5a0f74338ea87192a688f360c217e90f 8.4 kB
md5:d7d6604b3ecdd7f1f9edf6e5fc799f7c 86.0 kB
md5:d1b1b6513c5c21e4b5fc7e02e008e1b7 99.1 MB
md5:9b62c1f8707fe93ce7a9d5ec2f8d71db 44 Bytes
md5:1703d56335a66895e3cc8c0534206ced 404.1 kB
md5:560ff4c14d66f6b36fd4a2c80b852683 488.0 kB
md5:958f0f18af9a6484fcf90b5223a3fd1b 41.9 kB
md5:39f3a50b8aa530e015cf8ad987cd8752 466.2 kB
md5:333472ab8d33525a951d13148b86fb0f 84.5 kB
md5:83d77ec5fc5d49f0b5e3e4c284b20f26 928 Bytes
md5:7030862a8f329c0021bd3b4938a32b69 171.3 kB
md5:7bca6ee7574e94b7f71ebc21a10c378a 927 Bytes
md5:37fbeccfce3893fde9f1f6b43d244222 583 Bytes
md5:37fbeccfce3893fde9f1f6b43d244222 583 Bytes
md5:fc9f7842e84ffe14ae6ad7c427311812 172.0 kB
md5:d6fe6d4a2e208a9942e6e1f89eb5ced2 428.5 kB
md5:c19f46d8d156c0918e8e5edb0738cb48 323.0 kB
md5:434049b542e8699ad82014eb57995734 425.5 kB
md5:89e5af4381571d1b1fefdb9044d04517 881.5 kB
md5:8df349a71027b9ca5132d9120e71be7e 12.2 MB
md5:c74e062bccd46f134c275396e1834155 173.7 kB
md5:23b667b5d2403d20767a53bd7a41d552 983 Bytes
md5:07c0d65999be16c701f389d3b0372267 823.9 kB
md5:5c7c19c4840828e9236cfc11d0164f1a 305.0 kB
md5:d7eab5d3cc8c61ac9520613bdbb7001a 877.7 kB
md5:7d3c9bfeecae682b2252e574a7198bce 149.4 kB
md5:e9bafe2dfbea164441c4f7c7496cc4bf 179.7 kB
md5:e8d84da93fc7ea6296049bc69cce00f4 294.4 kB
md5:fc88b0d4f4df1f89e1b73701a19cf86c 148.7 kB
md5:b7a3cfcc2ad77e8aa7f989b0b3c7bb76 72.5 kB
md5:e0233e42856ddd4c9367a78d59097c63 177.4 kB
md5:58b79936f7d8b0f0e94f178c553c011e 192.3 kB
md5:a4f2df93b5997a148fa994ef756ad157 72.0 kB
md5:e5dc54bac501bfbdfd5c4a851f302d17 117.7 kB
md5:2ae9d1ec923afd7eaf273cc1fe6f336f 178.8 kB
md5:63f60ff2f00f6ee1ddd7de54864a6ba5 247.6 kB
md5:e49bee1bd6780281bb9342cc6916b455 116.8 kB
md5:8302668a97c1786f18047e18ffe5e761 128.6 kB
md5:e99c10534a73f00cccad69b5c109fe85 179.3 kB
md5:b4b4b9ceff0df712fe63a452a6ccfc70 276.1 kB
md5:6fbc23dcfa52badd502353fec69e189a 128.0 kB
md5:b470a97f2b92ab348ff35ef6dc34bd69 38.6 kB
md5:8ddf4959a4fb9a2d7099df2697989281 175.3 kB
md5:6a8efc236b66b3fae5147dd15aca4f41 88.3 kB
md5:31da6db7c52045ac1aeca5ba259aaf38 38.6 kB
md5:cce03cba3b749f0fd8ac87708f65c6ef 110.2 kB
md5:59878d0d9c32796ab58ef6396a1420c7 177.5 kB
md5:7d7b8179759d7ce06d30eb080dca556f 221.2 kB
md5:1b0337d9bb4aacec8af1fb006443d05e 110.2 kB
md5:c01e20e015ae0aec8035a2ca74354145 74.3 kB
md5:938763f8b0f5e5527114ae641e89b4d2 176.1 kB
md5:8ead347a8368f933f0478205b21a2257 119.1 kB
md5:55fc41d4608f1b9fcfbad64a7dc577f3 74.0 kB
md5:2681a76d22c5139630bdd85cf5533ee6 38.7 kB
md5:1b567a84840a7a3444bbf24466f023f6 175.3 kB
md5:13743e89243259c2dc2b33b3706a3b9d 89.3 kB
md5:333cb9af56465906109b7200f29b7773 38.7 kB
md5:db998256dfca8bd5a912f13c59a80b86 85.8 kB
md5:081b3353d9f7e65a15ca2f9825a3736e 177.7 kB
md5:57c5fbbb37b316551be4c81fa2440396 168.9 kB
md5:09db6d4b141cbe6896096ccdc4d81377 85.8 kB
md5:aa3784a85caf44cd4ce89e6cc848a8fe 80.2 kB
md5:1157279a63d04221ee2480ffc9203e48 176.7 kB
md5:b0a19c1b42d72072f54e802e8b1c44a5 146.3 kB
md5:f611bbf2583cc7bdc3fc1f6f85d404f3 80.0 kB
md5:a563e96cd12ab78424cd800e241ff7bb 895.3 kB
md5:f685904ebf7e8b9f7d59e8742fef538a 198.8 kB
md5:c3b0a0c1970a1492a73a9ea5443d6c0d 1.1 MB
md5:90716639be9ae51894f80c9d5181de77 891.6 kB
md5:b10dfaa839aac58390316450f3cb8d4f 259.8 kB
md5:9425ecc5c20bdb55111949088f1522b3 400.1 kB
md5:9841c4102c0b7adc23d56f2e40e5ba2c 258.3 kB
md5:ae0296d3a6de14242a714223c540601c 46.3 kB
md5:2b0736353a04af0d48203bf1b7bd93c5 177.2 kB
md5:e347a91e183d4614f0ecc88e627bb962 82.1 kB
md5:696c2964d5c251c4427892a3e98548dc 46.0 kB
md5:49f211b50b6fc3cfed8df03984afb4c8 3.1 kB
md5:8b249b0266d2f6227f58e5bda62ff41f 1.8 kB
md5:41dfccaee928e7a7495ef484472c9a8f 1.4 MB
md5:5fa2de20e16beb7cac98ec915383dbcf 4.4 kB
md5:dfc653904fa4d12bc44bf6f9f36a93d1 15.2 kB
md5:25dd50a404943e9e53c40d2323feb836 3.6 MB
md5:ec90fcd01d24580f3e5d9273bbeecfaf 5.0 MB
md5:d2e694e5a3c113772940c1afd7739440 3.3 MB
md5:8beb90cdcc46b6f442da67e7cf13349e 117.2 MB
md5:57682cadf75395600726415f0deccbb8 281.8 kB
md5:a7634fc96fc0ecfba160d6a5b4536f55 867 Bytes
md5:dd8fca570ecc1e9ac9fe00ed89f3f877 110.8 MB
md5:3318b49d18a1b6367354debbd23b7710 76.3 kB
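
The md5 checksums in the file listing above make it possible to confirm that a downloaded copy is intact. The following is a minimal sketch in Python using only the standard library. The helper names md5_of_file and matches_listing are hypothetical, and the sketch assumes the expected value is given in the md5:<hex digest> form used in the listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a local file against a listing entry such as 'md5:5a0f...'.

    Assumption: 'listed' uses the 'md5:<hex digest>' form shown in the
    file listing; any other scheme raises ValueError.
    """
    scheme, _, expected = listed.strip().partition(":")
    if scheme.lower() != "md5" or not expected:
        raise ValueError(f"unsupported checksum entry: {listed!r}")
    return md5_of_file(path) == expected.lower()
```

For example, `matches_listing("review.svg", "md5:...")` would return True only if the local file's digest equals the listed one; the file name and the elided digest here are placeholders to be filled in from the record. MD5 is adequate for detecting accidental corruption, although it is not collision-resistant against deliberate tampering.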
