There is a newer version of the record available.

Published October 19, 2024 | Version v3
Dataset Open

Global monthly catch of tuna, tuna-like and shark species (1950-2021) by 1° or 5° squares (IRD level 2)

  • 1. Institut de Recherche pour le Développement
  • 1. Indian Ocean Tuna Commission
  • 2. International Commission for the Conservation of Atlantic Tunas
  • 3. Western and Central Pacific Fisheries Commission
  • 4. Inter-American Tropical Tuna Commission
  • 5. Commission for the Conservation of Southern Bluefin Tuna

Description

This dataset lists global catches of tuna, tuna-like, and shark species from 1950 to 2021. Catches are stratified by month, species, gear type, fishing fleet, fishing mode (i.e., type of school used), area (1° or 5° squares), and unit of catch (weight or number). This dataset was computed using public domain catch datasets released by FIRMS.

IRD Level 2 refers to the processes applied by the French National Research Institute for Sustainable Development (IRD) to primary datasets, generating this refined dataset.

This work aims not only to provide a biomass dataset for scientific purposes but also to identify and address data inconsistencies, improving the overall process. All code, materials, and packages used are available on the GitHub repository, firms-gta/geoflow-tunaatlas, along with detailed documentation on the impact of each specific treatment.

Warning: This dataset is designed to enhance the understanding of fish counts at level 0. It is not suitable for accurately georeferencing data by country or fishing fleet and should not be used for studies on fishing zone legality or quota management. While it offers a georeferenced footprint of captures to reflect reported biomass more closely, significant uncertainty remains regarding the precise locations of the catches.

Global level 2 processing includes the conversion and raising of georeferenced catch data to match nominal dataset values.

Appendix work:

  1. The github repository (DOI:10.5281/zenodo.14039665 Allowing to reproduce this dataset)
  2. A Shiny app has been created to easily visualize catch data in CWP format: ghcr.io/firms-gta/tunaatlas_pie_map_shiny_cwp_database:latest. The docker image is based on this DOI dataset and allows to explore it.
  3. Data paper incoming.

If you are interested in creating a customized version of this Global Tuna Atlas with specific filters or adjustments based on particular issues, please feel free to reach out to us. 

 

Notes

Major differences from previous work:

For level 2: 

  • Catches in tons, raised to match nominal values, now consider the geographic area of the nominal data for improved accuracy.
  • Captures in "Number of fish" are converted to weight based on nominal data. The conversion factors used in the previous version are no longer used, as they did not adequately represent the diversity of captures.
  • Number of fish without corresponding data in nominal are not removed as they were before, creating a huge difference for this measurement_unit between the two datasets.
  • Nominal data from WCPFC includes fishing fleet information, and georeferenced data has been raised based on this instead of solely on the triplet year/gear/species, to avoid random reallocations.
  • Strata for which catches in tons are raised to match nominal data have had their numbers removed.
  • Raising only applies to complete years to avoid overrepresenting specific months, particularly in the early years of georeferenced reporting.
  • Strata where georeferenced data exceed nominal data have not been adjusted downward, as it is unclear if these discrepancies arise from missing nominal data or different aggregation methods in both datasets.
  • The data is not aggregated to 5-degree squares and thus remains unharmonized spatially. Aggregation can be performed using CWP codes for geographic identifiers. For example, an R function is available: source("https://raw.githubusercontent.com/firms-gta/geoflow-tunaatlas/master/sardara_functions/transform_cwp_code_from_1deg_to_5deg.R")

Level 0 dataset has been modified creating differences in this new version notably : 

  • The species retained are different; only 32 major species are kept.
  • Mappings have been somewhat modified based on new standards implemented by FIRMS.
  • New rules have been applied for overlapping areas.
  • Data is only displayed in 1 degrees square area and 5 degrees square areas.

The data is enriched with "Species group", "Gear labels" using the fdiwg standards.

These main differences are recapped in the Differences_v2018_v2024.zip

Recommendations:

To avoid converting data from number using nominal stratas, we recommend the use of conversion factors which could be provided by tRFMOs.

In some strata, nominal data appears higher than georeferenced data, as observed during level 2 processing. These discrepancies may result from errors or differences in aggregation methods. Further analysis will examine these differences in detail to refine treatments accordingly. A summary of differences by tRFMOs, based on the number of strata, is included in the appendix.

Some nominal data have no equivalent in georeferenced data and therefore cannot be disaggregated. What could be done is to check for each nominal data without equivalence if a georeferenced data exists in different buffers, and to average the distribution of this footprint. Then, disaggregate the nominal data based on the georeferenced data. This would lead to the creation of data (approximately 3%), and would necessitate reducing/removing all georeferenced data without a nominal equivalent or with a lesser equivalent. Tests are currently being conducted with and without this. It would help improve the biomass captured footprint but could lead to unexpected discrepancies with current datasets.

Files

global_catch_tunaatlasird_level2 (without_geom).csv

Files (976.6 MB)

Name Size Download all
md5:bcc047645701429fa18db9632b201f1b
6.4 MB Preview Download
md5:3e78c901f4572e0052614ef0a8875fa2
21.4 kB Preview Download
md5:4acce608871235ff8ec58956f51b2d27
40.9 kB Preview Download
md5:7fe1721fb7b2efda91b2846e821f9845
900.2 MB Preview Download
md5:0edd7e2af65701d73bbc62e72065480f
53.8 MB Download
md5:1e6334f9ca1d6d43ae3fc699d8a8bc2c
16.2 MB Preview Download

Additional details

Related works

Is described by
Computational notebook: 10.5281/zenodo.14025108 (DOI)
Requires
Dataset: 10.5281/zenodo.11410529 (DOI)
Dataset: 10.5281/zenodo.11459367 (DOI)
Workflow: 10.5281/zenodo.14039665 (DOI)

Software

Repository URL
https://github.com/firms-gta/geoflow-tunaatlas
Programming language
R
Development Status
Active