Global monthly tuna, tuna-like and shark catch (levels 1-2) and fishing effort (level 0) datasets (1950-2024) at 1° and 5° spatial resolution

Grasset, Bastien; Barde, Julien; Paul Taconet

doi:10.5281/zenodo.20019715

Published May 4, 2026 | Version v6

Dataset Open

Global monthly tuna, tuna-like and shark catch (levels 1-2) and fishing effort (level 0) datasets (1950-2024) at 1° and 5° spatial resolution

1. Institut de Recherche pour le Développement
2. Marine Biodiversity Exploitation and Conservation
3. Marine Biodiversity, Exploitation and Conservation (Sète)

Contributors

Data collector (5):

1. Indian Ocean Tuna Commission
2. International Commission for the Conservation of Atlantic Tunas
3. Western and Central Pacific Fisheries Commission
4. Inter-American Tropical Tuna Commission
5. Commission for the Conservation of Southern Bluefin Tuna

This deposit contains various datasets describing tuna fisheries activities (currently catches and efforts) and different levels of processing on 1° or 5° spatial grids with a monthly temporal resolution.

Notes

For level 2 catch:

Catches and number raised to nominal are only raised to exactly matching stratas or if not existing, to a strata corresponding with major probability to be the one corresponding as every catches in georeferenced strata should be match with a nominal catch
Issues in matching due to aggregations in species name or in gear type are solved in the v6 version.
IATTC Purse seine catch-and-effort are available in 3 separate files according to the group of species: tuna, billfishes, sharks. This is due to the fact that PS data is collected from 2 sources: observer and fishing vessel logbooks. Observer records are used when available, and for unobserved trips logbooks are used. Both sources collect tuna data but only observers collect shark and billfish data. As an example, a strata may have observer effort and the number of sets from the observed trips would be counted for tuna and shark and billfish. But there may have also been logbook data for unobserved sets in the same strata so the tuna catch and number of sets for a cell would be added. This would make a higher total number of sets for tuna catch than shark or billfish. Efforts in the billfish and shark datasets might hence represent only a proportion of the total effort allocated in some strata since it is the observed effort, i.e. for which there was an observer onboard. As a result, catch in the billfish and shark datasets might represent only a proportion of the total catch allocated in some strata. Hence, shark and billfish catch were raised to the fishing effort reported in the tuna dataset. (new feature in v4, was done in Firms Level 0 before)
Data with resolution of 10degx10deg is removed, it is considered to disaggregate it in next versions.
Catches in tons, raised to match nominal values, now consider the geographic area of the nominal data for improved accuracy. (as v5)
Captures in "Number of fish" are converted to weight based on nominal data. (as v5)
Number of fish without corresponding data in nominal are not removed as they were before, creating a huge difference for this measurement_unit between the two datasets. (as v5)
Strata for which catches in tons are raised to match nominal data have had their numbers removed. (as v5)
Raising only applies to complete years to avoid overrepresenting specific months, particularly in the early years of georeferenced reporting. (as v5)
Strata where georeferenced data exceed nominal data have not been adjusted downward, as it is unclear if these discrepancies arise from missing nominal data or different aggregation methods in both datasets. (as v5)
The data is not aggregated to 5-degree squares and thus remains unharmonized spatially. Aggregation can be performed using CWP codes for geographic identifiers. For example, an R function is available: source("https://raw.githubusercontent.com/firms-gta/geoflow-tunaatlas/master/sardara_functions/transform_cwp_code_from_1deg_to_5deg.R") (as v5)

Level 0 dataset has been modified creating differences in this new version notably :

The species retained are different; only 32 major species are kept.
Mappings have been somewhat modified based on new standards implemented by FIRMS.
New rules have been applied for overlapping areas.
Data is only displayed in 1 degrees square area and 5 degrees square areas.

Level 1 dataset has been introduced, this dataset convert number to tons for 5 major species which are the only ones reatained : Yellowfin tuna, Skipjack tuna, Bigeye tuna, Albacore, Southern bluefin tuna, Swordfish :

The conversion factors used for IOTC data is a dataset provided by IOTC data managers.
The conversion factors used for the other tRFMOs is an historical dataset created by Alain fonteneau working with IRD.

Recommendations:

In some strata, nominal data appears higher than georeferenced data, as observed during level 2 processing. These discrepancies may result from differences in aggregation methods. Further analysis with data providers should examine these differences in detail to refine treatments accordingly.

For level 0 effort :

In some datasets, those from ICCAT and the purse seine (PS) data from WCPFC, same effort data has been reported multiple times by using different units which have been kept as is, since no official mapping allows conversion between these units. As a result, users have be remind that some ICCAT and WCPFC effort data are deliberately duplicated :

in the case of ICCAT data, lines with identical strata but different effort units are duplicates reporting the same fishing activity with different measurement units. It is indeed not possible to infer strict equivalence between units, as some contain information about others (e.g., Hours.FAD and Hours.FSC may inform Hours.STD).
in the case of WCPFC data, effort records were also kept in all originally reported units. Here, duplicates do not necessarily share the same “fishing_mode”, as SETS for purse seiners are reported with an explicit association to fishing_mode, while DAYS are not. This distinction allows SETS records to be separated by fishing mode, whereas DAYS records remain aggregated. Some limited harmonization—particularly between units such as NET-days and Nets—has not been implemented in the current version of the dataset, but may be considered in future releases if a consistent relationship can be established.

Methods

Lower levels of processing have been officially endorsed by FIRMS and are also published on Zenodo : see FIRMS Global Tuna Atlas datasets. Currently, FIRMS datasets only deal with catches and Level 0 data (a global dataset which remains as close as possible from datasets published on tuna RFMOs Website), including a lower spatio-temporal resolution dataset which gives the best estimates of total catches (nominal catches, per year and per ocean).

Data structure

All Global Tuna Atlas datasets comply with a common data format in line with CWP Reference Harmonization standard (https://www.fao.org/3/cc6734en/cc6734en.pdf) which is described in a json file (https://github.com/fdiwg/fdi-formats/blob/main/cwp_rh_generic_gta_taskI.json).

Global Catch dataset (IRD level 2)

IRD Level 2 denotes the series of processing steps applied by the French National Research Institute for Sustainable Development (IRD) to generate this dataset from the primary RFMO catch-and-effort data. Although some steps mirror those used in the FIRMS Level 0 product (DOI: https://doi.org/10.5281/zenodo.5745958), the entire workflow was rerun to integrate early adjustments to IATTC shark and billfish data prior to final aggregation.

This dataset compiles monthly global catch data for tuna, tuna-like species and sharks from 1950 through 2023. Catches are stratified according to the latest CWP standards update :

- month

- species

- gear_type (reporting fishing_gear)

- fishing_fleet (reporting country)

- fishing_mode (type of school used)

- geographic_identifier (1° or 5° grid cell)

- measurement_unit i.e. unit of catch (weight or number)

- measurement (catch)

- measurement_type (landings or retained catches)

- measurement_processing_level (original samples or processed data)

- a `label` column has been added for each field (e.g. `fishing_mode`, `species`, `gear_type`, etc.) to provide clear descriptive metadata

This work aims not only to provide a biomass dataset for scientific purposes but also to identify and address data inconsistencies, improving the overall process. All code, materials, and packages used are available on the GitHub repository, firms-gta/geoflow-tunaatlas, along with detailed documentation on the impact of each specific treatment.

Warning: This dataset is designed to enhance the understanding of fish counts at level 0, and the amount of georeferenced data. It is not suitable for accurately georeferencing data by country or fishing fleet and should not be used for studies on fishing zone legality or quota management. While it offers a georeferenced footprint of captures to reflect reported biomass more closely, significant uncertainty remains regarding the precise locations of the catches.

Global level 2 processing includes the conversion and raising of georeferenced catch data to match nominal dataset values.

Global Effort dataset (IRD Level 0)

We compiled a comprehensive dataset of geo-referenced fishing effort observations from global tuna fisheries, covering the period from 1950 to 2024. These data are collected from the public domain datasets released by the five tuna Regional Fisheries Management Organizations (t-RFMOs): CCSBT, IATTC, ICCAT, IOTC, and WCPFC. As with the catch dataset, the effort data were processed by using the same data generation workflow as the one used for FIRMS-GTA with a different parametrization complying with the standardized data structure promoted by the Coordinating Working Party (CWP) standards for (tuna) fisheries statistics.

Contrariwise to catches, effort values are reported using a significant number of measurement units (23). Only a few mapping between similar tRFMOs units has been managed based on fdiwg codelists (see GitHub repository: https://github.com/fdiwg/fdi-mappings). Each remaining unit reflects different operational aspects depending on the fishing gear, fleet behavior, and the reporting RFMO. The Level 0 global dataset includes all reported units without conversion or aggregation, to preserve the original semantic richness and reflect the heterogeneity in reporting practices.

This IRD Level 0 global effort dataset thus, preserves all original effort records from t-RFMOs and complies with a unified data structure while maintaining the granularity and diversity of reporting. This level of processing is not a standardized or simplified effort dataset. No upper level of processing is currently made available by IRD. Any further aggregation or transformation of effort data should be conducted by the end-user, based on specific scientific goals and with careful consideration of the semantics behind each unit.

Both datasets are enriched with "gear_type_label", "fishing_fleet label", for catch, "species_group" using the FDIWG standards and for efforts "measurement_unit_labels".

Global Catch dataset (IRD Level 1)

We compiled a comprehensive dataset of geo-referenced fishing effort observations from global tuna fisheries, covering the period from 1950 to 2024. These data are collected from the public domain datasets released by the five tuna Regional Fisheries Management Organizations (t-RFMOs): CCSBT, IATTC, ICCAT, IOTC, and WCPFC. As with the catch dataset, the effort data were processed by using the same data generation workflow as the one used for FIRMS-GTA with a different parametrization complying with the standardized data structure promoted by the Coordinating Working Party (CWP) standards for (tuna) fisheries statistics.

Contrariwise to catches, effort values are reported using a significant number of measurement units (23). Only a few mapping between similar tRFMOs units has been managed based on fdiwg codelists (see GitHub repository: https://github.com/fdiwg/fdi-mappings). Each remaining unit reflects different operational aspects depending on the fishing gear, fleet behavior, and the reporting RFMO. The Level 0 global dataset includes all reported units without conversion or aggregation, to preserve the original semantic richness and reflect the heterogeneity in reporting practices.

This IRD Level 0 global effort dataset thus, preserves all original effort records from t-RFMOs and complies with a unified data structure while maintaining the granularity and diversity of reporting. This level of processing is not a standardized or simplified effort dataset. No upper level of processing is currently made available by IRD. Any further aggregation or transformation of effort data should be conducted by the end-user, based on specific scientific goals and with careful consideration of the semantics behind each unit.

All the datasets are enriched with "gear_type_label", "fishing_fleet label", for catch, "species_group" using the FDIWG standards and for efforts "measurement_unit_labels".

Appendix work:

To reproduce the data and the workflow we provide a .zip with all the initial data used as well as labeling and the mapping to nominal geometries (see all_rawdata.zip)

The github repository (DOI:10.5281/zenodo.14039665 Allowing to reproduce this dataset)
A Shiny app has been created to easily visualize catch data in CWP format: ghcr.io/firms-gta/tunaatlas_pie_map_shiny_cwp_database:latest. The docker image is based on this DOI dataset and allows to explore it.
bastienird/CWP.dataset : an R package that allows to perform easy manipulation on data following the CWP standards and to create plots and structured reports.

If you are interested in creating a customized version of this Global Tuna Atlas with specific filters or adjustments based on particular issues, please feel free to reach out to us.

Files

Report_level2.pdf

Files (2.3 GB)

Name	Size	Download all
all_raw_data_GTA.tar.gz md5:77216895121fd61c00d67b899740b522	183.1 MB	Download
global_catch_ird_level1_1950_2024_public_harmonized.csv md5:11c6b2b6ecd19871426589a489827efc	648.1 MB	Preview Download
global_catch_ird_level1_1950_2024_public_harmonized.rds md5:bcb36d39d973549cc455daa2ae145f12	32.8 MB	Download
global_catch_ird_level2_1950_2024_public_harmonized.csv md5:d4b6500be898a58f86bd77adb47f3933	948.6 MB	Preview Download
global_catch_ird_level2_1950_2024_public_harmonized.rds md5:0c2b1ce1444beaf50ba73d7d5a6ab1f4	51.3 MB	Download
global_effort_ird_1950_2024_public_harmonized.csv md5:dbf99a1589f221b3cf9092d9df8df9a9	331.1 MB	Preview Download
global_effort_ird_1950_2024_public_harmonized.rds md5:d5adbd16cb7ae4617bc72b2be7717a34	17.4 MB	Download
Report_level0.pdf md5:5484221f9112e84445f9f189730f7ae6	10.4 MB	Preview Download
Report_level1.pdf md5:67a771c7550814dca89dd69c32b928d8	8.0 MB	Preview Download
Report_level2.pdf md5:41fcfcb94103d8f79ccb47f71c305b44	10.3 MB	Preview Download
Report_level2_Major_tunas.pdf md5:ee229cb5221f23d5c37a4fd4514ca0e4	10.3 MB	Preview Download

Additional details

Is described by: Computational notebook: 10.5281/zenodo.14025108 (DOI)
Requires: Dataset: 10.5281/zenodo.11410529 (DOI); Dataset: 10.5281/zenodo.11459367 (DOI); Workflow: 10.5281/zenodo.14039665 (DOI)

European Commission
Blue-Cloud 2026 - A federated European FAIR and Open Research Ecosystem for oceans, seas, coastal and inland waters 101094227
European Commission
Blue Cloud - Blue-Cloud: Piloting innovative services for Marine Research & the Blue Economy 862409

Repository URL: https://github.com/firms-gta/geoflow-tunaatlas
Programming language: R
Development Status: Active

	All versions	This version
Views	2,382	47
Downloads	3,764	183
Data volume	2.9 TB	38.2 GB

Contributors

Data collector (5):

Report_level2.pdf

Files (2.3 GB)

Related works

Funding

Software

Global monthly tuna, tuna-like and shark catch (levels 1-2) and fishing effort (level 0) datasets (1950-2024) at 1° and 5° spatial resolution

Authors/Creators

Contributors

Data collector (5):

Description

Notes

Methods

Files

Report_level2.pdf

Files (2.3 GB)

Additional details

Related works

Funding

Software