Unified Brazilian Rainfall Dataset (UNIPLU-BR): A Standardized National Database of Point Precipitation from Major Brazilian Monitoring Networks (1885 - 2025)
Authors/Creators
-
Lemos, Filipe Carvalho
(Data curator)1
-
Freitas, Emerson da Silva
(Project member)2
-
Coelho, Victor Hugo Rabelo
(Project member)1
-
Reis, Dirceu Silveira
(Project member)3
-
Patriota, Eduardo Gonçalves
(Project member)1
-
Meira, Marcela Antunes
(Project member)4
-
Vidal-Barbosa, José Lindemberg
(Project member)1
-
Claudino, Cinthia Maria de Abreu
(Project member)1
-
Silva, Gerald Souza
(Project member)1
-
Nascimento, Daniel
(Project member)1
-
Moura Ramos Filho, Geraldo
(Project member)1
-
de Sá Santos Rabello, Ana Paula
(Project member)5
-
Alves, Luna
(Project member)6
-
Zeri, Luis Marcelo de Mattos
(Project member)5
-
Ribeiro Neto, Germano
(Project member)7
-
Bertrand, Guillaume
(Project member)8
-
Tomasella, Javier
(Project member)9
-
Melo, Davi
(Project member)1
-
Souza, Saulo
(Project member)10
-
Abdalla Araujo, Alexandre
(Project member)10
-
Rampinelli, Cássio
(Project member)10
-
Das Neves Almeida, Cristiano
(Supervisor)1
- 1. Federal University of Paraíba
- 2. Federal Institute of Paraíba
- 3. University of Brasilia
-
4.
Swansea University
- 5. National Center for Monitoring and Early Warning of Natural Disasters (CEMADEN)
- 6. Geological Survey of Brazil
-
7.
University of Bristol
- 8. Université de Franche-Comté
-
9.
National Institute for Space Research
- 10. National Water and Sanitation Agency (ANA)
Description
Unified Brazilian Rainfall Dataset (UNIPLU-BR): A Standardized National Database of Point Precipitation from Major Brazilian Monitoring Networks (1885 – 2025)
This dataset is the first unified and standardized national database of point precipitation (non-interpolated) in Brazil, consolidating raw data from five primary monitoring networks:
- CEMADEN: National Center for Monitoring and Early Warning of Natural Disasters.
- INMET: National Institute of Meteorology.
- ANA (Hidroweb): National Water and Sanitation Agency.
- Telemetria: Telemetry system for hydrological monitoring.
- ICEA: Institute of Air Space Control.
The primary contribution of this work is overcoming the high fragmentation of rainfall data in Brazil through a rigorous curation process:
- Structural and Nominal Standardization: Harmonization of column names, attributes, and storage formats, addressing historical changes in data protocols and formats within the same agency.
- Time Zone Adjustment: Standardization of timestamps based on the station's geographical location (UTC offset).
- Temporal Resolution: Records ranging from 10-minute intervals to 1-day resolution.
Data Quality Disclaimer: The processing of this dataset is strictly focused on structural standardization. No qualitative assessment, physical consistency checks, or outlier filtering were performed. The rainfall values remain as originally reported by the agencies, now organized into a unified, analysis-ready structure.
Impressive numbers:
The dataset covers the period from 1885 to 2025, highlighted by the presence of stations with historical series exceeding a century. With broad national coverage, the database consolidates approximately 2.2 billion precipitation records from over 21,000 stations. This information features varied temporal resolutions, ranging from 10-minute intervals to 24-hour totals.
The distribution of these records among the main Brazilian monitoring networks is detailed below.
|
Source |
Potential average number of years |
Quantity |
Initial and Final Year |
|
CEMADEN |
7,99 |
5.061 |
2014 - 2025 |
|
Hidroweb |
32,38 |
12.067 |
1885 - 2025 |
|
ICEA |
33,19 |
183 |
1951 - 2025 |
|
INMET daily |
45,03 |
627 |
1889 - 2025 |
|
INMET sub-daily |
15,87 |
629 |
2000 - 2025 |
|
Telemetria |
6,63 |
2.819 |
2014 - 2025 |
|
Total |
— |
21.386 |
|
The following figures illustrate the spatial distribution of the stations and the annual data availability for both daily and sub-daily networks.
Accessing the Data
The data is stored in compressed ZIP files, which function as optimized containers. Each ZIP file internally contains two files in Parquet format: table_info.parquet (station metadata) and table_data.parquet (rainfall time series).
The primary advantage of this structure is that, using Python or R, you can read the data directly from memory. This eliminates the need to manually decompress files to the disk, saving storage space and accelerating processing within automation workflows.
Within these files, the gauge_code (station code) serves as the primary key that links the registration information to the measurement data.
Metadata
Rainfall gauge information (table_info)
This dataframe functions as the 'identity document' for the rain gauge stations. It contains the static characteristics of each monitoring point:
|
Column |
Description |
|
gauge_code |
Unique station identifier (ID). This serves as the link to table_data. |
|
city / state |
The administrative location of the station (e.g. João Pessoa, PB). |
|
lat / long |
Geographic coordinates in decimal degrees. |
|
elevation |
The station's altitude above mean sea level (meters). |
|
time_step |
Estimated temporal resolution of the data (1440 minutes = 24 hours/daily). |
|
network |
Data network source (e.g. Hidroweb). |
|
responsible |
The agency responsible for operations (ANA or SGB-CPRM). |
|
utc |
Local time zone relative to the Greenwich Meridian (-3 for Brasília Time). |
Time Series (table_data)
Whilst table_info defines the location and identity of the station, table_data records the rainfall measurements. It contains the following columns:
|
Column |
Description |
|
gauge_code |
Unique station identifier (ID). This serves as the link to table_info. |
|
datetime |
The date and time of the reading. |
|
rain_mm |
The volume of precipitation recorded during that interval, measured in millimeters (mm). |
Script Examples
Examples of scripts for accessing and filtering data, as well as generating plots, can be found at the following link: GitHub - LARHENA/UNIPLU-BR: Unified Brazilian Rainfall Dataset (UNIPLU-BR): A Standardized National Database of Point Precipitation from Major Brazilian Monitoring Networks (1885 – 2025) · GitHub
How to Cite
Das Neves Almeida, C., Francis Bertrand, G., Carvalho Lemos, F., da Silva Freitas, E., Lins Silva, A., Vidal Barbosa, J. L., ... & Coelho, V. H. R. (2025). The design of the Brazilian Sub-Daily Rainfall dataset (BR-SDR): two decades of high-time-resolution data in Brazil. Hydrological Sciences Journal, 70(11), 1850-1862. https://doi.org/10.1080/02626667.2025.2506193
Main papers published by the group
Das Neves Almeida, C., Francis Bertrand, G., Carvalho Lemos, F., da Silva Freitas, E., Lins Silva, A., Vidal Barbosa, J. L., ... & Coelho, V. H. R. (2025). The design of the Brazilian Sub-Daily Rainfall dataset (BR-SDR): two decades of high-time-resolution data in Brazil. Hydrological Sciences Journal, 70(11), 1850-1862.
Vidal-Barbosa, J. L., Lemos, F. C., da Silva Freitas, E., Coelho, V. H. R., da Silva, G. N. S., Patriota, E. G., ... & das Neves Almeida, C. (2025). BRain-D: A quality-controlled methodology for constructing the BRazilian Daily rainfall gridded data. Atmospheric Research, 108552.
Freitas, E. D. S., Coelho, V. H. R., Bertrand, G. F., Lemos, F. C., & Almeida, C. D. N. (2024). IMERG BraMaL: An improved gridded monthly rainfall product for Brazil based on satellite‐based IMERG estimates and machine learning techniques. International Journal of Climatology, 44(11), 3976-3997.
Lemos, F. C., Coelho, V. H. R., Freitas, E. D. S., Tomasella, J., Bertrand, G. F., Meira, M. A., ... & Almeida, C. D. N. (2023). Spatiotemporal distribution of precipitation and its characteristics under tropical atmospheric systems of Brazil: Insights from a large sub‐hourly database. Hydrological Processes, 37(11), e15017.
Ramos Filho, G. M., Coelho, V. H. R., da Silva Freitas, E., Xuan, Y., Brocca, L., & das Neves Almeida, C. (2022). Regional-scale evaluation of 14 satellite-based precipitation products in characterising extreme events and delineating rainfall thresholds for flood hazards. Atmospheric Research, 276, 106259.
Meira, M. A., Freitas, E. S., Coelho, V. H. R., Tomasella, J., Fowler, H. J., Ramos Filho, G. M., ... & Almeida, C. D. N. (2022). Quality control procedures for sub-hourly rainfall data: An investigation in different spatio-temporal scales in Brazil. Journal of Hydrology, 613, 128358.
Freitas, E. D. S., Coelho, V. H. R., Xuan, Y., de CD Melo, D., Gadelha, A. N., Santos, E. A., ... & Almeida, C. D. N. (2020). The performance of the IMERG satellite-based product in identifying sub-daily rainfall events and their properties. Journal of Hydrology, 589, 125128.