Published December 5, 2025 | Version MicroML
Software Open

DaniloCVieira/SantosBasin-Microbiome-ML

Authors/Creators

  • 1. Universidade Federal de São Paulo

Description

# Microbial Composition and Environmental Properties in the Santos Basin: Data and Code Repository

This repository contains the resources and data associated with the manuscript **"A Machine Learning Approach Elucidates Spatial Patterns of Environmental Properties Driving Microbial Composition Over Santos Basin, South Atlantic."**

## Overview

The study investigates the structure and diversity of microbial communities in the Santos Basin (SB), Brazil's largest marine sedimentary basin and an ecologically and industrially significant region. By integrating amplicon sequencing data and flow cytometry quantitative cell counts with environmental parameters, this research provides insights into microbial diversity and the environmental gradients shaping these communities.

## Repository Contents

```text
├── family_data.csv               # Microbial family abundance data
├── Cito_data.csv                 # Cytometry data for microbial cell counts
├── Coords_Depth_data.csv         # Coordinates and depth information
├── environ_data.csv              # Environmental variable dataset
├── depth_layers.csv              # Depth layer metadata
├── args_map3D.rds                # Arguments for 3D map visualizations
├── 01-Script_ML_supplementary.R  # Main machine learning analysis script
├── 02-Script_3D_maps_supplementary.R  # Script for generating 3D visualizations
├── Auxiliar_functions.R          # Auxiliary functions for data analysis

Files

DaniloCVieira/SantosBasin-Microbiome-ML-MicroML.zip

Files (4.9 MB)

Additional details