Published March 27, 2025 | Version v1
Dataset Open

8,163 High-Quality metagenome-assembled genomes from the global soil microbiome

Authors/Creators

  • 1. ROR icon Macquarie University

Description

Metagenome-assembled genome (MAG) sequences from the global soil microbiome were obtained from the Soil Microbial Dark Matter MAG and Microflora Danica long-read MAG catalogues, and filtered to retain only high-quality (HQ) MAGs based on MIMAG standards. That is, genomes that had >90% CheckM2 completeness and <5% CheckM2 contamination, and had the 23S, 16S, and 5S rRNA genes, and at least 18 different tRNA genes. MAGs were dereplicated at approximately species-level, using clustering thresholds of 95% average nucleotide identity (ANI) and 50% aligned fraction (AF).

MAG sequences are within the compressed directory, 'MAGs.tar.gz'.
Pooled predicted protein sequences are in 'HQ_MAGs.prots.faa.gz'.

Files

Files (16.3 GB)

Name Size Download all
md5:9168575681a67749c900c9b16a8fbb68
5.8 GB Download
md5:1f7f3c2eec6a4d710f03316a71db05bf
10.4 GB Download