8,163 High-Quality metagenome-assembled genomes from the global soil microbiome
Description
Metagenome-assembled genome (MAG) sequences from the global soil microbiome were obtained from the Soil Microbial Dark Matter MAG and Microflora Danica long-read MAG catalogues, and filtered to retain only high-quality (HQ) MAGs based on MIMAG standards. That is, genomes that had >90% CheckM2 completeness and <5% CheckM2 contamination, and had the 23S, 16S, and 5S rRNA genes, and at least 18 different tRNA genes. MAGs were dereplicated at approximately species-level, using clustering thresholds of 95% average nucleotide identity (ANI) and 50% aligned fraction (AF).
MAG sequences are within the compressed directory, 'MAGs.tar.gz'.
Pooled predicted protein sequences are in 'HQ_MAGs.prots.faa.gz'.
Files
Files
(16.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:9168575681a67749c900c9b16a8fbb68
|
5.8 GB | Download |
|
md5:1f7f3c2eec6a4d710f03316a71db05bf
|
10.4 GB | Download |