Published June 14, 2024 | Version v2
Dataset
Open
NBC++ Database
Description
The NBC++ Metagenome Database is a collection of metagenomic data sampled from the RefSeq database using Woltka. The database includes three distinct profiles:
- Basic Profile: approximately one genome per genus, for a total of 4,634 genomes as of July 24, 2023.
- Standard Profile: all NCBI-defined reference and representative genomes, for a total of 18,237 genomes collected on July 26, 2023.
- Extended Profile: one genome per species with a Latinate name and higher ranks, for a total of 319,554 genomes as of July 26, 2023.
The assembly summary information for the genomes in the database is in:
- database_assembly_summaries.zip
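As a quick sanity check after downloading, the number of entries in each file of the archive can be compared with the genome counts listed above. The following is a minimal sketch only: the internal layout of database_assembly_summaries.zip is not described here, so the code assumes each archive member is a plain-text table with one genome per line and with header or comment lines starting with `#` (as in NCBI assembly summary files). The function name `count_rows_per_member` is hypothetical.

```python
import io
import zipfile


def count_rows_per_member(zip_source):
    """Return {member_name: data_row_count} for each file in the archive.

    Assumption (not confirmed by the dataset description): each member is a
    text table where lines starting with '#' are header/comment lines and
    every other non-empty line describes one genome.
    """
    counts = {}
    with zipfile.ZipFile(zip_source) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip directory entries
            with archive.open(info) as raw:
                text = io.TextIOWrapper(raw, encoding="utf-8", errors="replace")
                counts[info.filename] = sum(
                    1
                    for line in text
                    if line.strip() and not line.startswith("#")
                )
    return counts
```

Calling `count_rows_per_member("database_assembly_summaries.zip")` on the downloaded file returns a dictionary keyed by member name. If the assumptions hold, the counts should be comparable to the profile totals; if the files use a different layout, the counting rule needs to be adjusted.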