COG_Functional_Category_Abundances_and_GTDB_Taxonomy
Description
Dataset S1:
Each row corresponds to one genome, except the top row, which contains the column headers. Columns 1 through 25 contain the raw abundances for each COG functional category. Column 26 is the total number of COGs in the genome. Columns 27 through 33 give the GTDB domain, phylum, class, order, family, genus, and species classification, respectively. Column 34 is the culture status. Column 35 is the genome size in base pairs. Column 36 is the accession number for each genome. Accessions starting with GCF and GCA are from RefSeq and GenBank, respectively. Accessions that are numbers only correspond to IMG/G. Column 37 is the total number of open reading frames in the genome.
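The column layout described above can be read by position. The following is a minimal sketch using only the Python standard library, not code distributed with the dataset. The function names, the dictionary keys, and the choice to parse abundances as floats are assumptions made for illustration; the actual header names in DatasetS1.csv are not relied on, and commas are assumed as the field delimiter.

```python
import csv
import io

# Rank names follow the order given in the description (columns 27-33).
TAXONOMY_RANKS = ["domain", "phylum", "class", "order", "family", "genus", "species"]
EXPECTED_COLUMNS = 37


def classify_accession(accession):
    """Map an accession to its source database using the prefixes in the description."""
    if accession.startswith("GCF"):
        return "RefSeq"
    if accession.startswith("GCA"):
        return "GenBank"
    if accession.isdigit():
        return "IMG/G"
    return "unknown"  # hypothetical fallback, not mentioned in the description


def parse_row(row):
    """Split one data row (a list of 37 strings) into named fields by position."""
    if len(row) != EXPECTED_COLUMNS:
        raise ValueError(f"expected {EXPECTED_COLUMNS} columns, got {len(row)}")
    return {
        "cog_counts": [float(x) for x in row[0:25]],         # columns 1-25
        "total_cogs": float(row[25]),                        # column 26
        "taxonomy": dict(zip(TAXONOMY_RANKS, row[26:33])),   # columns 27-33
        "culture_status": row[33],                           # column 34
        "genome_size_bp": int(float(row[34])),               # column 35
        "accession": row[35],                                # column 36
        "source": classify_accession(row[35]),
        "total_orfs": int(float(row[36])),                   # column 37
    }


def read_dataset(handle):
    """Read all genomes from an open CSV handle, skipping the header row."""
    reader = csv.reader(handle)
    next(reader)  # top row holds the column headers
    return [parse_row(row) for row in reader]
```

To use it on the real file, open it with `open("DatasetS1.csv", newline="")` and pass the handle to `read_dataset`. If the file turns out to contain empty or non-numeric cells in the numeric columns, `parse_row` will raise and would need adjusting.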
Files
Name | Size | MD5 |
---|---|---|
DatasetS1.csv | 29.0 MB | 6488d04960b3a56ac94c8c0487d06918 |
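The MD5 checksum listed for DatasetS1.csv can be used to confirm that a download is complete. The following is a minimal sketch with Python's standard `hashlib` module; the function names and the chunk size are illustrative choices, and the local path is whatever location the file was saved to.

```python
import hashlib

# Checksum as listed in the file table for DatasetS1.csv.
EXPECTED_MD5 = "6488d04960b3a56ac94c8c0487d06918"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_expected("DatasetS1.csv")` on the downloaded file should return `True` if the file is intact.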
Additional details
Related works
- Is supplement to
- 10.1101/520973 (DOI)
- 10.1128/mSphere.00446-19 (DOI)