There is a newer version of the record available.

Published November 12, 2021 | Version 1.0.0
Journal article Open

Naming the unnamed: Over 45,000 Candidatus names for unnamed Archaea and Bacteria in the Genome Taxonomy Database

  • 1. University of East Anglia
  • 2. Quadram Insitute Bioscience

Description

Thousands of new bacterial and archaeal species and higher-level taxa are discovered each year through the analysis of genomes and metagenomes. The Genome Taxonomy Database (GTDB) provides hierarchical sequence-based descriptions and classifications for new and as-yet-unnamed taxa. However, bacterial nomenclature, as currently configured, cannot keep up with the need for new well-formed names. Instead, microbiologists have been forced to use hard-to-remember alphanumeric placeholder labels. Here, we exploit an approach to the generation of well-formed arbitrary Latinate names at a scale sufficient to name tens of thousands of unnamed taxa within GTDB. These newly created names represent an important resource for the microbiology community, facilitating communication between bioinformaticians, microbiologists and taxonomists, while populating the emerging landscape of microbial taxonomic and functional discovery with accessible and memorable linguistic labels.

Presented here are input and output files associated with the scripts used in this project. Scripts are available from 

Files

ar122_ar_genus_names_table.txt

Files (725.6 MB)

Name Size Download all
md5:af2947a354b1f356156a7fd19e2576a9
13.3 kB Preview Download
md5:0607728ae1f56bdb1a7cc24d238185c3
5.5 MB Download
md5:07e973180d1a6343e0d473568fb0228a
5.5 MB Download
md5:9131fcf89da6a7d2283d4166891ddb6a
160.8 kB Preview Download
md5:7065c1691b9790e4ce67482b5a8e93fa
42.4 kB Preview Download
md5:924bfdd0943aa2510f4b8f5d59205a0a
648.9 kB Download
md5:02c99f90b8b60b08b8e9b139dfc35dce
637.8 kB Download
md5:6b165a8f4ee1e46bf21c5be1f7a68ca6
649.7 kB Download
md5:4be8ee2c5feef1496556cf06b5bf8df6
5 Bytes Preview Download
md5:5c5e33f69d980a22bd6418525668e2ca
16.2 kB Preview Download
md5:4d20570705b3bfb70c05606d1bbb3b01
2.2 MB Download
md5:6c134d81868cc99d8e289f1d129cc7dd
162.8 kB Preview Download
md5:743d58acefc40c809f1f3cda3cd7241d
734.2 kB Preview Download
md5:1e94ca31cace5459b01fa87fdae499b5
351.5 MB Download
md5:92750665ac322c6e4bd17819da3a9cfa
2.5 MB Preview Download
md5:d9cee7d6eb0026fba4ed09073a7996b6
37.4 MB Download
md5:954210b04aa5d24806696e5407d69ad3
37.2 MB Download
md5:8efaa9a46a12a19c9a9c8fcaa414013b
37.4 MB Download
md5:80868c982e765a55848c659be7fbbf1e
30 Bytes Preview Download
md5:8b941bc98791341996a97ab9a81f4d4c
91.3 kB Preview Download
md5:9a3685eeb5cdee1aab8549377f5a2fe6
33.8 MB Download
md5:d19e4eb0aff3a7aac040fa185ecd4413
74.1 MB Preview Download
md5:d7406f9b57896c9ce3983ddb998b1889
80.6 MB Preview Download
md5:9899285497f052cf0af6f822b303007a
6.1 MB Preview Download
md5:580027850ded79eb229146343e03106e
7.0 MB Download
md5:192b846a96d4b67b08f3673b2851bf1b
6.9 MB Download
md5:6ae384ca64c1ded888082ad99040966f
6.9 MB Download
md5:2be3f38c493c145edd68fd318984f05e
4.9 kB Download
md5:4d7703c9a66ee1f6f94b1205a6e4117c
6.9 MB Download
md5:f6a9863f8eb4881de5c3c7ef532211e7
7.0 MB Download
md5:429933f76e1f42dda3dbfa17e9a5c41b
6.9 MB Download
md5:647f04c2be955e0d5e8f608b41456c90
8.8 kB Preview Download
md5:8c0d1e72249ebf305df14977e25a4613
112.2 kB Preview Download
md5:ca609e12050156f19a7d9a3655d49ecf
1.7 MB Preview Download
md5:5e62da5bb33b4753ba766ad32a053259
57.0 kB Preview Download
md5:c001ed1091f3864e909358be78382da6
932.1 kB Preview Download
md5:1fccafe947a0e147accc02f6c4c4dfe1
2.7 MB Preview Download
md5:7b59d1161cb6ae1cd2146e570cccf083
179 Bytes Preview Download
md5:c6dd836afe6d2fc4753a0066a33ed789
500.1 kB Preview Download
md5:7bcb5f2b1354b686144a77f8d6cf2374
9.1 kB Preview Download
md5:05c5d069b9cd6913c417755e4898688a
481.0 kB Preview Download
md5:a0d05ac8ebe16e250f01b8545a45632a
12.5 kB Preview Download
md5:6c9811826b2f42fbca7368336259e03f
150.7 kB Preview Download
md5:2c581200ad7b3c5e1011265a8299d5e4
432.0 kB Preview Download