Dataset Open Access

Ooh Na Na

Charles Tapley Hoyt

This is a gzipped three-column TSV file that has 138,749,325 prefixes, identifiers, and names for lots of biomedical entities, drawing from the OBO Foundry, ontologies in the Ontology Lookup Service, and many other nomenclature consortia that just haven't made it to the prime-time of standardized goodness. Ultimately, this dataset helps answer the question: what's my name?

It's really a lot of work to get this stuff, so I tried to make it easy. It was generated with the following code in the shell:

pip install pyobo
obo database names

More information on how and why this resource was made is available at https://cthoyt.com/2020/04/18/ooh-na-na.html.
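
For readers who just want to answer "what's my name?" from the downloaded file, here is a minimal sketch of streaming it with the Python standard library. It rests on assumptions not confirmed by this page: that the three columns appear in the order prefix, identifier, name, and that the file is UTF-8 encoded. The function names `iter_names` and `lookup_name` are hypothetical helpers, not part of PyOBO. If the file has a header row, it is yielded like any other row.

```python
import gzip
from typing import Iterator, Optional, Tuple


def iter_names(path: str) -> Iterator[Tuple[str, str, str]]:
    """Stream (prefix, identifier, name) rows from a gzipped three-column TSV.

    Assumption: columns are ordered prefix, identifier, name.
    Lines that do not have exactly three tab-separated fields are skipped.
    """
    with gzip.open(path, mode="rt", encoding="utf-8") as file:
        for line in file:
            parts = line.rstrip("\n").split("\t")
            if len(parts) != 3:
                continue
            yield parts[0], parts[1], parts[2]


def lookup_name(path: str, prefix: str, identifier: str) -> Optional[str]:
    """Return the first name recorded for a prefix/identifier pair, or None."""
    for row_prefix, row_identifier, name in iter_names(path):
        if row_prefix == prefix and row_identifier == identifier:
            return name
    return None
```

A call might look like `lookup_name("names.tsv.gz", "chebi", "1234")`, where the prefix and identifier are placeholders. Streaming keeps memory use flat, but every lookup scans the file from the start, so for repeated queries it is likely better to load the prefixes of interest into a dictionary once.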

Version 1.2.0 adds PubChem, ChEMBL, and NPASS.

Files (1.6 GB)

Name                 Size         Checksum
names.tsv.gz         1.6 GB       md5:ac802e2e3270cbfaf9029606fbfc1a98
names_sample.tsv     289 Bytes    md5:91b104bb39b3c6af23a1f4d29d7dab1e
names_summary.tsv    1.3 kB       md5:e4c0304b9252c23bc1259599c06219fb