Dataset Open Access

# Ooh Na Na

Charles Tapley Hoyt

This is a gzipped three-column TSV file containing 138,749,325 prefixes, identifiers, and names for biomedical entities. It draws on the OBO Foundry, ontologies in the Ontology Lookup Service, and many other nomenclature consortia that just haven't made it to the prime time of standardized goodness. Ultimately, this dataset helps answer the question: what's my name?

Gathering all of this is really a lot of work, so I tried to make it easy. The dataset was generated by running the following in the shell:

```shell
pip install pyobo
obo database names
```

Version 1.2.0 adds PubChem, ChEMBL, and NPASS.
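
Since the file is large, it is usually easier to stream it than to load it all at once. The following is a minimal sketch of reading it in Python, assuming the three columns appear in the order prefix, identifier, name (as the description suggests). The helper names `iter_names` and `lookup_name` are hypothetical and are not part of PyOBO. Whether the file starts with a header row is not stated here; if it does, this sketch yields it like any other row.

```python
import csv
import gzip
from typing import Iterator, Optional, Tuple


def iter_names(path: str) -> Iterator[Tuple[str, str, str]]:
    """Yield (prefix, identifier, name) tuples from a gzipped three-column TSV.

    Assumption: columns are ordered prefix, identifier, name.
    Lines that do not have exactly three columns are skipped.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as file:
        # QUOTE_NONE treats quote characters inside names as literal text
        reader = csv.reader(file, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            if len(row) == 3:
                yield row[0], row[1], row[2]


def lookup_name(path: str, prefix: str, identifier: str) -> Optional[str]:
    """Return the first name recorded for (prefix, identifier), or None."""
    for row_prefix, row_identifier, name in iter_names(path):
        if row_prefix == prefix and row_identifier == identifier:
            return name
    return None
```

A call such as `lookup_name("names.tsv.gz", prefix, identifier)` scans the file once from the top, so for many lookups it would be better to load the prefixes of interest into a dictionary first.
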

Files (1.6 GB)

| Name | Size | MD5 |
|---|---|---|
| names.tsv.gz | 1.6 GB | ac802e2e3270cbfaf9029606fbfc1a98 |
| names_sample.tsv | 289 Bytes | 91b104bb39b3c6af23a1f4d29d7dab1e |
| names_summary.tsv | 1.3 kB | e4c0304b9252c23bc1259599c06219fb |
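
After downloading, the MD5 checksums listed above can be used to confirm that a file arrived intact. This is a minimal sketch using only the Python standard library; the helper names `md5_of` and `verify` are hypothetical, and the expected values are copied from the file listing.

```python
import hashlib

# Checksums copied from the file listing of this record
EXPECTED_MD5 = {
    "names.tsv.gz": "ac802e2e3270cbfaf9029606fbfc1a98",
    "names_sample.tsv": "91b104bb39b3c6af23a1f4d29d7dab1e",
    "names_summary.tsv": "e4c0304b9252c23bc1259599c06219fb",
}


def md5_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as file:
        # Read fixed-size chunks so the 1.6 GB file never sits in memory at once
        for chunk in iter(lambda: file.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, name: str) -> bool:
    """Check a downloaded file against the checksum listed for `name`."""
    return md5_of(path) == EXPECTED_MD5[name]
```

For example, `verify("names.tsv.gz", "names.tsv.gz")` returns `True` only when the local copy matches the listed checksum.
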