Dataset Open Access
This is a gzipped three-column TSV file that has 138,749,325 prefixes, identifiers, and names for lots of biomedical entities, drawing from the OBO Foundry, ontologies in the Ontology Lookup Service, and many other nomenclature consortia that just haven't made it to the prime-time of standardized goodness. Ultimately, this dataset helps answer the question: what's my name?
It's really a lot of work to get this stuff, so I tried to make it easy. It was generated with the following code in the shell:
pip install pyobo obo database names
More information on how and why this resource was made is available at https://cthoyt.com/2020/04/18/ooh-na-na.html.