There is a newer version of the record available.

Published December 20, 2021 | Version 1.1
Dataset Open

The LOTUS Initiative for Open Natural Products Research: wikidata query results

  • 1. University of Geneva
  • 2. University of Illinois at Chicago
  • 3. University of Fribourg
  • 4.


Wikidata query results returned by the downloadLotus module of the program.

See details of the module here

This dataset consists of four tables.

  1. compounds.tsv - chemical structure metadata (wikidataId, canonicalSmiles, isomericSmiles, inchi, inchiKey)
  2. references.tsv - bibliographic reference metadata (wikidataId, pipe-separated DOIs, titles)
  3. taxa.tsv - biological organism metadata (wikidataId, pipe-separated names, taxon rank)
  4. compound_reference_taxon.tsv - the documented structure-organism pairs
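As a minimal sketch of how these tables might be read, the Python snippet below parses tab-separated rows and splits a pipe-separated field into a list. The helper names `read_tsv` and `split_pipe`, the header names `names` and `rank`, and the placeholder row are assumptions made for illustration; the actual column headers should be checked in the files themselves. It also assumes the files use no special quoting beyond the `csv` module defaults.

```python
import csv
import io


def read_tsv(handle):
    """Return the rows of a tab-separated file as a list of dicts keyed by header."""
    return list(csv.DictReader(handle, delimiter="\t"))


def split_pipe(value):
    """Split a pipe-separated field into a list, dropping empty entries."""
    return [part.strip() for part in (value or "").split("|") if part.strip()]


# Tiny in-memory stand-in for taxa.tsv.
# The header names and the row are hypothetical placeholders, not real data.
sample = io.StringIO(
    "wikidataId\tnames\trank\n"
    "Q_EXAMPLE\tName one|Name two\tspecies\n"
)

rows = read_tsv(sample)
names = split_pipe(rows[0]["names"])
print(names)
```

For a downloaded file, the same helper could be called on `open("taxa.tsv", newline="", encoding="utf-8")` in place of the in-memory sample.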

This dataset includes not only the outputs of the LOTUS processing pipeline (available here) but also any Wikidata chemical compound that has the "found in taxon" property, together with its associated organisms and documenting references.
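To illustrate the kind of selection described above, the sketch below builds a SPARQL query string for items carrying a "found in taxon" statement together with a documenting reference. It is not the query used by the downloadLotus module. It assumes the Wikidata property identifiers P703 ("found in taxon") and P248 ("stated in") and the prefixes predefined by the Wikidata Query Service; the function name `build_found_in_taxon_query` is hypothetical. The snippet only builds the query text and sends nothing over the network.

```python
def build_found_in_taxon_query(limit=None):
    """Build a SPARQL query for compound / taxon / reference triples.

    Assumes P703 = "found in taxon" and P248 = "stated in", and relies on the
    p:, ps:, pr: and prov: prefixes predefined by the Wikidata Query Service.
    """
    query = (
        "SELECT ?compound ?taxon ?reference WHERE {\n"
        "  ?compound p:P703 ?statement .\n"
        "  ?statement ps:P703 ?taxon ;\n"
        "             prov:wasDerivedFrom/pr:P248 ?reference .\n"
        "}"
    )
    if limit is not None:
        # Optional cap on the number of rows, useful when exploring.
        query += f"\nLIMIT {int(limit)}"
    return query


print(build_found_in_taxon_query(limit=10))
```

Keeping the query as plain text leaves the choice of SPARQL client open; an unrestricted query of this shape is likely to return a large result set, so a small `limit` is sensible for a first look.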




Files (245.2 MB)

Size
117.1 MB
111.9 MB
13.5 MB
2.7 MB