Published September 16, 2022 | Version v4
Dataset Open

The LOTUS Initiative for Open Natural Products Research: wikidata query results

  • 1. University of Geneva
  • 2. University of Illinois at Chicago
  • 3. University of Fribourg
  • 4. https://www.wikidata.org/

Description

Wikidata query results returned by the downloadLotus module of the lotus-wikidata-interact program (https://github.com/lotusnprod/lotus-wikidata-interact).

For details of the module, see https://github.com/lotusnprod/lotus-wikidata-interact/blob/main/downloadLotus/README.md

This dataset consists of four tables:

  1. compounds.tsv - chemical structure metadata (wikidataId, canonicalSmiles, isomericSmiles, inchi, inchiKey)
  2. references.tsv - bibliographic reference metadata (wikidataId, pipe-separated DOIs, titles)
  3. taxa.tsv - biological organism metadata (wikidataId, pipe-separated names, taxon rank)
  4. compound_reference_taxon.tsv - the documented structure-organism pairs
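The tables above are tab-separated, and some columns hold several values joined by a pipe character. The following is a minimal sketch of how such a table could be read with the Python standard library. The helper name `read_tsv` and the header names used in the sample (`dois`, `title`) are hypothetical; check the header row of the downloaded files for the actual column names.

```python
import csv
import io
from typing import Iterable, TextIO


def read_tsv(handle: TextIO, multi_value_columns: Iterable[str] = ()) -> list[dict]:
    """Read a tab-separated table into a list of row dictionaries.

    Columns named in `multi_value_columns` are split on "|" into lists;
    empty cells become empty lists.
    """
    multi = set(multi_value_columns)
    rows = []
    # QUOTE_NONE: treat quote characters in titles or names as ordinary text.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        for column in multi:
            value = row.get(column) or ""
            row[column] = [part for part in value.split("|") if part]
        rows.append(row)
    return rows


# Tiny in-memory stand-in for references.tsv.
# The header names and the values are hypothetical, not taken from the dataset.
sample = io.StringIO(
    "wikidataId\tdois\ttitle\n"
    "Q0000001\t10.1000/example.1|10.1000/example.2\tAn example title\n"
    "Q0000002\t\tA reference without a DOI\n"
)
references = read_tsv(sample, multi_value_columns=["dois"])
print(references[0]["dois"])
```

For a real file, the same function can be called on `open("references.tsv", encoding="utf-8", newline="")`, assuming the file has been downloaded to the working directory.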

This dataset includes not only the outputs of the LOTUS processing pipeline (available at https://doi.org/10.5281/zenodo.5665295) but also every Wikidata chemical compound that has the "found in taxon" property (https://www.wikidata.org/wiki/Property:P703), together with its associated organisms and documenting references.
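To illustrate what selecting compounds by the "found in taxon" property looks like, here is a rough sketch of a SPARQL query against the public Wikidata Query Service, built as a Python string. This is not the query used by the downloadLotus module; the choice of P235 (InChIKey) to identify chemical compounds, the use of P248 ("stated in") for references, and the function names are assumptions made for this example.

```python
from urllib.parse import urlencode

# Public Wikidata Query Service endpoint.
WIKIDATA_SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"


def found_in_taxon_query(limit: int = 10) -> str:
    """Build an illustrative SPARQL query for compound/taxon/reference triples.

    Assumption: a compound is any item with an InChIKey (P235); the real
    module may select compounds differently.
    """
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    return (
        "SELECT ?compound ?inchiKey ?taxon ?reference WHERE {\n"
        "  ?compound wdt:P235 ?inchiKey ;\n"
        "            p:P703 ?statement .\n"
        "  ?statement ps:P703 ?taxon .\n"
        "  OPTIONAL { ?statement prov:wasDerivedFrom/pr:P248 ?reference . }\n"
        "}\n"
        f"LIMIT {limit}"
    )


def query_url(query: str) -> str:
    """Return a GET URL asking the endpoint for JSON results."""
    return WIKIDATA_SPARQL_ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


print(found_in_taxon_query(limit=5))
```

Keeping the limit small is advisable for interactive use; for the full set of pairs, the published tables are the more practical source.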

Files

Files (185.5 MB)

  • md5:514fa04a6213a14bee6e303a5f901c3c (81.2 MB)
  • md5:4288bf4db4e1cec05a368c74d11655f2 (87.9 MB)
  • md5:55bbaf039263366e86f4f5228ee56bf2 (13.9 MB)
  • md5:4071b4da38fd5547a8627f1b516c8d97 (2.5 MB)
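The md5: values in the file listing can be used to check that a download arrived intact. The sketch below computes a file's MD5 digest in chunks and looks it up among a set of listed checksums. The listing here does not say which checksum belongs to which table, so the hypothetical helper `find_listed_checksum` simply reports which listed value, if any, a local file matches.

```python
import hashlib

# Checksums copied from the file listing of this record.
LISTED_CHECKSUMS = [
    "md5:514fa04a6213a14bee6e303a5f901c3c",
    "md5:4288bf4db4e1cec05a368c74d11655f2",
    "md5:55bbaf039263366e86f4f5228ee56bf2",
    "md5:4071b4da38fd5547a8627f1b516c8d97",
]


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_listed_checksum(path, listed=LISTED_CHECKSUMS):
    """Return the listed "md5:..." entry matching the file, or None."""
    actual = md5_of_file(path)
    for entry in listed:
        if entry.removeprefix("md5:").lower() == actual:
            return entry
    return None
```

For example, `find_listed_checksum("taxa.tsv")` would return one of the four entries if the local copy is intact, and `None` if the file is corrupted or belongs to a different version of the record. The file name in this call assumes the table was saved under its original name.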