There is a newer version of the record available.

Published April 9, 2020 | Version 3
Dataset Open

COCONUT: the COlleCtion of Open NatUral producTs.

  • 1. Friedrich-Schiller University Jena

Description

COCONUT is a COlleCtion of Open NatUral producTs.

To assemble COCONUT, data from 50 open access collections and databases of natural products was retrieved and curated.

This archive contains two files:

  • The MongoDB dump, the most complete version of the dataset, with extensive molecular annotations
  • The COCONUT4MetFrag file, used for MetFrag. The last version of COCONUT4MetFrag is in the file "COCONUT4MetFrag_april.csv"

To restore the dataset in MongoDB:

unzip COCONUTapril.zip
cd COCONUTapril /COCONUT/
mongorestore --db=COCONUT --noIndexRestore .

It is generally useful to avoid restoring indexes, as they can interfere with the local installation. Here are the commands to rebuild indexes:

mongo
use COCONUT


db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:"hashed"})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {absoluteSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {idInSource:1})

db.uniqueNaturalProduct.createIndex( {inchi:"hashed"})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles: "hashed"})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.uniqueNaturalProduct.createIndex( {coconut_id:1})
db.uniqueNaturalProduct.createIndex( {fragmentsWithSugar:"hashed"})
db.uniqueNaturalProduct.createIndex( {fragments:"hashed"})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})
db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:"hashed"})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {absoluteSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {idInSource:1})
db.uniqueNaturalProduct.createIndex( {inchi:"hashed"})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles: "hashed"})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.uniqueNaturalProduct.createIndex( {coconut_id:1})
db.uniqueNaturalProduct.createIndex( {fragmentsWithSugar:"hashed"})
db.uniqueNaturalProduct.createIndex( {fragments:"hashed"})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})


This version of COCONUT is beta and will be curated further, but can already be used as it is.

Files

COCONUT4MetFrag.csv

Files (5.3 GB)

Name Size Download all
md5:48d42ba0e45c54d3b334df0230fe41c6
35.2 MB Download
md5:d7736795d57e0404c20c9a3ca2b400e8
120.2 MB Preview Download
md5:c3925f2f8da49047101967438816c9db
123.7 MB Preview Download
md5:7a1f5a6a46c59b9babc1073b00471e38
119.8 MB Preview Download
md5:4615507bf83aec1b7124dab4675a8986
3.6 GB Preview Download
md5:5490171dfddf98f7fefb1d8c49a66ce2
1.2 GB Preview Download

Additional details