There is a newer version of the record available.

Published April 30, 2020 | Version 4
Dataset Open

COCONUT: the COlleCtion of Open NatUral producTs.

  • 1. Friedrich-Schiller University Jena

Description

COCONUT is a COlleCtion of Open NatUral producTs.

 

The database is now available at coconut.naturalproducts.net, where the latest updates will appear before being available here.

To assemble COCONUT, data from 50 open access collections and databases of natural products was retrieved and curated.

This archive contains two files:

  • The MongoDB dump, the most complete version of the dataset, with extensive molecular annotations
  • The COCONUT4MetFrag file, used for MetFrag. The last version of COCONUT4MetFrag is in the file "COCONUT4MetFrag_april.csv"
  • The COCONUT.sdf file containing all unique NP molecules with selected metadata

To restore the dataset in MongoDB:

unzip COCONUTapril.zip
cd COCONUTapril /COCONUT/
mongorestore --db=COCONUT --noIndexRestore .

It is generally useful to avoid restoring indexes, as they can interfere with the local installation. Here are the commands to rebuild indexes:

mongo
use COCONUT


db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:"hashed"})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {absoluteSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {idInSource:1})

db.uniqueNaturalProduct.createIndex( {inchi:"hashed"})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles: "hashed"})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.uniqueNaturalProduct.createIndex( {coconut_id:1})
db.uniqueNaturalProduct.createIndex( {fragmentsWithSugar:"hashed"})
db.uniqueNaturalProduct.createIndex( {fragments:"hashed"})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})
db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:"hashed"})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {absoluteSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {idInSource:1})
db.uniqueNaturalProduct.createIndex( {inchi:"hashed"})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles: "hashed"})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.uniqueNaturalProduct.createIndex( {coconut_id:1})
db.uniqueNaturalProduct.createIndex( {fragmentsWithSugar:"hashed"})
db.uniqueNaturalProduct.createIndex( {fragments:"hashed"})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})


This version of COCONUT is beta and will be curated further, but can already be used as it is.

Files

COCONUT.sdf.zip

Files (5.4 GB)

Name Size Download all
md5:e9eaf541fd6b4fd22a05ca2fe7ff14cc
217.6 MB Preview Download
md5:d7736795d57e0404c20c9a3ca2b400e8
120.2 MB Preview Download
md5:c3925f2f8da49047101967438816c9db
123.7 MB Preview Download
md5:7a1f5a6a46c59b9babc1073b00471e38
119.8 MB Preview Download
md5:4615507bf83aec1b7124dab4675a8986
3.6 GB Preview Download
md5:5490171dfddf98f7fefb1d8c49a66ce2
1.2 GB Preview Download

Additional details