There is a newer version of the record available.

Published November 19, 2019 | Version 2
Dataset Open

COCONUT: the COlleCtion of Open NatUral producTs.

  • 1. Friedrich-Schiller University Jena

Description

COCONUT is a COlleCtion of Open NatUral producTs.

To assemble COCONUT, data from 50 open access collections and databases of natural products was retrieved and curated.

This archive contains two files:

  • The MongoDB dump, the most complete version of the dataset, with extensive molecular annotations
  • The COCONUT4MetFrag file, used for MetFrag

To restore the dataset in MongoDB:

unzip COCONUTv2.zip
cd COCONUTv1/COCONUT/
mongorestore --db=COCONUT --noIndexRestore .

It is generally useful to avoid restoring indexes, as they can interfere with the local installation. Here are the commands to rebuild indexes:

mongo
use COCONUT

db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:1})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.uniqueNaturalProduct.createIndex( {inchi:1})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {smiles:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles:1})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})


This version of COCONUT is beta and will be curated further, but can already be used as it is.

Files

COCONUT4MetFrag.csv

Files (3.2 GB)

Name Size Download all
md5:d7736795d57e0404c20c9a3ca2b400e8
120.2 MB Preview Download
md5:7a1f5a6a46c59b9babc1073b00471e38
119.8 MB Preview Download
md5:5490171dfddf98f7fefb1d8c49a66ce2
1.2 GB Preview Download
md5:ad97ebb6ad01907575ddb328c944a827
1.8 GB Preview Download