Published April 9, 2020
| Version 3
Dataset
Open
COCONUT: the COlleCtion of Open NatUral producTs.
Description
COCONUT is a COlleCtion of Open NatUral producTs.
To assemble COCONUT, data from 50 open access collections and databases of natural products was retrieved and curated.
This archive contains two files:
- The MongoDB dump, the most complete version of the dataset, with extensive molecular annotations
- The COCONUT4MetFrag file, used for MetFrag. The last version of COCONUT4MetFrag is in the file "COCONUT4MetFrag_april.csv"
To restore the dataset in MongoDB:
unzip COCONUTapril.zip
cd COCONUTapril /COCONUT/
mongorestore --db=COCONUT --noIndexRestore .
It is generally useful to avoid restoring indexes, as they can interfere with the local installation. Here are the commands to rebuild indexes:
mongo
use COCONUT
db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:"hashed"})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {absoluteSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {idInSource:1})
db.uniqueNaturalProduct.createIndex( {inchi:"hashed"})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles: "hashed"})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.uniqueNaturalProduct.createIndex( {coconut_id:1})
db.uniqueNaturalProduct.createIndex( {fragmentsWithSugar:"hashed"})
db.uniqueNaturalProduct.createIndex( {fragments:"hashed"})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})
db.sourceNaturalProduct.createIndex( {source:1})
db.sourceNaturalProduct.createIndex( {simpleInchi:"hashed"})
db.sourceNaturalProduct.createIndex( {simpleInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalInchiKey:1})
db.sourceNaturalProduct.createIndex( {originalSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {absoluteSmiles:"hashed"})
db.sourceNaturalProduct.createIndex( {idInSource:1})
db.uniqueNaturalProduct.createIndex( {inchi:"hashed"})
db.uniqueNaturalProduct.createIndex( {inchikey:1})
db.uniqueNaturalProduct.createIndex( {clean_smiles: "hashed"})
db.uniqueNaturalProduct.createIndex( {molecular_formula:1})
db.uniqueNaturalProduct.createIndex( {name:1})
db.uniqueNaturalProduct.createIndex( {coconut_id:1})
db.uniqueNaturalProduct.createIndex( {fragmentsWithSugar:"hashed"})
db.uniqueNaturalProduct.createIndex( {fragments:"hashed"})
db.fragment.createIndex({signature:1})
db.fragment.createIndex({signature:1, withsugar:-1})
This version of COCONUT is beta and will be curated further, but can already be used as it is.
Files
COCONUT4MetFrag.csv
Files
(5.3 GB)
Name | Size | Download all |
---|---|---|
md5:48d42ba0e45c54d3b334df0230fe41c6
|
35.2 MB | Download |
md5:d7736795d57e0404c20c9a3ca2b400e8
|
120.2 MB | Preview Download |
md5:c3925f2f8da49047101967438816c9db
|
123.7 MB | Preview Download |
md5:7a1f5a6a46c59b9babc1073b00471e38
|
119.8 MB | Preview Download |
md5:4615507bf83aec1b7124dab4675a8986
|
3.6 GB | Preview Download |
md5:5490171dfddf98f7fefb1d8c49a66ce2
|
1.2 GB | Preview Download |