Generated Tautomeric forms for Ames mutagenicity structure dataset
Authors/Creators
Description
The raw structures were extracted from DOI: 10.1021/ci900161g. All structures were pre-processed using ChemAxon Standardizer version 5.12.2: SMILES linear notation was extracted from the SDF files, aromatic structures were kekulized, explicit hydrogen atoms were converted to implicit ones, and stereo information was removed. All tautomeric forms for the test structures were generated with the Ambit-Tautomer software [https://doi.org/10.1002/minf.201200133] using the IA-DFS algorithm (an incremental approach based on depth-first search), with tautomeric rules for 1,3 and 1,5 hydrogen shifts and with removal of topologically equivalent atoms and allene atoms. For the Ames mutagenicity dataset of 5 451 structures, 73 028 tautomeric forms were generated.
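As a rough sense of scale, the two counts quoted in the description imply an average number of tautomeric forms per structure. The short calculation below is only a sketch: the variable names are illustrative, and it assumes the total of 73 028 can simply be divided by the 5 451 input structures (the description does not say whether the original structures are counted among the generated forms).

```python
# Counts taken from the dataset description.
n_structures = 5451        # input structures in the Ames mutagenicity dataset
n_tautomeric_forms = 73028  # generated tautomeric forms

# Average number of tautomeric forms per input structure (assumed simple ratio).
mean_forms_per_structure = n_tautomeric_forms / n_structures

print(f"{mean_forms_per_structure:.1f} forms per structure on average")
```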
Files
dataset_AMES.csv
Total size: 3.8 MB

| Checksum | Size |
|---|---|
| md5:1b8f5759359745ca76a4b9143d945c42 | 187.2 kB |
| md5:cde7d5a4d7a635c25d6d494a503a7bed | 3.6 MB |
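After downloading, a file can be checked against the MD5 checksums listed above. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `matches_listed_checksum` are illustrative, and because the listing does not state which checksum belongs to which file, the check only tests membership in the set of listed values.

```python
import hashlib

# MD5 checksums as listed in the file table. The mapping of checksum to
# file name is not given there, so they are kept as an unordered set.
LISTED_MD5 = {
    "1b8f5759359745ca76a4b9143d945c42",
    "cde7d5a4d7a635c25d6d494a503a7bed",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path):
    """Return True if the file's MD5 digest equals one of the listed checksums."""
    return md5_of_file(path) in LISTED_MD5
```

For example, `matches_listed_checksum("dataset_AMES.csv")` should return `True` for an intact download of that file, assuming it is saved under that name in the working directory.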