10.5281/zenodo.3445476
https://zenodo.org/records/3445476
oai:zenodo.org:3445476
Daga, Pankaj R.
Pankaj R.
Daga
0000-0002-2508-0903
Simulation Plus Inc.
Data Curation: The Forgotten Practice in the Era of AI
Zenodo
2019
Open Force Field Initiative
OFFwebinar
Drug discovery
Data mining
Data curation
AI
ADMET
QSAR
Molecular descriptors
2019-09-18
eng
Presentation
https://youtu.be/pjlFHGuVlLo
10.5281/zenodo.3445475
https://zenodo.org/communities/openforcefield
Creative Commons Attribution 4.0 International
Pankaj R. Daga from Simulation-Plus visited the Mobley group at UC Irvine on Sep 13, 2019 and gave a talk as a part of OFF seminar series about all the hazards that can appear in trying to automate mining of chemical and chemistry-related databases.
Abstract: Availability of large databases of chemical structures along with experimental data provides a great opportunity to build predictive and robust QSAR/QSPR models for application in various fields. The most common concern while using these databases is the quality of the chemical structures and associated biological data. It is very important to deal with correct chemical structure since incorrect structure will lead to the errors in calculation of molecular descriptors. Incorrect biological data will ultimately lead to meaningless results. This seminar will discuss experiences while curating these bioactivity databases with focus towards ADMET properties in drug discovery. Various sources of these errors and measures to find and correct these errors will be discussed.