Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published July 3, 2022 | Version v2
Software Open

Calculation of Exact Shapley Values for Support Vector Machines with Tanimoto Kernel Enables Model Interpretation

  • 1. Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, D-53115 Bonn, Germany.

Description

Code and analysis workflow for calculating Shapley values (SVs) for support vector machines (SVM) with Tanimoto kernel.

The Shapley Value-Expressed Tanimoto similarity (SVETA) approach enables the efficient calculation of SVs for SVM predictions. Code provided in the folder 'sveta' contains functions for the computation of SVETA values and SVM algorithm to explain individual predictions.

Jupyter notebooks containing workflows for assessing SVETA values can be found in the folder 'analysis'.

An exemplary data set consisting of compounds active against the adenosine receptor A3 (UniProt ID: P0DMS8) and random compounds is provided in the folder 'data'.

Files

SVETA_code_and_analysis.zip

Files (20.3 MB)

Name Size Download all
md5:a470df05e8bd0ef00767e73e8cd62985
20.3 MB Preview Download