Published November 7, 2022 | Version 1
Dataset Open

Predictive modeling of moonlighting DNA-binding proteins

  • 1. School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi-110067, India
  • 2. Computational Structural Biology Section, Cancer Innovation Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA, Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Israel

Contributors

  • 1. School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi-110067, India

Description

This repository contains the codes used for the prediction of moonlighting proteins in the paper "Predictive modeling of moonlighting DNA binding proteins".

The repository is organized as the following:

1. The DNA binding protein identifiers and their features that were used to train the models for the prediction of DNA binding Moonlighting proteins.

2. Five feature sets were used to create Catboost models that make predictions. The source code for generating predictions based on all the features and predictions based on particular features is supplied. In addition, the source code for generating maximum and average ensemble predictions has been made available. A detailed explanation is given in README file.

Files

README.md

Files (4.7 MB)

Name Size Download all
md5:ce2b97036ee7bb9ebb1a4ee598730333
2.8 kB Download
md5:728fa864a3948f718280d8e28b2529ce
4.7 MB Download
md5:25ba5080cd25c5cfcbea4c3b569706e6
4.5 kB Download
md5:4fedaa7549eda1e443730bbb8d0f9770
4.5 kB Download
md5:0438d5a401f7b93c31c8bdfd69c3f8b7
4.6 kB Download
md5:e9696eaf9e57826a2443f25fae32546a
4.5 kB Download
md5:86d1c5c53ae15078f3a169fb28931b19
4.6 kB Preview Download
md5:3b6f277b6b5d0bcc67f869ff270b5f51
4.5 kB Download
md5:772566c4fd82cfeb53096c86d4f84bab
4.5 kB Download