Published February 4, 2024 | Version v1
Model Open

GH29BERT

  • 1. ROR icon Beijing Jiaotong University
  • 2. ROR icon Quadram Institute

Description

Model Introduction

This repository contains the model parameters for GH29BERT and the task-training model parameters for relating protein pre-training models. GH29BERT is a protein functional cluster prediction model designed for GH29 family sequences. It is trained based on a semi-supervised deep learning method with:

  • a. 34,258 unlabeled and non-redundant GH29 sequences extracted from CAZy and Interpro databases and,
  • b. 2,796 labelled sequences with 45 cluster classes based on a thorough SSN analysis.

Refer to the detailed code and data at https://github.com/ke-xing/GH29BERT.git

Files

model.zip

Files (104.2 MB)

Name Size Download all
md5:340fa1c2805e1980974bfabe00030ea5
104.2 MB Preview Download