Journal article Open Access

Machine Learnable Fold Space Representation based on Residue Cluster Classes

Corral-Corral, Ricardo; Del Rio, Gabriel; Chavez, Edgar

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Corral-Corral, Ricardo</dc:creator>
  <dc:creator>Del Rio, Gabriel</dc:creator>
  <dc:creator>Chavez, Edgar</dc:creator>


Protein fold space is a conceptual framework where all possible protein folds exist and ideas about protein structure, function and evolution may be analyzed. Classification of protein folds in this space is commonly achieved by using similarity indexes and/or machine learning approaches, each with different limitations.


We propose a method for constructing a compact vector space model of protein fold space by representing each protein structure by its residues local contacts. We developed an efficient method to statistically test for the separability of points in a space and showed that our protein fold space representation is learnable by any machine-learning algorithm.


An API is freely available at</dc:description>
  <dc:source>Computational Biology and Chemistry 59 1-7 (2015)</dc:source>
  <dc:subject>Computational Biology</dc:subject>
  <dc:subject>Machine Learning</dc:subject>
  <dc:subject>Protein Structure</dc:subject>
  <dc:subject>Protein Fold Space</dc:subject>
  <dc:subject>Sperner Family</dc:subject>
  <dc:title>Machine Learnable Fold Space Representation based on Residue Cluster Classes</dc:title>
Views 319
Downloads 184
Data volume 250.7 MB
Unique views 316
Unique downloads 181


Cite as