Code and datafiles with PNAS publication "Defining a new nomenclature for the structures of active and inactive kinases"
Description
We are providing Perl scripts and data files required to perform clustering reported in PNAS manuscript "Defining a new nomenclature for the structures of active and inactive kinases". Following is a brief description of the uploaded files:
1. kinasefamilylist.txt - List of protein kinases analyzed in the study with their Uniprot, gene names and domain boundaries.
2. kinasepmlz.pl: Perl script which uses PSI-BLAST to identify and download the protein kinase structures from PDB. It also updates the residue numbering scheme of downloaded PDBs with Uniprot numbering scheme.
3. readsiftz.pl: Perl script to download Sifts files for all the PDBs in study to get the residue correspondence between PDB numbering and Uniprot numbering scheme for all the kinases.
4. dfg.pl: Perl script that reads data file containing kinase structure list and dihedral angles of residues (kinasedata.txt) and prints distance matrix. This distance matrix is used as an input to dbscan function in fpc package in R.
5. kinasedata.txt: Data file containing list of kinases with dihedral angles of residues used in clustering. This data file is the input for the script dfg.pl.
Files
kinasedata.txt
Files
(784.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:fa2e7838b6583763c7196bd10c0d83b6
|
5.2 kB | Download |
|
md5:d15e9eaa0dcb476e9b6b2ec44acb5263
|
715.1 kB | Preview Download |
|
md5:c4780a3a364f42573f48a0199dbaaf32
|
24.8 kB | Preview Download |
|
md5:bc01dccfc47a0d65c06acaa6318a620d
|
34.3 kB | Download |
|
md5:8e09728eafa978f99325298b86009192
|
5.1 kB | Download |