Published May 10, 2021
| Version v1
Poster
Open
SEQUENCE BASED PREDICTION OF PHYTOPHTHORA- HOST INTERACTION USING MACHINE LEARNING METHODS
Authors/Creators
- 1. ICAR-Indian Institute of Spices Research
- 2. ICAR- Central Tuber Crops Research Institute
Description
Phytophthora is a genus of oomycetes that cause extensive crop damage and economic loss. Specific proteins from the host and pathogen facilitate their interaction and mediate a multifaceted mechanism in infection. In this study, we utilized published protein protein interactions between Phytophthora and its hosts for developing a model for prediction. We applied supervised learning algorithms- Support vector machine (SVM) and Ensemble methods to predict interactions.Different features of proteins in host and pathogen proteins like amino acid composition, dipeptide composition, pseudo amino acid composition, amphiphilic pseudo amino acid composition, C/T/D, conjoint triads, autocorrelation, sequence order coupling number, quasi-sequence order descriptors were utilized to develop the model for the binary classification, whether the proteins interact or not. The relative importance of different protein features in the training model were also evaluated. SVM with radial kernel had an accuracy of 75%. Bagging algorithm Random Forest showed an accuracy of 84.6%. A GLM ensemble of K-Nearest Neighbour (KNN), SVM (radial), rpart and random forest gave an accuracy of 70.1%. The model developed can be trained with more experimentally validated interactions to improve the accuracy. Furthermore we constructed a protein-protein interaction network of host and pathogen proteins to depict the interaction network operating during pathogenesis and evaluated the network topology. The results of the study may be taken forward for experimental validation.
Files
1620240215-GLBIO2021_Sona_Charles.pdf
Files
(552.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:42c1b858b7e047f67f8f02df6f4e7d14
|
552.7 kB | Preview Download |