Computational Method for Annotation of Protein Sequence According to Gene Ontology Terms
Creators
Description
Annotation of a protein sequence is pivotal for the understanding of its function. Accuracy of manual annotation provided by curators is still questionable by having lesser evidence strength and yet a hard task and time consuming. A number of computational methods including tools have been developed to tackle this challenging task. However, they require high-cost hardware, are difficult to be setup by the bioscientists, or depend on time intensive and blind sequence similarity search like Basic Local Alignment Search Tool. This paper introduces a new method of assigning highly correlated Gene Ontology terms of annotated protein sequences to partially annotated or newly discovered protein sequences. This method is fully based on Gene Ontology data and annotations. Two problems had been identified to achieve this method. The first problem relates to splitting the single monolithic Gene Ontology RDF/XML file into a set of smaller files that can be easy to assess and process. Thus, these files can be enriched with protein sequences and Inferred from Electronic Annotation evidence associations. The second problem involves searching for a set of semantically similar Gene Ontology terms to a given query. The details of macro and micro problems involved and their solutions including objective of this study are described. This paper also describes the protein sequence annotation and the Gene Ontology. The methodology of this study and Gene Ontology based protein sequence annotation tool namely extended UTMGO is presented. Furthermore, its basic version which is a Gene Ontology browser that is based on semantic similarity search is also introduced.
Files
9098.pdf
Files
(542.3 kB)
Name | Size | Download all |
---|---|---|
md5:c587a4b0a4ae4d52cb3d713584b37c9d
|
542.3 kB | Preview Download |
Additional details
References
- A. Chinnasamy, A. Mittal, and W.K. Sung, "Probabilistic prediction of protein-protein interactions from the protein sequences," Computers in Biology & Medicine, vol. 36, no. 10, pp. 1143-1154, Oct. 2006.
- L. Pireddu, D. Szafron, P. Lu, and R. Greiner, "The Path-A metabolic pathway prediction web server," Nucleic Acids Research, vol. 34, no. 1, pp. W714-W719, Jul. 2006.
- G.K. Acquaah-Mensah, S.M. Leach, and C. Guda, "Predicting the subcellular localization of human proteins using machine learning and exploratory data analysis," Genomics Proteomics Bioinformatics, vol. 4, no. 2, pp. 120-133, May 2006.
- E.J. Whitfield, M. Pruess, and R. Apweiler, "Bioinformatics database infrastructure for biotechnology research," J. Biotechnology, vol. 124, no. 4, pp. 629-639, Aug. 2006.
- C. Brooksbank, G. Cameron, and J. Thornton, "The European Bioinformatics Institute-s data resources: towards systems biology," Nucleic Acids Research, vol. 33, no. 1, pp. D46-D53, Jan. 2005.
- R. Apweiler, A. Bairoch, and C.H. Wu, "Protein sequence databases," Current Opinion in Chemical Biology, vol. 8, no. 1, pp. 76-80, Feb. 2004.
- E. Kretschmann, W. Fleischmann, and R. Apweiler, "Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT," Bioinformatics, vol. 17, no. 10, pp. 920-926, Oct. 2001.
- C.H. Wu, H. Huang, L.S. Yeh, and W.C. Barker, "Protein family classification and functional annotation," Computational Biology & Chemistry, vol. 27, no. 1, pp. 37-47, Feb. 2003.
- A. Gattiker, K. Michoud, C. Rivoire, A.H. Auchincloss, E. Coudert, T. Lima, P. Kersey, M. Pagni, C.J. Sigrist, C. Lachaize, A.L. Veuthey, E. Gasteiger, and A. Bairoch, "Automated annotation of microbial proteomes in SWISS-PROT," Computational Biology & Chemistry, vol. 27, no. 1, pp. 49-58, Feb. 2003. [10] W. Fleischmann, S. Moller, A. Gateau, and R. Apweiler, "A novel method for automatic functional annotation of proteins," Bioinformatics, vol. 15, no. 3, pp. 228-233, Mar. 1999. [11] R. Apweiler, A. Bairoch, C.H. Wu, W.C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, M.J. Martin, D.A. Natale, C. O-Donovan, N. Redaschi, and L.S. Yeh, "UniProt: the Universal Protein knowledgebase," Nucleic Acids Research, vol. 32, no. 1, pp. D115-D119, Jan. 2004. [12] C.H. Wu, R. Apweiler, A. Bairoch, D.A. Natale, W.C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, M.J. Martin, R. Mazumder, C. O-Donovan, N. Redaschi, and B. Suzek, "The Universal Protein Resource (UniProt): an expanding universe of protein information," Nucleic Acids Research, vol. 34, no. 1, pp. D187- D191, Jan. 2006. [13] A. Bairoch, R. Apweiler, C.H. Wu, W.C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, M.J. Martin, D.A. Natale, C. O-Donovan, N. Redaschi, and L.S. Yeh, "The Universal Protein Resource (UniProt)," Nucleic Acids Research, vol. 33, no. 1, pp. D154-D159, Jan. 2005. [14] K.A. Snyder, H.J. Feldman, M. Dumontier, J.J. Salama, and C.W. Hogue, "Domain-based small molecule binding site annotation," Bioinformatics, vol. 22, no. 3, pp. 291-296, Feb. 2006. [15] L.B. Koski, M.W. Gray, B.F. Lang, and G. Burger, "AutoFACT: an automatic functional annotation and classification tool," BMC Bioinformatics, vol. 6, no. 1, rec. 151, Jun. 2005. [16] C.E. Jones, U. Baumann, and A.L. Brown, "Automated methods of predicting the function of biological sequences using GO and BLAST," BMC Bioinformatics, vol. 6, no. 1, rec. 272, Nov. 2005. [17] A. Prlic, F.S. Domingues, P. Lackner, and M.J. Sippl, "WILMAautomated annotation of protein sequences," Bioinformatics, vol. 20, no. 1, pp. 127-128, Jan. 2004. [18] X. Yuan, Z.Z. Hu, H.T. Wu, M. Torii, M. Narayanaswamy, K.E. Ravikumar, K. Vijay-Shanker, and C.H. Wu, "An online literature mining tool for protein phosphorylation," Bioinformatics, vol. 22, no. 13, pp. 1668-1669, Jul. 2006. [19] J.H. Chiang and H.C. Yu, "Literature extraction of protein functions using sentence pattern mining," IEEE Trans. Knowledge & Data Engineering, vol. 17, no. 8, pp. 1088-1098, Aug. 2005. [20] C.J.A. Sigrist, E.D. Castro, P.S. Langendijk-Genevaux, V.L. Saux, A. Bairoch, and N. Hulo, "ProRule: a new database containing functional and structural information on PROSITE profiles," Bioinformatics, vol. 21, no. 21, pp. 4060-4066, Aug. 2005. [21] G.X. Yu, "Ruleminer: a knowledge system for supporting highthroughput protein function annotations," J. Bioinformatics & Computational Biology, vol. 2, no. 4, pp. 595-617, Dec. 2004.[22] J. Morbach, A. Yang, and W. Marquardt, "OntoCAPEÔÇöa large-scale ontology for chemical process engineering," Engineering Applications Artificial Intelligence, vol. 20, no. 2, pp. 147-161, Mar. 2007. [23] R.J. Williams, N.D. Martinez, and J. Golbeck, "Ontologies for ecoinformatics," Web Semantics: Science, Services & Agents on the World Wide Web, vol. 4, no. 4, pp. 237-242, Dec. 2006. [24] M. Naphade, J.R. Smith, J. Tesic, S. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis, "Large-scale concept ontology for multimedia," IEEE Multimedia, vol. 13, no. 3, pp. 86-91, Jul.-Sep. 2006. [25] J. Köhler, S. Philippi, M. Specht, and A. R├╝egg, "Ontology based text indexing and querying for the semantic web," Knowledge-Based Systems, vol. 19, no. 8, pp. 744-754, Dec. 2006. [26] C. Hess and C. Schlieder, "Ontology-based verification of core model conformity in conceptual modeling," Computers, Environment & Urban Systems, vol. 30, no. 5, pp. 543-561, Sep. 2006. [27] D. Pérez-Rey, V. Maojo, M. Garc├¡a-Remesal, R. Alonso-Calvo, H. Billhardt, F. Martin-S├ínchez, and A. Sousa, "ONTOFUSION: ontologybased integration of genomic and clinical databases," Computers in Biology & Medicine, vol. 36, no. 7-8, pp. 712-730, Jul.-Aug. 2006. [28] E. Camon, M. Magrane, D. Barrell, V. Lee, E. Dimmer, J. Maslen, D. Binns, N. Harte, R. Lopez, and R. Apweiler, "The Gene Ontology Annotation (GOA) database: sharing knowledge in Uniprot with gene ontology," Nucleic Acids Research, vol. 32, no. 1, pp. D262-266, Jan. 2004. [29] A. Lewin and I.C. Grieve, "Grouping gene ontology terms to improve the assessment of gene set enrichment in microarray data," BMC Bioinformatics, vol. 7, no. 1, rec. 426, Oct. 2006. [30] X. Wu, L. Zhu, J. Guo, D.Y. Zhang, and K. Lin, "Prediction of yeast protein-protein interaction network: insights from the gene ontology and annotations," Nucleic Acids Research, vol. 34, no. 7, pp. 2137-2150, Apr. 2006. [31] Z. Cai, X. Mao, S. Li, and L. Wei, "Genome comparison using Gene Ontology (GO) with statistical testing," BMC Bioinformatics, vol. 7, no. 1, rec. 374, Aug. 2006. [32] B. Zheng, D.C. McLean, and X. Lu, "Identifying biological concepts from a protein-related corpus with a probabilistic topic model," BMC Bioinformatics, vol. 7, no. 1, rec. 58, Feb. 2006. [33] The Gene Ontology Consortium, "The Gene Ontology (GO) project in 2006," Nucleic Acids Research, vol. 34, no. 1, pp. D322-D326, Jan. 2006. [34] J. Lomax, "Get ready to GO! A biologist-s guide to the gene ontology," Briefings in Bioinformatics, vol. 6, no. 3, pp. 298-304, Sep. 2005. [35] M.A. Harris, J. Lomax, A. Ireland, and J.I. Clark, "The gene ontology project," in Encyclopedia Genetics, Genomics, Proteomics & Bioinformatics, part 4, S. Subramaniam, Ed. New York: John Wiley & Sons, 2005. [36] M. Bada, R. Stevens, C. Goble, Y. Gil, M. Ashburner, J.A. Blake, J.M. Cherry, M.A. Harris, and S. Lewis, "A short study on the success of the gene ontology," J. Web Semantics, vol. 1, no. 2, pp. 235-240, Feb. 2004. [37] The Gene Ontology Consortium, "The Gene Ontology (GO) database and informatics resource," Nucleic Acids Research, vol. 32, no. 1, pp. D258-D261, Jan. 2004. [38] The Gene Ontology Consortium, "Creating the gene ontology resource: design and implementation," Genome Research, vol. 11, no. 8, pp. 1425- 1433, Aug. 2001. [39] The Gene Ontology Consortium, "Gene ontology: tool for the unification of biology," Nature Genetics, vol. 25, no. 1, pp. 25-29, May 2000. [40] D. Beckett. (2004, Feb. 10). RDF/XML syntax specification (revised) [Online]. Available: http://www.w3.org/TR/rdf-syntax-grammar. [41] J. Ye, L. Fang, H. Zheng, Y. Zhang, J. Chen, Z. Zhang, J. Wang, S. Li, R. Li, L. Bolund, and J. Wang, "WEGO: a web tool for plotting GO annotations," Nucleic Acids Research, vol. 34, no. 1, pp. W293-W297, Jul. 2006. [42] H.K. Lee, W. Braynen, K. Keshav, and P. Pavlidis, "ErmineJ: tool for functional analysis of gene expression data sets," BMC Bioinformatics, vol. 6, no. 1, rec. 269, Nov. 2005. [43] H. Liu, Z.Z. Hu, and C.H. Wu, "DynGO: a tool for visualizing and mining of gene ontology and its associations," BMC Bioinformatics, vol. 6, no. 1, rec. 201, Aug. 2005. [44] S. Aitken, R. Korf, B. Webber, and J. Bard, "COBrA: a bio-ontology editor," Bioinformatics, vol. 21, no. 6, pp. 825-826, Mar. 2005.[45] W. Cai, S. Chen, and D. Zhang, "Fast and robust fuzzy c-means clustering algorithms incorporating local information for image segmentation," Pattern Recognition, vol. 40, no. 3, pp. 825-838, Mar. 2007. [46] M. Friedman, M. Last, Y. Makover, and A. Kandel, "Anomaly detection in web documents using crisp and fuzzy-based cosine clustering methodology," Information Sciences, vol. 177, no. 2, pp. 467-475, Jan. 2007. [47] J.J. Huang, G.H. Tzeng, and C.S. Ong, "Marketing segmentation using support vector clustering," Expert Systems with Applications, vol. 32, no. 2, pp. 313-317, Feb. 2007. [48] W. Zhong, J. He, R. Harrison, P.C. Tai, and Y. Pan, "Clustering support vector machines for protein local structure prediction," Expert Systems with Applications, vol. 32, no. 2, pp. 518-526, Feb. 2007. [49] G.P. Papamichail and D.P. Papamichail, "The k-means range algorithm for personalized data clustering in e-commerce," European J. Operational Research, vol. 177, no. 3, pp. 1400-1408, Mar. 2007. [50] Z.S.H. Chan, L. Collins, and N. Kasabov, "An efficient greedy k-means algorithm for global gene trajectory clustering," Expert Systems with Applications, vol. 30, no. 1, pp. 137-141, Jan. 2006. [51] A.C. Martinez-Estudillo, C. Hervas-Martinez, F.J. Martinez-Estudillo, and N. Garcia-Pedrajas, "Hybridization of evolutionary algorithms and local search by means of a clustering method," IEEE Trans. Systems, Man & Cybernetics, Part B, vol. 36, no. 3, pp. 534-545, Jun. 2006. [52] D.F. Rogers and S.S. Kulkarni, "Optimal bivariate clustering and a genetic algorithm with an application in cellular manufacturing," European J. Operational Research, vol. 160, no. 2, pp. 423-444, Jan. 2005. [53] J.Y. Hwang and W. Kuo, "Model-based clustering for integrated circuit yield enhancement," European J. Operational Research, vol. 178, no. 1, pp. 143-153, Apr. 2007. [54] Y. Zeng and J. Garcia-Frias, "A novel HMM-based clustering algorithm for the analysis of gene expression time-course data," Computational Statistics & Data Analysis, vol. 50, no. 9, pp. 2472-2494, May 2006. [55] H.M. Torres, J.A. Gurlekian, H.L. Rufiner, and M.E. Torres, "Selforganizing map clustering based on continuous multiresolution entropy," Physica A: Statistical Mechanics & its Applications, vol. 361, no. 1, pp. 337-354, Feb. 2006. [56] S. Mitra, H. Banka, and W. Pedrycz, "Rough-fuzzy collaborative clustering," IEEE Trans. Systems, Man & Cybernetics, Part B, vol. 36, no. 4, pp. 795-805, Aug. 2006. [57] G. Peters, "Some refinements of rough k-means clustering," Pattern Recognition, vol. 39, no. 8, pp. 1481-1491, Aug. 2006. [58] E. Zio and P. Baraldi, "Evolutionary fuzzy clustering for the classification of transients in nuclear components," Progress in Nuclear Energy, vol. 46, no. 3-4, pp. 282-296, Apr. 2005. [59] S.A. Mingoti and J.O. Lima, "Comparing SOM neural network with fuzzy c-means, k-means and traditional hierarchical clustering algorithms," European J. Operational Research, vol. 174, no. 3, pp. 1742-1759, Nov. 2006. [60] G. Chicco, R. Napoli, and F. Piglione, "Comparisons among clustering techniques for electricity customer classification," IEEE Trans. Power Systems, vol. 21, no. 2, pp. 933-940, May 2006. [61] H. Guldem─▒r and A. Sengur, "Comparison of clustering algorithms for analog modulation classification," Expert Systems with Applications, vol. 30, no. 4, pp. 642-649, May 2006. [62] P.C.H. Ma, K.C.C. Chan, X. Yao, and D.K.Y. Chiu, "An evolutionary clustering algorithm for gene expression microarray data analysis," IEEE Trans. Evolutionary Computation, vol. 10, no. 3, pp. 296-314, Jun. 2006. [63] M. Laszlo and S. Mukherjee, "A genetic algorithm using hyperquadtrees for low-dimensional k-means clustering," IEEE Trans. Pattern Analysis & Machine Intelligence, vol. 28, no. 4, pp. 533-543, Apr. 2006. [64] W. Sheng, W. Swift, L. Zhang, and X. Liu, "A weighted sum validity function for clustering with a hybrid niching genetic algorithm," IEEE Trans. Systems, Man & Cybernetics, Part B, vol. 35, no. 6, pp. 1156- 1167, Dec. 2005. [65] S. Bandyopadhyay, "Simulated annealing using a reversible jump Markov chain Monte Carlo algorithm for fuzzy clustering," IEEE Trans. Knowledge & Data Engineering, vol. 17, no. 4, pp. 479-490, Apr. 2005. [66] C. Aykanat, B.B. Cambazoglu, F. Findik, and T. Kurc, "Adaptive decomposition and remapping algorithms for object-space-parallel direct volume rendering of unstructured grids," J. Parallel & Distributed Computing, vol. 67, no. 1, pp. 77-99, Jan. 2007. [67] A. Duarte, ├ü. S├ínchez, F. Fern├índez, and A.S. Montemayor, "Improving image segmentation quality through effective region merging using a hierarchical social metaheuristic," Pattern Recognition Letters, vol. 27, no. 11, pp. 1239-1251, Aug. 2006. [68] M. Boulif and K. Atif, "A new branch-&-bound-enhanced genetic algorithm for the manufacturing cell formation problem," Computers & Operations Research, vol. 33, no. 8, pp. 2219-2245, Aug. 2006. [69] B.S. Mitchell and S. Mancoridis, "On the automatic modularization of software systems using the Bunch tool," IEEE Trans. Software Engineering, vol. 32, no. 3, pp. 193-208, Mar. 2006. [70] J. Grabowski and J. Pempera, "The permutation flow shop problem with blocking. A tabu search approach," Omega, vol. 35, no. 3, pp. 302-311, Jun. 2007. [71] M. Sun, "Solving the uncapacitated facility location problem using tabu search," Computers & Operations Research, vol. 33, no. 9, pp. 2563- 2589, Sep. 2006. [72] M.K. Tiwari, S. Kumar, Prakash, and R. Shankar, "Solving part-type selection and operation allocation problems in an FMS: an approach using constraints-based fast simulated annealing algorithm," IEEE Trans. Systems, Man & Cybernetics, Part A, vol. 36, no. 6, pp. 1170- 1184, Nov. 2006. [73] G. Attiya and Y. Hamam, "Task allocation for maximizing reliability of distributed systems: a simulated annealing approach," J. Parallel & Distributed Computing, vol. 66, no. 10, pp. 1259-1266, Oct. 2006. [74] M. Moz and M.V. Pato, "A genetic algorithm approach to a nurse rerostering problem," Computers & Operations Research, vol. 34, no. 3, pp. 667-691, Mar. 2007. [75] I.H. Toroslu and Y. Arslanoglu, "Genetic algorithm for the personnel assignment problem with multiple objectives," Information Sciences, vol. 177, no. 3, pp. 787-803, Feb. 2007. [76] A.C. Zecchin, A.R. Simpson, H.R. Maier, M. Leonard, A.J. Roberts, and M.J. Berrisford, "Application of two ant colony optimisation algorithms to water distribution system optimization," Mathematical & Computer Modelling, vol. 44, no. 5-6, pp. 451-468, Sep. 2006. [77] P.Y. Yin and J.Y. Wang, "Ant colony optimization for the nonlinear resource allocation problem," Applied Mathematics & Computation, vol. 174, no. 2, pp. 1438-1453, Mar. 2006. [78] J.S. Heo, K.Y. Lee, and R. Garduno-Ramirez, "Multiobjective control of power plants using particle swarm optimization techniques," IEEE Trans. Energy Conversion, vol. 21, no. 2, pp. 552-561, Jun. 2006. [79] S.H. Jacobson, L.A. McLay, S.N. Hall, D. Henderson, and D.E. Vaughan, "Optimal search strategies using simultaneous generalized hill climbing algorithms," Mathematical & Computer Modelling, vol. 43, no. 9-10, pp. 1061-1073, May 2006. [80] L. You and S. Wood, "Assessing the spatial distribution of crop areas using a cross-entropy method," Int-l J. Applied Earth Observation & Geoinformation, vol. 7, no. 4, pp. 310-323, Dec. 2005. [81] M.A. Arostegui Jr., S.N. Kadipasaoglu, and B.M. Khumawala, "An empirical comparison of tabu search, simulated annealing, and genetic algorithms for facilities location problems," Int-l J. Production Economics, vol. 103, no. 2, pp. 742-754, Oct. 2006. [82] S. Kannan, S.M.R. Slochanal, and N.P. Padhy, "Application and comparison of metaheuristic techniques to generation expansion planning problem," IEEE Trans. Power Systems, vol. 20, no. 1, pp. 466- 475, Feb. 2005. [83] E. Elbeltagi, T. Hegazy, and D. Grierson, "Comparison among five evolutionary-based optimization algorithms," Advanced Engineering Informatics, vol. 19, no. 1, pp. 43-53, Jan. 2005. [84] S. Kumar, S.H. Ong, S. Ranganath, and F.T. Chew, "A luminance- and contrast-invariant edge-similarity measure," IEEE Trans. Pattern Analysis & Machine Intelligence, vol. 28, no. 12, pp. 2042-2048, Dec. 2006. [85] J.D. Clayden, M.E. Bastin, and A.J. Storkey, "Improved segmentation reproducibility in group tractography using a quantitative tract similarity measure," NeuroImage, vol. 33, no. 2, pp. 482-492, Nov. 2006. [86] P. Paclik, J. Novovicova, and R.P.W. Duin, "Building road-sign classifiers using a trainable similarity measure," IEEE Trans. Intelligent Transportation Systems, vol. 7, no. 3, pp. 309-321, Sep. 2006. [87] Y. Peng and C.W. Ngo, "Clip-based similarity measure for querydependent clip retrieval and video summarization," IEEE Trans. Circuits & Systems for Video Technology, vol. 16, no. 5, pp. 612-627, May 2006.[88] F. van der Meer, "The effectiveness of spectral similarity measures for the analysis of hyperspectral imagery," Int-l J. Applied Earth Observation & Geoinformation, vol. 8, no. 1, pp. 3-17, Jan. 2006. [89] M. Popescu, J.M. Keller, and J.A. Mitchell, "Fuzzy measures on the gene ontology for gene product similarity," IEEE/ACM Trans. Computational Biology & Bioinformatics, vol. 3, no. 3, pp. 263-274, Jul.-Sep. 2006. [90] X. Chen, J. Tian, and X. Yang, "A new algorithm for distorted fingerprints matching based on normalized fuzzy similarity measure," IEEE Trans. Image Processing, vol. 15, no. 3, pp. 767-776, Mar. 2006. [91] S. Lee and M.M. Crawford, "Unsupervised multistage image classification using hierarchical clustering with a bayesian similarity measure," IEEE Trans. Image Processing, vol. 14, no. 3, pp. 312-320, Mar. 2005. [92] B. Moghaddam, C. Nastar, and A. Pentland, "A bayesian similarity measure for deformable image matching," Image & Vision Computing, vol. 19, no. 5, pp. 235-244, Apr. 2001. [93] D. Skerl, B. Likar, and F. Pernus, "A protocol for evaluation of similarity measures for rigid registration," IEEE Trans. Medical Imaging, vol. 25, no. 6, pp. 779-791, Jun. 2006. [94] H. N├║├▒ez, M. S├ánchez-Marrè, U. Cortés, J. Comas, M. Mart├¡nez, I. Rodr├¡guez-Roda, and M. Poch, "A comparative study on the use of similarity measures in case-based reasoning to improve the classification of environmental system situations," Environmental Modelling & Software, vol. 19, no. 9, pp. 809-819, Sep. 2004. [95] M. Kirac, G. Ozsoyoglu, and J. Yang., "Annotating proteins by mining protein interaction networks," Bioinformatics, vol. 22, no. 14, pp. e260- e270, Jul. 2006. [96] S. Ray and M. Craven, "Learning statistical models for annotating proteins with function information using biomedical text," BMC Bioinformatics, vol. 6, no. 1, pp. rec. S18, May 2005. [97] J.V. Ponomarenko, P.E. Bourne, and I.N. Shindyalov, "Assigning new GO annotations to protein data bank sequences by combining structure and sequence homology," Proteins, vol. 58, no. 4, pp. 855-865, Mar. 2005. [98] R.M. Othman, S. Deris, R.M. Illias, Z. Zakaria, and S.M. Mohamad, "Automatic clustering of gene ontology by genetic algorithm," Int'l J. Information Technology, vol. 3, no. 1, pp. 37-46, Apr. 2006. [99] R.M. Othman, S. Deris, R.M. Illias, H.T. Alashwal, R. Hassan, and F. Mohamed, "Incorporating semantic similarity measure in genetic algorithm: an approach for searching the gene ontology terms," Int'l J. Computational Intelligence, vol. 3, no. 3, pp. 257-266, May 2006. [100] R.M. Othman, S. Deris, and R.M. Illias, "UTMGO: a tool for searching a group of semantically related gene ontology terms and application to annotation of anonymous protein sequence," Int'l J. Biomedical Sciences, vol. 1, no. 2, pp. 111-119, Jul. 2006. [101] E. Takashima, Y. Murata, N. Shibata, and M. Ito, "Techniques to improve exploration efficiency of parallel self-adaptive genetic algorithms by dispensing with iteration and synchronization," Systems & Computers in Japan, vol. 37, no. 14, pp. 25-33, Dec. 2006. [102] Rahul, D. Chakraborty, and A. Dutta, "Optimization of FRP composites against impact induced failure using island model parallel genetic algorithm," Composites Science & Technology, vol. 65, no. 13, pp. 2003-2013, Oct. 2005. [103] K. Katayama, H. Hirabayashi, and H. Narihisa, "Analysis of crossovers and selections in a coarse-grained parallel genetic algorithm," Mathematical & Computer Modelling, vol. 38, no. 11-13, pp. 1275- 1282, Dec. 2003.