Published December 2, 2013 | Version v1
Working paper Open

Massively Parallel Algorithm for Multiple Sequence Alignment Based on Artificial Bee Colony

  • 1. National Centre for Supercomputing Applications, Bulgaria

Contributors

  • 1. National Centre for Supercomputing Applications, Bulgaria

Description

In silico biological sequence processing is a key task in molecular biology. This scientific area requires powerful computing
resources for exploring large sets of biological data. Parallel in silico simulations based on methods and algorithms for analysis of
biological data using high-performance distributed computing is essential for accelerating the research and reducing the investment.
Multiple sequence alignment is a widely used method for biological sequence processing. The goal of this method is DNA and protein sequences alignment. This paper presents an innovative parallel algorithm MSA_BG for multiple alignment of biological sequences that is highly scalable and locality aware. The MSA_BG algorithm we describe is iterative and is based on the concept of Artificial Bee Colony metaheuristics and the concept of algorithmic and architectural spaces correlation. The metaphor of the ABC
metaheuristics has been constructed and the functionalities of the agents have been defined. The conceptual parallel model of
computation has been designed and the algorithmic framework of the designed parallel algorithm constructed. Experimental
simulations on the basis of parallel implementation of MSA_BG algorithm for multiple sequences alignment on heterogeneouc
compact computer cluster and supercomputer BlueGene/P have been carried out for the case study of the influenza virus variability
investigation. The performance estimation and profiling analyses have shown that the parallel system is well balanced both in
respect to the workload and machine size.

Files

WP114.pdf

Files (967.7 kB)

Name Size Download all
md5:d820e73d101a141a06c2acf1eed7a97a
967.7 kB Preview Download

Additional details

Funding

PRACE-2IP – PRACE - Second Implementation Phase Project 283493
European Commission