Published June 20, 2014 | Version v1
Working paper Open

Novel HPC Technologies for Rapid Analysis in Bioinformatics

Creators

  • 1. Centre Informatique National de l'Enseignement Supérieur, 950, rue de Saint Priest, 34097 Montpellier Cedex 5, France.
  • 1. Centre Informatique National de l'Enseignement Supérieur, 950, rue de Saint Priest, 34097 Montpellier Cedex 5, France.
  • 2. Irish Centre for High-End Computing, 7/F Tower Blg., Trinity Technology & Enterprise Campus, Grand Canal Quay, Dublin 2, Ireland.
  • 3. Grand Equipement National de Calcul Intensif, 12, rue de l'Eglise, 75015 Paris, France.
  • 4. NSilico Lifescience Ltd., Melbourne Building, CIT Campus, Bishopstown, Cork City, Ireland.

Description

NSilico is an Irish based SME that develops software for the life sciences sector, providing bioinformatics and
medical informatics systems to a range of clients. One of the major challenges that their users face is the
exponential growth of high-throughput genomic sequence data and the associated computational demands to
process such data in a fast and efficient manner. Genomic sequences contain gigabytes of nucleotide data that
require detailed comparison with similar sequences in order to determine the nature of functional, structural and
evolutionary relationships. In this regard NSilico has been working with computational experts from CINES
(France) and ICHEC (Ireland) under the PRACE SHAPE programme to address a key problem that is the rapid
alignment of short DNA sequences to reference genomes by deploying the Smith-Waterman algorithm on an
emerging many-core technology, the Intel Xeon Phi co-processor. This white paper will discuss some of the
parallelisation and optimisation strategies adopted to achieve performance improvements of the algorithm
keeping in mind both existing and future versions of the hardware. The outcome has been an extremely
successful collaboration between PRACE and NSilico, resulting in an implementation of the algorithm that can
be readily deployed to realise significant performance gains from the next generation of many-core hardware.

Files

WP195.pdf

Files (285.8 kB)

Name Size Download all
md5:61cd1410ce0de7185cc72d64ea95705c
285.8 kB Preview Download

Additional details

Funding

PRACE-3IP – PRACE - Third Implementation Phase Project 312763
European Commission