On the tradeoff between speedup and energy consumption in high performance computing - A bioinformatics case study

Sachin Pawaskar, Hesham H. Ali

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

High performance computing has been very useful to researchers in bioinformatics, medicine, and related fields. The bioinformatics domain is rich in applications that require extracting useful information from very large and continuously growing sequence databases. Automated techniques such as DNA sequencers and DNA microarrays continually add to the datasets stored in large public databases such as GenBank and the Protein Data Bank. Most methods used for analyzing genetic and protein data are extremely computationally intensive, which motivates the use of powerful computers or high-throughput systems. In this paper, we provide a case study of one such bioinformatics application, BLAT, running in a high performance computing environment. We use sequences gathered from researchers and parallelize the runs to study the performance characteristics under three different query and data partitioning models. This research highlights the need to develop a parallel model carefully and with energy awareness in mind: the design should build on an understanding of the application so that the model works well for the specific application and domain. We found that the BLAT program is highly parallelizable and that a high degree of speedup is achievable. The experiments suggest that the speedup depends on the model used for query and database segmentation.

Original language: English (US)
Title of host publication: Proceedings of the 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010
Publisher: ACTA Press
Pages: 218-225
Number of pages: 8
ISBN (Print): 9780889868205
State: Published - 2010
Event: 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010 - Innsbruck, Austria
Duration: Feb 16 2010 to Feb 18 2010

Publication series

Name: Proceedings of the 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010

Conference

Conference: 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010
Country/Territory: Austria
City: Innsbruck
Period: 2/16/10 to 2/18/10

Keywords

  • Bioinformatics
  • Energy awareness
  • High performance computing
  • Parallel processing
  • Sequence comparisons

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Software
