OPTSDNA: Performance evaluation of an efficient distributed bioinformatics system for DNA sequence analysis

Bioinformation. 2013 Sep 23;9(16):842-6. doi: 10.6026/97320630009842. eCollection 2013.

Abstract

Storage of sequence data is a big concern as the amount of data generated is exponential in nature at several locations. Therefore, there is a need to develop techniques to store data using compression algorithm. Here we describe optimal storage algorithm (OPTSDNA) for storing large amount of DNA sequences of varying length. This paper provides performance analysis of optimal storage algorithm (OPTSDNA) of a distributed bioinformatics computing system for analysis of DNA sequences. OPTSDNA algorithm is used for storing various sizes of DNA sequences into database. DNA sequences of different lengths were stored by using this algorithm. These input DNA sequences are varied in size from very small to very large. Storage size is calculated by this algorithm. Response time is also calculated in this work. The efficiency and performance of the algorithm is high (in size calculation with percentage) when compared with other known with sequential approach.

Keywords: DNA Sequence; Distributed Bioinformatics System; Optimal Storage; Performance Measurement; Sequential Approach.