A high performance cloud computing platform for mRNA analysis

Annu Int Conf IEEE Eng Med Biol Soc. 2013:2013:1510-3. doi: 10.1109/EMBC.2013.6609799.

Abstract

Multiclass classification is an important technique to many complex bioinformatics problems. However, their performance is limited by the computation power. Based on the Apache Hadoop design framework, this study proposes a two layer architecture that exploits the inherent parallelism of GA-SVM classification to speed up the work. The performance evaluations on an mRNA benchmark cancer dataset have reduced 86.55% features and raised accuracy from 97.53% to 98.03%. With a user-friendly web interface, the system provides researchers an easy way to investigate the unrevealed secrets in the fast-growing repository of bioinformatics data.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Humans
  • Models, Theoretical
  • RNA, Messenger / analysis*
  • RNA, Messenger / genetics
  • Time Factors

Substances

  • RNA, Messenger