A high performance cloud computing platform for mRNA analysis

Feng-Seng Lin; Chia-Ping Shen; Hsiao-Ya Sung; Yan-Yu Lam; Jeng-Wei Lin; Feipei Lai

doi:10.1109/EMBC.2013.6609799

A high performance cloud computing platform for mRNA analysis

Annu Int Conf IEEE Eng Med Biol Soc. 2013:2013:1510-3. doi: 10.1109/EMBC.2013.6609799.

Authors

Feng-Seng Lin, Chia-Ping Shen, Hsiao-Ya Sung, Yan-Yu Lam, Jeng-Wei Lin, Feipei Lai

PMID: 24109986
DOI: 10.1109/EMBC.2013.6609799

Abstract

Multiclass classification is an important technique to many complex bioinformatics problems. However, their performance is limited by the computation power. Based on the Apache Hadoop design framework, this study proposes a two layer architecture that exploits the inherent parallelism of GA-SVM classification to speed up the work. The performance evaluations on an mRNA benchmark cancer dataset have reduced 86.55% features and raised accuracy from 97.53% to 98.03%. With a user-friendly web interface, the system provides researchers an easy way to investigate the unrevealed secrets in the fast-growing repository of bioinformatics data.

MeSH terms

Algorithms
Computational Biology / methods*
Humans
Models, Theoretical
RNA, Messenger / analysis*
RNA, Messenger / genetics
Time Factors

Substances

RNA, Messenger